https://github.com/aleksandarhaber/policy-iteration-algorithm-in-python-with-tests-on-frozen-lake-openai-gym-environment