nanoPPO ========= |PyPI| |Changelog| |Python 3.x| |Tests| |License| .. |PyPI| image:: https://img.shields.io/pypi/v/nanoPPO.svg :target: https://pypi.org/project/nanoPPO/ .. |Changelog| image:: https://img.shields.io/github/v/release/jamesliu/nanoPPO?label=changelog :target: https://readthedocs.org/en/stable/changelog.html .. |Python 3.x| image:: https://img.shields.io/pypi/pyversions/nanoPPO.svg?logo=python&logoColor=white :target: https://pypi.org/project/nanoPPO/ .. |Tests| image:: https://github.com/jamesliu/nanoPPO/workflows/Test/badge.svg :target: https://github.com/jamesliu/nanoPPO/actions?query=workflow%3ATest .. |License| image:: https://img.shields.io/badge/license-Apache%202.0-blue.svg :target: https://github.com/jamesliu/nanoPPO/blob/main/LICENSE =================================== .. toctree:: :maxdepth: 2 :caption: Contents: README.md reward_rescaling contributing changelog