.. _changelog:

===========
 Changelog
===========

.. _v0_15:

0.15 (2023-11-06)
-----------------

- Adding Actor Critic Causal Attention Policy
- Adding Cosine Annealing LR Scheduler
- Adding NaN and Inf checks for gradients and weights
- Adding a debug flag that is used when NaN or Inf values are detected

.. _v0_14:

0.14 (2023-10-15)
-----------------

- Adding action rescaling
- Adding train_reward to the metrics dictionary
- Adding the version number in ``__init__.py``
- Adding device configuration: supports both CPU and GPU.
- Adding gradient clipping: clips the gradient norm to prevent overly aggressive updates.

.. _v0_13:

0.13 (2023-09-19)
-----------------

- Creating a PPO agent that trains on continuous action spaces
- Adding two environments: `Pendulum-v1` and `MountainCarContinuous-v0`
- Creating two new environments: `PointMass1D-v0` and `PointMass2D-v0`
- Creating test cases: test_gae.py, test_returns_and_advantages.py, test_normalizer.py, test_random_utils.py, test_reward_scaler.py

.. _v0_11:

0.11 (2023-07-03)
-----------------

- Creating a PPO agent that trains on discrete action spaces
- Adding one environment: `CartPole-v1`

.. _v0_1:

0.1 (2023-07-01)
----------------

- Initial release to PyPI