Changelog¶
0.15 (2023-11-06)¶
Adding Actor Critic Causal Attention Policy
Adding Cosine Annealing LR Scheduler
Adding gradient and weight NAN and INF check
Adding a debug flag when detecting NAN and INF
0.14 (2023-10-15)¶
Adding action rescaling: rescale actions
Adding train_reward to metrics dictionary
Adding version number in __init__.py
Adding device configuration: facilitating both CPU and GPU configuration.
Adding gradient clipping: introduced gradient clipping with clip grad norm to prevent updates that are too aggressive.
0.13 (2023-09-19)¶
Creating a ppo agent training on continuous action space
Adding two environments: Pendulum-v1 and MountainCarContinuous-v0
Creating two new environments: PointMass1D-v0 and PointMass2D-v0
Creating test cases: test_gae.py, test_returns_and_advantages.py, test_normalizer.py, test_random_utils.py , test_reward_scaler.py
0.11 (2023-07-03)¶
Creating a ppo agent training on discrete action space
Adding one environment: CartPole-v1
0.1 (2023-07-01)¶
Initial release to PyPI