Changelog

0.15 (2023-11-06)

Adding action rescaling
Adding train_reward to the metrics dictionary
Adding the version number to __init__.py
Adding device configuration: supports running on either CPU or GPU
Adding gradient clipping: gradients are clipped by norm (clip grad norm) to prevent overly aggressive updates
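
For readers unfamiliar with the two numerical changes above, here is a minimal pure-Python sketch of what gradient-norm clipping and action rescaling typically do. It is not taken from this project's code: the function names `clip_grad_norm` and `rescale_action` are hypothetical, and the assumption that the policy outputs actions in [-1, 1] is ours. In PyTorch, norm clipping is normally done with `torch.nn.utils.clip_grad_norm_`, which is presumably what "clip grad norm" refers to.

```python
import math


def clip_grad_norm(grads, max_norm):
    """Scale a flat list of gradients in place so that their global
    L2 norm is at most max_norm. Returns the norm before clipping."""
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm > max_norm:
        scale = max_norm / total_norm
        for i in range(len(grads)):
            grads[i] *= scale
    return total_norm


def rescale_action(action, low, high):
    """Map an action from [-1, 1] (assumed policy output range)
    to the environment's [low, high] range, clamping first."""
    action = max(-1.0, min(1.0, action))
    return low + 0.5 * (action + 1.0) * (high - low)


# Example: a gradient of norm 5 is scaled down to norm 1.
grads = [3.0, 4.0]
norm_before = clip_grad_norm(grads, max_norm=1.0)

# Example: Pendulum-v1 accepts torques in [-2, 2].
torque = rescale_action(0.5, low=-2.0, high=2.0)
```

Clipping by norm keeps the direction of the update and only shortens it, which is why it is a common choice for limiting step size in policy-gradient training.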

Creating a PPO agent that trains on continuous action spaces
Adding two environments: Pendulum-v1 and MountainCarContinuous-v0
Creating two new environments: PointMass1D-v0 and PointMass2D-v0
Creating test cases: test_gae.py, test_returns_and_advantages.py, test_normalizer.py, test_random_utils.py, test_reward_scaler.py