Changelog

0.15 (2023-11-06)

  • Adding Actor Critic Causal Attention Policy

  • Adding Cosine Annealing LR Scheduler

  • Adding NaN and Inf checks for gradients and weights

  • Adding a debug flag when detecting NaN and Inf

0.14 (2023-10-15)

  • Adding action rescaling

  • Adding train_reward to metrics dictionary

  • Adding version number in __init__.py

  • Adding device configuration: supports both CPU and GPU.

  • Adding gradient clipping: gradients are clipped by norm (clip grad norm) to prevent updates that are too aggressive.
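
    The gradient clipping entry above refers to clipping by norm. As a rough illustration only, the idea can be sketched in plain Python. The helper name clip_grad_norm and the flat list of gradient values are assumptions made for this sketch; it is not the package's own code, which presumably relies on its deep learning framework's utility.

```python
import math


def clip_grad_norm(grads, max_norm):
    """Rescale grads in place so their global L2 norm is at most max_norm.

    Hypothetical helper for illustration; returns the norm before clipping.
    """
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm > max_norm:
        # Shrink every component by the same factor so the direction is kept.
        scale = max_norm / total_norm
        for i in range(len(grads)):
            grads[i] *= scale
    return total_norm


grads = [3.0, 4.0]
norm_before = clip_grad_norm(grads, max_norm=1.0)
```

    Gradients whose norm is already within max_norm are left unchanged, so the clip only affects unusually large updates.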

0.13 (2023-09-19)

  • Creating a PPO agent that trains on continuous action spaces

  • Adding two environments: Pendulum-v1 and MountainCarContinuous-v0

  • Creating two new environments: PointMass1D-v0 and PointMass2D-v0

  • Creating test cases: test_gae.py, test_returns_and_advantages.py, test_normalizer.py, test_random_utils.py, test_reward_scaler.py

0.11 (2023-07-03)

  • Creating a PPO agent that trains on discrete action spaces

  • Adding one environment: CartPole-v1

0.1 (2023-07-01)

  • Initial release to PyPI