Jax ppo test
PPOLightningModule #
Bases: LightningModule
Uses the same code as JaxRLExample
, but the training loop is run with pytorch-lightning.
This is currently only meant to be used to compare the difference fully-jitted training loop and lightning.
RlThroughputCallback #
Bases: MeasureSamplesPerSecondCallback
A callback to measure the throughput of RL algorithms.