Module: tf_agents.agents.ppo.ppo_utils

View source on GitHub

Utils functions for


get_distribution_params(...): Get the params for an optionally nested action distribution.

get_metric_observers(...): Returns a list of observers, one for each metric.

make_timestep_mask(...): Create a mask for transitions and optionally final incomplete episodes.

nested_kl_divergence(...): Given two nested distributions, sum the KL divergences of the leaves.