Utility functions for ppo_agent.py.
Functions
distribution_from_spec(...)
: Convert (spec, new_distribution_params) -> Distribution.
get_distribution_params(...)
: Get the params for an optionally nested action distribution.
get_learning_rate(...)
: Gets the current learning rate from an optimizer so that it can be graphed.
get_metric_observers(...)
: Returns a list of observers, one for each metric.
make_timestep_mask(...)
: Create a mask for transitions and optionally final incomplete episodes.
make_trajectory_mask(...)
: Mask boundary trajectories and those with invalid returns and advantages.
nested_kl_divergence(...)
: Given two nested distributions, sum the KL divergences of the leaves.
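To illustrate the idea behind summing KL divergences over the leaves of a nested structure, here is a minimal standalone sketch in plain Python. It does not call the library: the `Normal` tuple, `gaussian_kl`, and `nested_kl_sketch` are hypothetical names introduced for illustration, and the leaves are assumed to be univariate Gaussians so that the KL divergence has a closed form. The actual `nested_kl_divergence(...)` operates on distribution objects and may differ in its arguments and behavior.

```python
import math
from typing import NamedTuple


class Normal(NamedTuple):
    """Hypothetical stand-in for a leaf distribution (univariate Gaussian)."""
    mean: float
    stddev: float


def gaussian_kl(p: Normal, q: Normal) -> float:
    """Closed-form KL(p || q) between two univariate Gaussians."""
    return (
        math.log(q.stddev / p.stddev)
        + (p.stddev ** 2 + (p.mean - q.mean) ** 2) / (2.0 * q.stddev ** 2)
        - 0.5
    )


def nested_kl_sketch(nest_p, nest_q) -> float:
    """Sum KL(p || q) over matching leaves of two identically shaped nests.

    Nests may be dicts, lists, or tuples whose leaves are Normal instances.
    """
    # Check for a leaf first: Normal is itself a tuple subclass.
    if isinstance(nest_p, Normal):
        return gaussian_kl(nest_p, nest_q)
    if isinstance(nest_p, dict):
        return sum(nested_kl_sketch(nest_p[k], nest_q[k]) for k in nest_p)
    return sum(nested_kl_sketch(a, b) for a, b in zip(nest_p, nest_q))


# Two nested "action distributions" with the same structure.
old = {"steer": Normal(0.0, 1.0), "pedals": [Normal(0.0, 1.0)]}
new = {"steer": Normal(1.0, 1.0), "pedals": [Normal(1.0, 1.0)]}

total_kl = nested_kl_sketch(old, new)  # each leaf contributes 0.5
```

Summing over leaves corresponds to treating the components of the nested action distribution as independent, in which case the KL divergence of the joint distribution equals the sum of the per-component divergences.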
Other Members | |
---|---|
absolute_import | Instance of __future__._Feature |
division | Instance of __future__._Feature |
print_function | Instance of __future__._Feature |