tf_agents.eval.metric_utils.compute

Computes the given metrics by running the policy in the environment for a number of episodes.

Args:
  metrics: List of metrics to compute.
  environment: A py_environment instance.
  policy: A py_policy instance used to step the environment. A tf_policy can also be used when running in eager mode.
  num_episodes: Number of episodes to compute the metrics over.

Returns:
  A dictionary of results of the form {metric_name: metric_value}.
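
The contract described above can be illustrated with a minimal, library-free sketch. It does not call TF-Agents and is not the library's implementation: FixedLengthEnv, AverageReturn, AverageEpisodeLength, the observe() method, and compute_sketch are all hypothetical stand-ins, and the assumptions that metrics are reset first and that num_episodes defaults to 1 belong to this sketch only. It shows the general shape: run the policy for num_episodes episodes, feed every step to each metric, and return a {metric_name: metric_value} dictionary.

```python
from collections import OrderedDict


class FixedLengthEnv:
    """Hypothetical toy environment: every episode lasts 3 steps, reward 1.0 per step."""

    def reset(self):
        self._t = 0
        return self._t  # initial observation

    def step(self, action):
        self._t += 1
        done = self._t >= 3
        return self._t, 1.0, done  # (observation, reward, done)


class AverageReturn:
    """Hypothetical metric: mean of the per-episode reward sums."""
    name = "AverageReturn"

    def reset(self):
        self._totals, self._current = [], 0.0

    def observe(self, reward, done):
        self._current += reward
        if done:
            self._totals.append(self._current)
            self._current = 0.0

    def result(self):
        return sum(self._totals) / len(self._totals) if self._totals else 0.0


class AverageEpisodeLength(AverageReturn):
    """Hypothetical metric: mean number of steps per episode."""
    name = "AverageEpisodeLength"

    def observe(self, reward, done):
        # Count one unit per step instead of accumulating the reward.
        super().observe(1.0, done)


def compute_sketch(metrics, environment, policy, num_episodes=1):
    """Sketch only: same argument order as the documented function."""
    for metric in metrics:
        metric.reset()  # assumption: start from a clean state
    for _ in range(num_episodes):
        observation, done = environment.reset(), False
        while not done:
            observation, reward, done = environment.step(policy(observation))
            for metric in metrics:
                metric.observe(reward, done)
    return OrderedDict((m.name, m.result()) for m in metrics)


results = compute_sketch(
    [AverageReturn(), AverageEpisodeLength()],
    FixedLengthEnv(),
    policy=lambda observation: 0,  # trivial policy: always take action 0
    num_episodes=5,
)
print(dict(results))
```

With the real library, the metrics, environment, and policy arguments would be TF-Agents objects rather than these toy classes; only the argument order and the shape of the returned dictionary carry over from the sketch.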