tf_agents.trajectories.time_step.termination

Returns a TimeStep with step_type set to StepType.LAST.

tf_agents.trajectories.time_step.termination(
    observation, reward
)

Args:

  • observation: A NumPy array, tensor, or a nested dict, list or tuple of arrays or tensors.
  • reward: A scalar, a 1D NumPy array, or a tensor.

Returns:

A TimeStep.

Raises:

  • ValueError: If observations are tensors but reward's statically known rank is not 0 or 1.