tf_agents.trajectories.time_step.truncation

Returns a TimeStep with step_type set to StepType.LAST.

If discount is a scalar, and observation contains Tensors, then discount will be broadcasted to match the outer dimensions.

observation A NumPy array, tensor, or a nested dict, list or tuple of arrays or tensors.
reward A NumPy array, tensor, or a nested dict, list or tuple of arrays or tensors.
discount (optional) A scalar, or 1D NumPy array, or tensor.
outer_dims (optional) If provided, it will be used to determine the batch dimensions.

A TimeStep.