Attend the Women in ML Symposium on December 7 Register now


Stay organized with collections Save and categorize content based on your preferences.

Mask boundary trajectories and those with invalid returns and advantages.

batched_traj Trajectory, doubly-batched [batch_dim, time_dim,...]. It must be preprocessed already.

A mask, type tf.float32, that is 0.0 for all between-episode Trajectory (batched_traj.step_type is LAST) and 0.0 if the return value is unavailable.