tf_agents.trajectories.Trajectory
A tuple that represents a trajectory.
tf_agents.trajectories.Trajectory(
    step_type,
    observation,
    action,
    policy_info,
    next_step_type,
    reward,
    discount
)
A Trajectory represents a sequence of aligned time steps. It captures the
observation and step_type from the current time step, together with the
computed action and policy_info. The discount, reward and next_step_type
come from the next time step.
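The alignment described above can be sketched in plain Python without TF-Agents installed. This is a minimal illustration, not the library's implementation: TimeStep, SketchTrajectory and pair_steps are hypothetical stand-ins, and the integer codes FIRST=0, MID=1, LAST=2 are assumed to mirror the library's StepType values.

```python
from typing import Any, List, NamedTuple

# Integer codes assumed to mirror TF-Agents' StepType (FIRST=0, MID=1, LAST=2).
FIRST, MID, LAST = 0, 1, 2


class TimeStep(NamedTuple):
    """Hypothetical stand-in for what an environment returns at each step."""
    step_type: int
    reward: float
    discount: float
    observation: Any


class SketchTrajectory(NamedTuple):
    """Hypothetical stand-in with the same seven fields as Trajectory."""
    step_type: int
    observation: Any
    action: Any
    policy_info: Any
    next_step_type: int
    reward: float
    discount: float


def pair_steps(time_steps: List[TimeStep], actions: List[Any]) -> List[SketchTrajectory]:
    """Builds one record per (current step, action, next step) triple."""
    records = []
    for current, action, nxt in zip(time_steps, actions, time_steps[1:]):
        records.append(
            SketchTrajectory(
                step_type=current.step_type,      # from the current time step
                observation=current.observation,  # from the current time step
                action=action,                    # computed by the policy
                policy_info=(),                   # no auxiliary info in this sketch
                next_step_type=nxt.step_type,     # from the next time step
                reward=nxt.reward,                # from the next time step
                discount=nxt.discount,            # from the next time step
            )
        )
    return records


episode = [
    TimeStep(FIRST, 0.0, 1.0, "obs0"),
    TimeStep(MID, 1.0, 1.0, "obs1"),
    TimeStep(LAST, 5.0, 0.0, "obs2"),
]
actions = ["a0", "a1"]  # one action per transition
records = pair_steps(episode, actions)
```

Three time steps and two actions give two records. The reward of 5.0 that arrives with the final time step is stored on the record whose observation is "obs1", because that is the step at which the action earning it was taken.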
Attributes
step_type | A StepType.
observation | An array (tensor), or a nested dict, list or tuple of arrays (tensors), that represents the observation.
action | An array (tensor), or a nested dict, list or tuple of actions. This represents the action generated according to the observation.
policy_info | An arbitrary nest that contains auxiliary information related to the action. Note that this does not include the policy/RNN state that was used to generate the action.
next_step_type | The StepType of the next time step.
reward | An array (tensor), or a nested dict, list or tuple of rewards. This represents the rewards and/or constraint satisfiability after performing the action in an environment.
discount | A scalar representing the discount factor to multiply with future rewards.
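To make the role of discount concrete, here is one common way the reward and discount fields of consecutive trajectory steps could be combined into a return. This is a sketch under assumptions: discounted_return is a hypothetical helper, not a TF-Agents function, and agents in the library may apply an additional discount factor of their own.

```python
from typing import List


def discounted_return(rewards: List[float], discounts: List[float]) -> float:
    """Computes r_0 + d_0 * (r_1 + d_1 * (r_2 + ...)) by scanning backwards."""
    total = 0.0
    for reward, discount in zip(reversed(rewards), reversed(discounts)):
        # Each step's discount scales everything that comes after it.
        total = reward + discount * total
    return total


rewards = [1.0, 2.0, 4.0]
discounts = [0.5, 0.5, 0.0]  # a discount of 0.0 cuts off anything past the final step
result = discounted_return(rewards, discounts)
```

Scanning backwards, the running total is 4.0, then 2.0 + 0.5 * 4.0 = 4.0, then 1.0 + 0.5 * 4.0 = 3.0, so result is 3.0.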
Methods
is_boundary
is_boundary() -> tf_agents.typing.types.Bool
is_first
is_first() -> tf_agents.typing.types.Bool
is_last
is_last() -> tf_agents.typing.types.Bool
is_mid
is_mid() -> tf_agents.typing.types.Bool
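This page does not describe what the four predicates check. The sketch below reflects one reading of the TF-Agents source and should be treated as an assumption to verify against the library: is_first and is_boundary look at step_type, is_last looks at next_step_type, and is_mid requires both to be MID. StepPair is a hypothetical stand-in that carries only the two step-type fields, and the integer codes FIRST=0, MID=1, LAST=2 are likewise assumed.

```python
from typing import NamedTuple

# Integer codes assumed to mirror TF-Agents' StepType (FIRST=0, MID=1, LAST=2).
FIRST, MID, LAST = 0, 1, 2


class StepPair(NamedTuple):
    """Hypothetical stand-in holding only the two step-type fields."""
    step_type: int
    next_step_type: int

    def is_first(self) -> bool:
        # Assumed: the current step starts an episode.
        return self.step_type == FIRST

    def is_mid(self) -> bool:
        # Assumed: both the current and the next step are mid-episode.
        return self.step_type == MID and self.next_step_type == MID

    def is_last(self) -> bool:
        # Assumed: the next step ends the episode.
        return self.next_step_type == LAST

    def is_boundary(self) -> bool:
        # Assumed: the current step is itself a final step, so this record
        # spans two episodes.
        return self.step_type == LAST


pairs = [
    StepPair(FIRST, MID),
    StepPair(MID, MID),
    StepPair(MID, LAST),
    StepPair(LAST, FIRST),
]
flags = [(p.is_first(), p.is_mid(), p.is_last(), p.is_boundary()) for p in pairs]
```

Under these assumptions, each of the four pairs satisfies exactly one predicate. A single-step episode (FIRST followed by LAST) would satisfy both is_first and is_last. In the library the return type is tf_agents.typing.types.Bool, so the result may be a tensor or array of booleans rather than a Python bool.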
replace
replace(**kwargs) -> 'Trajectory'
Exposes namedtuple._replace as a public method.
Usage:
new_trajectory = trajectory.replace(policy_info=())
This returns a new trajectory with an empty policy_info.
Args
**kwargs | Key/value pairs of fields in the trajectory.
Returns
A new Trajectory.
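Since replace is described as exposing namedtuple._replace, its copy-on-replace behaviour can be illustrated with the standard library alone. SketchTrajectory below is a hypothetical stand-in with the same field names, and the field values are made up for illustration.

```python
from collections import namedtuple

# Hypothetical stand-in with the same field names as Trajectory.
SketchTrajectory = namedtuple(
    "SketchTrajectory",
    [
        "step_type",
        "observation",
        "action",
        "policy_info",
        "next_step_type",
        "reward",
        "discount",
    ],
)

original = SketchTrajectory(
    step_type=0,
    observation=[0.1, 0.2],
    action=1,
    policy_info={"log_probability": -0.7},
    next_step_type=1,
    reward=1.0,
    discount=1.0,
)

# _replace builds a new tuple; the original is left untouched.
updated = original._replace(policy_info=())
```

Only the named field changes in updated. The other fields are carried over by reference, so a mutable observation is shared between the two tuples rather than copied.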
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-04-26 UTC.