tf_agents.trajectories.TimeStep
Returned with every call to step and reset on an environment.
tf_agents.trajectories.TimeStep(
step_type, reward, discount, observation
)
A TimeStep contains the data emitted by an environment at each step of interaction. A TimeStep holds a step_type, an observation (typically a NumPy array or a dict or list of arrays), and an associated reward and discount.
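As a rough illustration of the structure described above, the sketch below models a TimeStep as a plain named tuple with the same four fields. It is a standard-library stand-in, not the TF-Agents class: SketchTimeStep and the observation keys are hypothetical, and plain Python values take the place of tensors or NumPy arrays.

```python
from typing import Any, NamedTuple


class SketchTimeStep(NamedTuple):
    """Hypothetical stand-in mirroring the four TimeStep fields."""
    step_type: int      # integer code for the step type (1 is used for MID in this sketch)
    reward: float
    discount: float     # expected to lie in [0, 1]
    observation: Any    # a list, or a nested dict/list/tuple of lists


# A mid-episode step with a nested (dict) observation.
step = SketchTimeStep(
    step_type=1,
    reward=0.5,
    discount=1.0,
    observation={"position": [0.0, 1.0], "velocity": [0.1, -0.2]},
)

# Fields are accessed by name, in the same order as the constructor arguments.
print(step.reward, step.discount, sorted(step.observation))
```

The field order matches the constructor signature shown above (step_type, reward, discount, observation), so positional and keyword construction are interchangeable in this sketch.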
The step_type of the first TimeStep in a sequence equals StepType.FIRST. The step_type of the final TimeStep equals StepType.LAST. All other TimeSteps in a sequence have a step_type equal to StepType.MID.
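The FIRST / MID / LAST pattern can be sketched for a single episode as follows. This is a minimal illustration under stated assumptions: the StepType enum here is a hypothetical standard-library stand-in rather than the TF-Agents definition, step_types_for_episode is an invented helper, and the sketch assumes an episode has at least two time steps.

```python
from enum import IntEnum
from typing import List


class StepType(IntEnum):
    """Hypothetical stand-in for the three step-type values."""
    FIRST = 0
    MID = 1
    LAST = 2


def step_types_for_episode(num_steps: int) -> List[StepType]:
    """Returns the step_type sequence for an episode of `num_steps` time steps."""
    if num_steps < 2:
        # Assumption of this sketch: an episode has a distinct FIRST and LAST step.
        raise ValueError("This sketch assumes at least a FIRST and a LAST step.")
    return [StepType.FIRST] + [StepType.MID] * (num_steps - 2) + [StepType.LAST]


episode = step_types_for_episode(5)
print([t.name for t in episode])  # ['FIRST', 'MID', 'MID', 'MID', 'LAST']
```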
Attributes
step_type | A Tensor or array of StepType enum values.
reward | A Tensor or array of reward values.
discount | A discount value in the range [0, 1].
observation | A NumPy array, or a nested dict, list or tuple of arrays.
Methods
is_first
is_first() -> tf_agents.typing.types.Bool
is_last
is_last() -> tf_agents.typing.types.Bool
is_mid
is_mid() -> tf_agents.typing.types.Bool
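The three predicates are typically used to detect episode boundaries while iterating over time steps. The sketch below shows that pattern on a hypothetical stand-in class (SketchTimeStep, with assumed integer step-type codes); it is not the TF-Agents implementation. It covers only scalar step types and returns plain Python booleans, whereas the documented return type tf_agents.typing.types.Bool suggests the library's results may be array- or tensor-valued when step_type is batched.

```python
from typing import Any, NamedTuple

FIRST, MID, LAST = 0, 1, 2  # step-type codes assumed for this sketch


class SketchTimeStep(NamedTuple):
    """Hypothetical stand-in with scalar-only versions of the three predicates."""
    step_type: int
    reward: float
    discount: float
    observation: Any

    def is_first(self) -> bool:
        return self.step_type == FIRST

    def is_mid(self) -> bool:
        return self.step_type == MID

    def is_last(self) -> bool:
        return self.step_type == LAST


# A three-step episode with made-up rewards and observations.
episode = [
    SketchTimeStep(FIRST, 0.0, 1.0, [0.0]),
    SketchTimeStep(MID, 1.0, 1.0, [0.5]),
    SketchTimeStep(LAST, 2.0, 0.0, [1.0]),
]

total_reward = 0.0
for step in episode:
    if step.is_first():
        total_reward = 0.0          # reset the accumulator at an episode start
    total_reward += step.reward
    if step.is_last():
        print("episode return:", total_reward)  # episode return: 3.0
```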
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-04-26 UTC.