tf_agents.policies.scripted_py_policy.ScriptedPyPolicy

Returns actions from the given configuration.

Inherits From: PyPolicy

tf_agents.policies.scripted_py_policy.ScriptedPyPolicy(
    time_step_spec: tf_agents.trajectories.TimeStep,
    action_spec: tf_agents.typing.types.NestedArraySpec,
    action_script: Sequence[Tuple[int, types.NestedArray]]
)

Used in the notebooks

Used in the tutorials
Policies

Args
`time_step_spec`	A time_step_spec for the policy will interact with.
`action_spec`	An action_spec for the environment the policy will interact with.
`action_script`	A list of 2-tuples of the form (n, nest) where the nest of actions follow the action_spec. Each action will be executed for n steps.

Attributes
`action_spec`	Describes the ArraySpecs of the np.Array returned by `action()`. `action` can be a single np.Array, or a nested dict, list or tuple of np.Array.
`collect_data_spec`	Describes the data collected when using this policy with an environment.
`info_spec`	Describes the Arrays emitted as info by `action()`.
`observation_and_action_constraint_splitter`
`policy_state_spec`	Describes the arrays expected by functions with `policy_state` as input.
`policy_step_spec`	Describes the output of `action()`.
`time_step_spec`	Describes the `TimeStep` np.Arrays expected by `action(time_step)`.
`trajectory_spec`	Describes the data collected when using this policy with an environment.

Methods

`action`

View source

action(
    time_step: tf_agents.trajectories.TimeStep,
    policy_state: tf_agents.typing.types.NestedArray = (),
    seed: Optional[types.Seed] = None
) -> tf_agents.trajectories.PolicyStep

Generates next action given the time_step and policy_state.

Args
`time_step`	A `TimeStep` tuple corresponding to `time_step_spec()`.
`policy_state`	An optional previous policy_state.
`seed`	Seed to use if action uses sampling (optional).

Returns
A PolicyStep named tuple containing: `action`: A nest of action Arrays matching the `action_spec()`. `state`: A nest of policy states to be fed into the next call to action. `info`: Optional side information such as action log probabilities.

`get_initial_state`

View source

get_initial_state(
    batch_size: Optional[int] = None
) -> tf_agents.typing.types.NestedArray

Returns an initial state usable by the policy.

Args
`batch_size`	An optional batch size.

Returns
An initial policy state.

tf_agents.policies.scripted_py_policy.ScriptedPyPolicy Stay organized with collections Save and categorize content based on your preferences.

Used in the notebooks

Args

Attributes

Methods

action

get_initial_state

tf_agents.policies.scripted_py_policy.ScriptedPyPolicy

`action`

`get_initial_state`