![]() |
PolicyInfo(log_probability, predicted_rewards_mean, predicted_rewards_optimistic, predicted_rewards_sampled, bandit_policy_type)
tf_agents.bandits.policies.policy_utilities.PolicyInfo(
log_probability=(), predicted_rewards_mean=(), predicted_rewards_optimistic=(),
predicted_rewards_sampled=(), bandit_policy_type=()
)
Attributes | |
---|---|
log_probability
|
|
predicted_rewards_mean
|
|
predicted_rewards_optimistic
|
|
predicted_rewards_sampled
|
|
bandit_policy_type
|