tf_agents.bandits.policies.policy_utilities.PerArmPolicyInfo

View source on GitHub

PerArmPolicyInfo(log_probability, predicted_rewards_mean, predicted_rewards_sampled, bandit_policy_type, chosen_arm_features)

log_probability

predicted_rewards_mean

predicted_rewards_sampled

bandit_policy_type

chosen_arm_features