Missed TensorFlow Dev Summit? Check out the video playlist. Watch recordings

Module: tf_agents.bandits.agents.greedy_reward_prediction_agent

View source on GitHub

An agent that uses and trains a greedy reward prediction policy.

Classes

class GreedyRewardPredictionAgent: A neural reward network based bandit agent.