Missed TensorFlow Dev Summit? Check out the video playlist. Watch recordings

Module: tf_agents.bandits.agents.exp3_mixture_agent

View source on GitHub

A mixture agent that updates the mixture distribution based on EXP3.

For a reference on EXP3, see Bandit Algorithms by Tor Lattimore and Csaba Szepesvari (https://tor-lattimore.com/downloads/book/book.pdf).


class Exp3MixtureAgent: An agent that mixes a set of agents and updates the weights with Exp3.

class Exp3MixtureVariableCollection: A collection of variables used by subclasses of MixtureAgent.