Module: tf_agents.bandits.policies.mixture_policy

View source on GitHub

A policy class that chooses from a set of policies to get the actions from.

This mixture policy takes a list of policies and will randomly choose one of them for every observation. The distribution is defined by the mixture_distribution.

Classes

class MixturePolicy: A policy that chooses from a set of policies to decide the action.

Other Members

  • MIXTURE_AGENT_ID = 'mixture_agent_id'
  • SUBPOLICY_INFO = 'subpolicy_info'