Module: tf_agents.bandits.environments.non_stationary_stochastic_environment

Bandit environment that returns random observations and rewards.

Classes

class EnvironmentDynamics: Abstract class representing the dynamics of a non-stationary environment.

class NonStationaryStochasticEnvironment: Implements a general non-stationary environment.
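
The split between a dynamics object and the environment that consumes it can be illustrated with a small, library-free sketch. It is not the TF-Agents API. Every name below (`SketchDynamics`, `DriftingDynamics`, `SketchEnvironment`, `switch_at`) is hypothetical, and plain Python floats stand in for tensors and specs. The sketch assumes the dynamics produce observations and per-action rewards as a function of a time step, and that the environment owns the clock.

```python
import abc
import random


class SketchDynamics(abc.ABC):
    """Hypothetical stand-in for a non-stationary dynamics interface."""

    @abc.abstractmethod
    def observation(self, t):
        """Returns the observation for time step t."""

    @abc.abstractmethod
    def reward(self, observation, t):
        """Returns one reward per action for this observation at time t."""


class DriftingDynamics(SketchDynamics):
    """Two-armed example whose better arm switches at step `switch_at`."""

    def __init__(self, switch_at=5, seed=0):
        self._switch_at = switch_at
        self._rng = random.Random(seed)

    def observation(self, t):
        # Random scalar context in [0, 1); it does not depend on t here.
        return self._rng.random()

    def reward(self, observation, t):
        # The mean rewards depend on t, which makes the problem non-stationary.
        means = (1.0, 0.0) if t < self._switch_at else (0.0, 1.0)
        return [m + 0.1 * observation for m in means]


class SketchEnvironment:
    """Hypothetical environment that advances time and queries the dynamics."""

    def __init__(self, dynamics):
        self._dynamics = dynamics
        self._t = 0
        self._observation = None

    def reset(self):
        # Draw the first observation at the current time step.
        self._observation = self._dynamics.observation(self._t)
        return self._observation

    def step(self, action):
        # Reward for the chosen action under the dynamics at the current time.
        rewards = self._dynamics.reward(self._observation, self._t)
        self._t += 1
        self._observation = self._dynamics.observation(self._t)
        return rewards[action], self._observation
```

With `switch_at=2`, pulling arm 0 pays roughly 1.0 for the first two steps and roughly 0.0 afterwards, so an agent has to keep exploring to notice the change. For the actual constructor arguments and abstract members, consult the class pages for `EnvironmentDynamics` and `NonStationaryStochasticEnvironment`.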