|View source on GitHub|
A batched replay buffer of nests of Tensors which can be sampled uniformly.
- Each add assumes tensors have batch_size as first dimension, and will store each element of the batch in an offset segment, so that each batch dimension has its own contiguous memory. Within batch segments, behaves as a circular buffer.
The get_next function returns 'ids' in addition to the data. This is not really needed for the batched replay buffer, but is returned to be consistent with the API for a priority replay buffer, which needs the ids to update priorities.
class BufferInfo: BufferInfo(ids, probabilities)
class TFUniformReplayBuffer: A TFUniformReplayBuffer with batched adds and uniform sampling.