Constructs a log prob parts function that all-reduces over terms.
Given a log_prob_parts function, this function will return a new one that
includes all-reduce sums over terms according to the
is_sharded property. It
will also add all-reduce sums for the gradient of sharded terms w.r.t.
a callable that takes in a structured value and returns a
structure of log densities for each of the terms, that when summed returns
a locally correct log-density.
a structure of values that matches the input and output of
log_prob_parts_fn. Each value in
axis_names is either
None, a string
name of a mapped axis in the JAX backend or any non-None
value in TF
backend, or an iterable thereof corresponding to multiple sharding axes.
, the returned function will add
all-reduce sum(s) for its term in the log prob calculation. If it isNone`, the returned function will have an all-reduce sum over the
gradient of sharded terms w.r.t. to the unsharded value.
A new log prob parts function that can be run inside of a strategy.