tfp.substrates.jax.distributions.JointDistributionNamedAutoBatched

Joint distribution parameterized by named distribution-making functions.

Inherits From: JointDistribution, Distribution

Main aliases

tfp.experimental.substrates.jax.distributions.JointDistributionNamedAutoBatched

tfp.substrates.jax.distributions.JointDistributionNamedAutoBatched(
    model,
    batch_ndims=0,
    use_vectorized_map=True,
    validate_args=False,
    experimental_use_kahan_sum=False,
    name=None
)

This class provides automatic vectorization and alternative semantics for tfd.JointDistributionNamed, which in many cases allows for simplifications in the model specification.

#### Automatic vectorization

Auto-vectorized variants of JointDistribution allow the user to avoid explicitly annotating a model's vectorization semantics. When using manually-vectorized joint distributions, each operation in the model must account for the possibility of batch dimensions in Distributions and their samples. By contrast, auto-vectorized models need only describe a single sample from the joint distribution; any batch evaluation is automated using tf.vectorized_map as required. In many cases this allows for significant simplifications. For example, the following manually-vectorized tfd.JointDistributionNamed model:

  model = tfd.JointDistributionNamed({
    'x': tfd.Normal(0., tf.ones([3])),
    'y': tfd.Normal(0., 1.),
    'z': lambda x, y: tfd.Normal(x[..., :2] + y[..., tf.newaxis], 1.)
  })

can be written in auto-vectorized form as

  model = tfd.JointDistributionNamedAutoBatched({
    'x': tfd.Normal(0., tf.ones([3])),
    'y': tfd.Normal(0., 1.),
    'z': lambda x, y: tfd.Normal(x[:2] + y, 1.)
  })

in which we were able to avoid explicitly accounting for batch dimensions when indexing and slicing computed quantities in the third line.
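The difference between the two `'z'` lambdas comes down to shape arithmetic, which the following NumPy sketch tries to make concrete. It involves no distributions and does not call TFP; `loc_single` and `loc_batched` are hypothetical names, and the explicit Python loop is only a stand-in for the automatic vectorization described above.

```python
import numpy as np

def loc_single(x, y):
    # Written for one draw: x has shape (3,), y is a scalar.
    return x[:2] + y

def loc_batched(x, y):
    # Written for any leading batch dimensions:
    # x has shape (..., 3), y has shape (...).
    return x[..., :2] + y[..., np.newaxis]

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 3))  # five draws of 'x'
y = rng.normal(size=(5,))    # five draws of 'y'

# Manually vectorized code handles the whole batch at once.
batched = loc_batched(x, y)

# Single-draw code gives the same answer when mapped over the batch.
looped = np.stack([loc_single(xi, yi) for xi, yi in zip(x, y)])

# Applied directly to the batch, the single-draw code slices the wrong
# axis: x[:2] has shape (2, 3), which cannot broadcast against (5,).
try:
    loc_single(x, y)
    single_draw_code_failed = False
except ValueError:
    single_draw_code_failed = True
```

Here `batched` and `looped` agree, while calling the single-draw function on the whole batch fails. This is the bookkeeping that the auto-vectorized form lets the model author skip.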

#### Alternative batch semantics

This class also provides alternative semantics for specifying a batch of independent (non-identical) joint distributions.

Instead of simply summing the log_probs of component distributions (which may have different shapes), it first reduces the component log_probs to ensure that jd.log_prob(jd.sample()) always returns a scalar, unless batch_ndims is explicitly set to a nonzero value (in which case the result will have the corresponding tensor rank).

The essential changes are:

- An event of JointDistributionNamedAutoBatched is the dictionary of tensors produced by .sample(); thus, the event_shape is the dictionary containing the shapes of sampled tensors. These combine both the event and batch dimensions of the component distributions. By contrast, the event shape of a base JointDistribution does not include batch dimensions of component distributions.
- The batch_shape is a global property of the entire model, rather than a per-component property as in base JointDistributions. The global batch shape must be a prefix of the batch shapes of each component; the length of this prefix is specified by an optional argument batch_ndims. If batch_ndims is not specified, the model has batch shape [].
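As a rough illustration of the reduction described above, the NumPy sketch below sums each component's log-density over every axis to the right of the first `batch_ndims` axes and then adds the components. It is not the library's implementation: `normal_log_prob` and `reduce_parts` are hypothetical helpers, sample dimensions are ignored, and the component shapes are borrowed from the `x`, `y`, `z` model shown earlier.

```python
import numpy as np

def normal_log_prob(value, loc, scale):
    # Elementwise log-density of Normal(loc, scale).
    z = (value - loc) / scale
    return -0.5 * z**2 - np.log(scale) - 0.5 * np.log(2.0 * np.pi)

def reduce_parts(parts, batch_ndims=0):
    # Hypothetical helper: in each component, sum all axes to the right
    # of the first `batch_ndims` axes, then add the components together.
    total = 0.0
    for lp in parts.values():
        lp = np.asarray(lp)
        extra_axes = tuple(range(batch_ndims, lp.ndim))
        total = total + lp.sum(axis=extra_axes)
    return total

# batch_ndims=0: component log-probs of shapes (3,), () and (2,)
# collapse to a single scalar.
parts = {
    'x': normal_log_prob(np.zeros(3), 0.0, np.ones(3)),
    'y': normal_log_prob(0.0, 0.0, 1.0),
    'z': normal_log_prob(np.zeros(2), np.zeros(2), 1.0),
}
scalar_lp = reduce_parts(parts)

# batch_ndims=1: every component carries a leading batch axis of size 4,
# and only that shared prefix survives the reduction.
batched_parts = {
    'x': normal_log_prob(np.zeros((4, 3)), 0.0, 1.0),
    'y': normal_log_prob(np.zeros(4), 0.0, 1.0),
    'z': normal_log_prob(np.zeros((4, 2)), 0.0, 1.0),
}
batched_lp = reduce_parts(batched_parts, batch_ndims=1)
```

With `batch_ndims=0` the result has shape `()`; with `batch_ndims=1` it has shape `(4,)`, matching the rule that the global batch shape is a prefix of each component's batch shape.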

Examples

Consider the following generative model:
```
e ~ Exponential(rate=[100,120])
g ~ Gamma(concentration=e[0], rate=e[1])
n ~ Normal(loc=0, scale=2.)
m ~ Normal(loc=n, scale=g)
for i = 1, ..., 12:
  x[i] ~ Bernoulli(logits=m)
```
We can code this as:
```
tfd = tfp.distributions
joint = tfd.JointDistributionNamedAutoBatched(dict(
    e=             tfd.Exponential(rate=[100, 120]),
    g=lambda    e: tfd.Gamma(concentration=e[0], rate=e[1]),
    n=             tfd.Normal(loc=0, scale=2.),
    m=lambda n, g: tfd.Normal(loc=n, scale=g),
    x=lambda    m: tfd.Sample(tfd.Bernoulli(logits=m), 12),
))
```
Notice the 1:1 correspondence between "math" and "code". In a standard JointDistributionNamed, we would have wrapped the first variable as e = tfd.Independent(tfd.Exponential(rate=[100, 120]), reinterpreted_batch_ndims=1) to specify that log_prob of the Exponential should be a scalar, summing over both dimensions. We would also have had to extend indices as tfd.Gamma(concentration=e[..., 0], rate=e[..., 1]) to account for possible batch dimensions. Both of these behaviors are implicit in JointDistributionNamedAutoBatched.
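To make the shapes of a single event concrete, here is a plain NumPy sketch of one ancestral draw from the generative model above. It does not use TFP; `sample_once` is a hypothetical name, and the only assumption beyond the model itself is NumPy's parameterization, which takes a scale (the reciprocal of a rate) for the exponential and gamma draws.

```python
import numpy as np

def sample_once(rng):
    # One ancestral draw, visiting variables in dependency order.
    rates = np.array([100.0, 120.0])
    e = rng.exponential(scale=1.0 / rates)        # shape (2,)
    g = rng.gamma(shape=e[0], scale=1.0 / e[1])   # scalar; scale = 1 / rate
    n = rng.normal(loc=0.0, scale=2.0)            # scalar
    m = rng.normal(loc=n, scale=g)                # scalar
    p = 1.0 / (1.0 + np.exp(-m))                  # logits -> probability
    x = rng.binomial(n=1, p=p, size=12)           # shape (12,)
    return dict(e=e, g=g, n=n, m=m, x=x)

draw = sample_once(np.random.default_rng(0))
shapes = {name: np.shape(value) for name, value in draw.items()}
```

The resulting `shapes` dictionary has one entry per variable, with `e` of shape `(2,)`, `x` of shape `(12,)`, and the rest scalar. This mirrors the dictionary-valued event shape described under alternative batch semantics.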

If every element of model is a CompositeTensor or a callable, the resulting JointDistributionNamedAutoBatched is a CompositeTensor. Otherwise, a non-CompositeTensor _JointDistributionNamedAutoBatched instance is created.

Args
`model`	Python `dict`, `collections.OrderedDict`, or `namedtuple` of distribution-making functions each with required args corresponding only to other keys.
`batch_ndims`	`int` `Tensor` number of batch dimensions. The `batch_shape`s of all component distributions must be such that the prefixes of length `batch_ndims` broadcast to a consistent joint batch shape. Default value: `0`.
`use_vectorized_map`	Python `bool`. Whether to use `tf.vectorized_map` to automatically vectorize evaluation of the model. This allows the model specification to focus on drawing a single sample, which is often simpler, but some ops may not be supported. Default value: `True`.
`validate_args`	Python `bool`. Whether to validate input with asserts. If `validate_args` is `False`, and the inputs are invalid, correct behavior is not guaranteed. Default value: `False`.
`experimental_use_kahan_sum`	Python `bool`. When `True`, we use Kahan summation to aggregate independent underlying log_prob values, which improves on the precision of a naive float32 sum. The difference can be noticeable for large dimensions in float32. See the CPU caveat on `tfp.math.reduce_kahan_sum`. Default value: `False`.
`name`	The name for ops managed by the distribution. Default value: `None` (i.e., `JointDistributionNamed`).
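For readers unfamiliar with the technique behind `experimental_use_kahan_sum`, the following is a minimal sketch of textbook Kahan (compensated) summation in float32, compared against a plain left-to-right float32 sum. It is written in NumPy for illustration and is not the code behind `tfp.math.reduce_kahan_sum`; `naive_sum` and `kahan_sum` are hypothetical names.

```python
import numpy as np

def naive_sum(values):
    # Plain left-to-right accumulation in float32.
    total = np.float32(0.0)
    for v in values:
        total = total + v
    return total

def kahan_sum(values):
    # Compensated summation: `comp` tracks the low-order bits that were
    # lost when the previous term was added to `total`.
    total = np.float32(0.0)
    comp = np.float32(0.0)
    for v in values:
        y = v - comp
        t = total + y
        comp = (t - total) - y
        total = t
    return total

values = np.full(100_000, 0.1, dtype=np.float32)

# Float64 accumulation of the same float32 inputs serves as the reference.
reference = float(np.sum(values, dtype=np.float64))
naive_err = abs(float(naive_sum(values)) - reference)
kahan_err = abs(float(kahan_sum(values)) - reference)
```

Comparing `naive_err` with `kahan_err` shows how much rounding error the compensation term recovers when many float32 terms are accumulated, which is the situation a joint log_prob over many components can create.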

Attributes
`allow_nan_stats`	Python `bool` describing behavior when a stat is undefined. Stats return +/- infinity when it makes sense. E.g., the variance of a Cauchy distribution is infinity. However, sometimes the statistic is undefined, e.g., if a distribution's pdf does not achieve a maximum within the support of the distribution, the mode is undefined. If the mean is undefined, then by definition the variance is undefined. E.g. the mean for Student's T for df = 1 is undefined (no clear way to say it is either + or - infinity), so the variance = E[(X - mean)**2] is also undefined.
`batch_ndims`
`batch_shape`	Shape of a single sample from a single event index as a `TensorShape`. May be partially defined or unknown. The batch dimensions are indexes into independent, non-identical parameterizations of this distribution.
`dtype`	The `DType` of `Tensor`s handled by this `Distribution`.
`event_shape`	Shape of a single sample from a single batch as a `TensorShape`. May be partially defined or unknown.
`experimental_shard_axis_names`	Indicates whether part distributions have active shard axis names.
`model`
`name`	Name prepended to all ops created by this `Distribution`.
`parameters`	Dictionary of parameters used to instantiate this `Distribution`.
`reparameterization_type`	Describes how samples from the distribution are reparameterized. Currently this is one of the static instances `tfd.FULLY_REPARAMETERIZED` or `tfd.NOT_REPARAMETERIZED`.
`trainable_variables`
`use_vectorized_map`
`validate_args`	Python `bool` indicating possibly expensive checks are enabled.
`variables`

Args
`value`	`float` or `double` `Tensor`.
`name`	Python `str` prepended to names of ops created by this function.
`**kwargs`	Named arguments forwarded to subclass implementation.

Args
`other`	`tfp.distributions.Distribution` instance.
`name`	Python `str` prepended to names of ops created by this function.

`experimental_default_event_space_bijector`

Args
`*args`	Passed to implementation `_default_event_space_bijector`.
`**kwargs`	Passed to implementation `_default_event_space_bijector`.

`experimental_fit`

Args
`value`	a `Tensor` valid sample from this distribution family.
`sample_ndims`	Positive `int` Tensor number of leftmost dimensions of `value` that index i.i.d. samples. Default value: `1`.
`validate_args`	Python `bool`, default `False`. When `True`, distribution parameters are checked for validity despite possibly degrading runtime performance. When `False`, invalid inputs may silently render incorrect outputs. Default value: `False`.
`**init_kwargs`	Additional keyword arguments passed through to `cls.__init__`. These take precedence in case of collision with the fitted parameters; for example, `tfd.Normal.experimental_fit([1., 1.], scale=20.)` returns a Normal distribution with `scale=20.` rather than the maximum likelihood parameter `scale=0.`.

`experimental_local_measure`

Args
`value`	`float` or `double` `Tensor`.
`backward_compat`	`bool` specifying whether to fall back to returning `FullSpace` as the tangent space, and representing R^n with the standard basis.
`**kwargs`	Named arguments forwarded to subclass implementation.

Returns
`log_prob`	a `Tensor` representing the log probability density, of shape `sample_shape(x) + self.batch_shape` with values of type `self.dtype`.
`tangent_space`	a `TangentSpace` object (by default `FullSpace`) representing the tangent space to the manifold at `value`.

Args
`*args`	Positional arguments: a value structure or component values (see above).
`**kwargs`	Keyword arguments: a value structure or component values (see above). May also include `name`, specifying a Python string name for ops generated by this method.

`experimental_sample_and_log_prob`

Args
`sample_shape`	integer `Tensor` desired shape of samples to draw. Default value: `()`.
`seed`	PRNG seed; see `tfp.random.sanitize_seed` for details. Default value: `None`.
`name`	name to give to the op. Default value: `'sample_and_log_prob'`.
`**kwargs`	Named arguments forwarded to subclass implementation.

Returns
`samples`	a `Tensor`, or structure of `Tensor`s, with prepended dimensions `sample_shape`.
`log_prob`	a `Tensor` of shape `sample_shape(x) + self.batch_shape` with values of type `self.dtype`.

Args
`*args`	Positional arguments: a `value` structure or component values (see above).
`**kwargs`	Keyword arguments: a `value` structure or component values (see above). May also include `name`, specifying a Python string name for ops generated by this method.

Args
`sample_shape`	`Tensor` or python list/tuple. Desired shape of a call to `sample()`.
`name`	name to prepend ops with.

`parameter_properties`

Args
`dtype`	Optional float `dtype` to assume for continuous-valued parameters. Some constraining bijectors require advance knowledge of the dtype because certain constants (e.g., `tfb.Softplus.low`) must be instantiated with the same dtype as the values to be transformed.
`num_classes`	Optional `int` `Tensor` number of classes to assume when inferring the shape of parameters for categorical-like distributions. Otherwise ignored.

`resolve_graph`

Args
`distribution_names`	`list` of `str` or `None` names corresponding to each of `model` elements. (`None`s are expanded into the appropriate `str`.)
`leaf_name`	`str` used when no maker depends on a particular `model` element.

Args
`sample_shape`	0D or 1D `int32` `Tensor`. Shape of the generated samples.
`seed`	PRNG seed; see `tfp.random.sanitize_seed` for details.
`name`	name to give to the op.
`**kwargs`	Named arguments forwarded to subclass implementation.

Returns
`distributions`	a `tuple` of `Distribution` instances for each of `distribution_fn`.
`samples`	a `tuple` of `Tensor`s with prepended dimensions `sample_shape` for each of `distribution_fn`.

Methods

`batch_shape_tensor`
`cdf`
`copy`
`covariance`
`cross_entropy`
`entropy`
`event_shape_tensor`
`experimental_default_event_space_bijector`
`experimental_fit`
`experimental_local_measure`
`experimental_pin`
`experimental_sample_and_log_prob`
`is_scalar_batch`
`is_scalar_event`
`kl_divergence`
`log_cdf`
`log_prob`
`log_prob_parts`
`log_survival_function`
`mean`
`mode`
`param_shapes`
`param_static_shapes`
`parameter_properties`
`prob`
`prob_parts`
`quantile`
`resolve_graph`
`sample`
`sample_distributions`
`stddev`
`survival_function`
`unnormalized_log_prob`
`unnormalized_log_prob_parts`
`unnormalized_prob_parts`
`variance`
`__getitem__`
`__iter__`
