tf.contrib.distributions.VectorSinhArcsinhDiag

The (diagonal) SinhArcsinh transformation of a distribution on R^k.

Inherits From: TransformedDistribution

tf.contrib.distributions.VectorSinhArcsinhDiag(
    loc=None, scale_diag=None, scale_identity_multiplier=None, skewness=None,
    tailweight=None, distribution=None, validate_args=False, allow_nan_stats=True,
    name='MultivariateNormalLinearOperator'
)

This distribution models a random vector Y = (Y1,...,Yk), making use of a SinhArcsinh transformation (which has adjustable tailweight and skew), a rescaling, and a shift.

The SinhArcsinh transformation of the Normal is described in great depth in Sinh-arcsinh distributions. Here we use a slightly different parameterization, in terms of tailweight and skewness. Additionally we allow for distributions other than Normal, and control over scale as well as a "shift" parameter loc.

Mathematical Details

Given iid random vector Z = (Z1,...,Zk), we define the VectorSinhArcsinhDiag transformation of Z, Y, parameterized by (loc, scale, skewness, tailweight), via the relation (with @ denoting matrix multiplication):

Y := loc + scale @ F(Z) * (2 / F_0(2))
F(Z) := Sinh( (Arcsinh(Z) + skewness) * tailweight )
F_0(Z) := Sinh( Arcsinh(Z) * tailweight )

This distribution is similar to the location-scale transformation L(Z) := loc + scale @ Z in the following ways:

If skewness = 0 and tailweight = 1 (the defaults), F(Z) = Z, and then Y = L(Z) exactly.
loc is used in both to shift the result by a constant factor.
The multiplication of scale by 2 / F_0(2) ensures that if skewness = 0 P[Y - loc <= 2 * scale] = P[L(Z) - loc <= 2 * scale]. Thus it can be said that the weights in the tails of Y and L(Z) beyond loc + 2 * scale are the same.

This distribution is different than loc + scale @ Z due to the reshaping done by F:

Positive (negative) skewness leads to positive (negative) skew.
- positive skew means, the mode of F(Z) is "tilted" to the right.
- positive skew means positive values of F(Z) become more likely, and negative values become less likely.
Larger (smaller) tailweight leads to fatter (thinner) tails.
- Fatter tails mean larger values of |F(Z)| become more likely.
- tailweight < 1 leads to a distribution that is "flat" around Y = loc, and a very steep drop-off in the tails.
- tailweight > 1 leads to a distribution more peaked at the mode with heavier tails.

To see the argument about the tails, note that for |Z| >> 1 and |Z| >> (|skewness| * tailweight)**tailweight, we have Y approx 0.5 Z**tailweight e**(sign(Z) skewness * tailweight).

To see the argument regarding multiplying scale by 2 / F_0(2),

P[(Y - loc) / scale <= 2] = P[F(Z) * (2 / F_0(2)) <= 2]
                          = P[F(Z) <= F_0(2)]
                          = P[Z <= 2]  (if F = F_0).

Args
`loc`	Floating-point `Tensor`. If this is set to `None`, `loc` is implicitly `0`. When specified, may have shape `[B1, ..., Bb, k]` where `b >= 0` and `k` is the event size.
`scale_diag`	Non-zero, floating-point `Tensor` representing a diagonal matrix added to `scale`. May have shape `[B1, ..., Bb, k]`, `b >= 0`, and characterizes `b`-batches of `k x k` diagonal matrices added to `scale`. When both `scale_identity_multiplier` and `scale_diag` are `None` then `scale` is the `Identity`.
`scale_identity_multiplier`	Non-zero, floating-point `Tensor` representing a scale-identity-matrix added to `scale`. May have shape `[B1, ..., Bb]`, `b >= 0`, and characterizes `b`-batches of scale `k x k` identity matrices added to `scale`. When both `scale_identity_multiplier` and `scale_diag` are `None` then `scale` is the `Identity`.
`skewness`	Skewness parameter. floating-point `Tensor` with shape broadcastable with `event_shape`.
`tailweight`	Tailweight parameter. floating-point `Tensor` with shape broadcastable with `event_shape`.
`distribution`	`tf.Distribution`-like instance. Distribution from which `k` iid samples are used as input to transformation `F`. Default is `tfp.distributions.Normal(loc=0., scale=1.)`. Must be a scalar-batch, scalar-event distribution. Typically `distribution.reparameterization_type = FULLY_REPARAMETERIZED` or it is a function of non-trainable parameters. WARNING: If you backprop through a VectorSinhArcsinhDiag sample and `distribution` is not `FULLY_REPARAMETERIZED` yet is a function of trainable variables, then the gradient will be incorrect!
`validate_args`	Python `bool`, default `False`. When `True` distribution parameters are checked for validity despite possibly degrading runtime performance. When `False` invalid inputs may silently render incorrect outputs.
`allow_nan_stats`	Python `bool`, default `True`. When `True`, statistics (e.g., mean, mode, variance) use the value "`NaN`" to indicate the result is undefined. When `False`, an exception is raised if one or more of the statistic's batch members are undefined.
`name`	Python `str` name prefixed to Ops created by this class.

Raises
`ValueError`	if at most `scale_identity_multiplier` is specified.

Attributes
`allow_nan_stats`	Python `bool` describing behavior when a stat is undefined. Stats return +/- infinity when it makes sense. E.g., the variance of a Cauchy distribution is infinity. However, sometimes the statistic is undefined, e.g., if a distribution's pdf does not achieve a maximum within the support of the distribution, the mode is undefined. If the mean is undefined, then by definition the variance is undefined. E.g. the mean for Student's T for df = 1 is undefined (no clear way to say it is either + or - infinity), so the variance = E[(X - mean)**2] is also undefined.
`batch_shape`	Shape of a single sample from a single event index as a `TensorShape`. May be partially defined or unknown. The batch dimensions are indexes into independent, non-identical parameterizations of this distribution.
`bijector`	Function transforming x => y.
`distribution`	Base distribution, p(x).
`dtype`	The `DType` of `Tensor`s handled by this `Distribution`.
`event_shape`	Shape of a single sample from a single batch as a `TensorShape`. May be partially defined or unknown.
`loc`	The `loc` in `Y := loc + scale @ F(Z) * (2 / F(2)). </td> </tr><tr> <td>`name`</td> <td> Name prepended to all ops created by this`Distribution`. </td> </tr><tr> <td>`parameters`</td> <td> Dictionary of parameters used to instantiate this`Distribution`. </td> </tr><tr> <td>`reparameterization_type`	Describes how samples from the distribution are reparameterized. Currently this is one of the static instances `distributions.FULLY_REPARAMETERIZED` or `distributions.NOT_REPARAMETERIZED`.
`scale`	The `LinearOperator` `scale` in `Y := loc + scale @ F(Z) * (2 / F(2)). </td> </tr><tr> <td>`skewness`</td> <td> Controls the skewness.`Skewness > 0`means right skew. </td> </tr><tr> <td>`tailweight`</td> <td> Controls the tail decay.`tailweight > 1`means faster than Normal. </td> </tr><tr> <td>`validate_args`</td> <td> Python`bool` indicating possibly expensive checks are enabled.

Args
`value`	`float` or `double` `Tensor`.
`name`	Python `str` prepended to names of ops created by this function.

Args
`other`	`tfp.distributions.Distribution` instance.
`name`	Python `str` prepended to names of ops created by this function.

Args
`sample_shape`	`Tensor` or python list/tuple. Desired shape of a call to `sample()`.
`name`	name to prepend ops with.

Args
`sample_shape`	0D or 1D `int32` `Tensor`. Shape of the generated samples.
`seed`	Python integer seed for RNG
`name`	name to give to the op.

tf.contrib.distributions.VectorSinhArcsinhDiag

Mathematical Details

Args

Raises

Attributes

Methods

batch_shape_tensor

cdf

copy

covariance

cross_entropy

entropy

event_shape_tensor

is_scalar_batch

is_scalar_event

kl_divergence

log_cdf

log_prob

log_survival_function

mean

mode

param_shapes

param_static_shapes

prob

quantile

sample

stddev

survival_function

variance

`batch_shape_tensor`

`cdf`

`copy`

`covariance`

`cross_entropy`

`entropy`

`event_shape_tensor`

`is_scalar_batch`

`is_scalar_event`

`kl_divergence`

`log_cdf`

`log_prob`

`log_survival_function`

`mean`

`mode`

`param_shapes`

`param_static_shapes`

`prob`

`quantile`

`sample`

`stddev`

`survival_function`

`variance`