tfp.stats.quantiles

Compute quantiles of x along axis.

tfp.stats.quantiles(
    x,
    num_quantiles,
    axis=None,
    interpolation=None,
    keepdims=False,
    validate_args=False,
    name=None
)

The quantiles of a distribution are cut points dividing the range into intervals with equal probabilities.

Given a vector x of samples, this function estimates the cut points by returning num_quantiles + 1 cut points, (c0, ..., cn), such that, roughly speaking, equal number of sample points lie in the num_quantiles intervals [c0, c1), [c1, c2), ..., [c_{n-1}, cn]. That is,

About 1 / n fraction of the data lies in [c_{k-1}, c_k), k = 1, ..., n
About k / n fraction of the data lies below c_k.
c0 is the sample minimum and cn is the maximum.

The exact number of data points in each interval depends on the size of x (e.g. whether the size is divisible by n) and the interpolation kwarg.

Args
`x`	Numeric `N-D` `Tensor` with `N > 0`. If `axis` is not `None`, `x` must have statically known number of dimensions.
`num_quantiles`	Scalar `integer` `Tensor`. The number of intervals the returned `num_quantiles + 1` cut points divide the range into.
`axis`	Optional `0-D` or `1-D` integer `Tensor` with constant values. The axis that index independent samples over which to return the desired percentile. If `None` (the default), treat every dimension as a sample dimension, returning a scalar.
`interpolation`	{'nearest', 'linear', 'lower', 'higher', 'midpoint'}. Default value: 'nearest'. This specifies the interpolation method to use when the fractions `k / n` lie between two data points `i < j`: linear: i + (j - i) * fraction, where fraction is the fractional part of the index surrounded by i and j. lower: `i`. higher: `j`. nearest: `i` or `j`, whichever is nearest. midpoint: (i + j) / 2. `linear` and `midpoint` interpolation do not work with integer dtypes.
`keepdims`	Python `bool`. If `True`, the last dimension is kept with size 1 If `False`, the last dimension is removed from the output shape.
`validate_args`	Whether to add runtime checks of argument validity. If False, and arguments are incorrect, correct behavior is not guaranteed.
`name`	A Python string name to give this `Op`. Default is 'percentile'

Returns
`cut_points`	A `rank(x) + 1 - len(axis)` dimensional `Tensor` with same `dtype` as `x` and shape `[num_quantiles + 1, ...]` where the trailing shape is that of `x` without the dimensions in `axis` (unless `keepdims is True`)

Raises
`ValueError`	If argument 'interpolation' is not an allowed type.
`ValueError`	If interpolation type not compatible with `dtype`.

Examples

# Get quartiles of x with various interpolation choices.
x = [0.,  1.,   2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.]

tfp.stats.quantiles(x, num_quantiles=4, interpolation='nearest')
==> [  0.,   2.,   5.,   8.,  10.]

tfp.stats.quantiles(x, num_quantiles=4, interpolation='linear')
==> [  0. ,   2.5,   5. ,   7.5,  10. ]

tfp.stats.quantiles(x, num_quantiles=4, interpolation='lower')
==> [  0.,   2.,   5.,   7.,  10.]

# Get deciles of columns of an R x C data set.
data = load_my_columnar_data(...)
tfp.stats.quantiles(data, num_quantiles=10)
==> Shape [11, C] Tensor

tfp.stats.quantiles Stay organized with collections Save and categorize content based on your preferences.

Args

Returns

Raises

Examples

tfp.stats.quantiles