tfdv.TransformStatsGenerator

Class TransformStatsGenerator

Generate statistics using a Beam PTransform.

Note that the input PTransform must take a PCollection of sliced examples (tuples of the form (slice_key, example)) as input and output a PCollection of sliced protos (tuples of the form (slice_key, DatasetFeatureStatistics proto)).
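
The input/output shape described above can be illustrated without Beam. The following is only a minimal sketch of the slicing contract, not TFDV code: count_examples_per_slice is a hypothetical helper, a plain Python iterable stands in for the PCollection, and a dict with a num_examples entry stands in for the DatasetFeatureStatistics proto. A real generator would wrap the equivalent logic in a Beam PTransform and pass it as the ptransform argument.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple

# Hypothetical stand-ins for illustration only: a real generator operates on
# a Beam PCollection and emits DatasetFeatureStatistics protos.
Example = Dict[str, object]
SlicedExample = Tuple[str, Example]
SlicedStats = Tuple[str, Dict[str, int]]


def count_examples_per_slice(
    sliced_examples: Iterable[SlicedExample],
) -> List[SlicedStats]:
    """Maps (slice_key, example) pairs to one (slice_key, stats) pair per slice."""
    # Count how many examples were seen for each slice key.
    counts = Counter(slice_key for slice_key, _ in sliced_examples)
    # Emit exactly one record per slice key, sorted by key for stable output.
    return [(key, {"num_examples": n}) for key, n in sorted(counts.items())]


sliced = [
    ("country_US", {"age": 31}),
    ("country_DE", {"age": 45}),
    ("country_US", {"age": 27}),
]
print(count_examples_per_slice(sliced))
# [('country_DE', {'num_examples': 1}), ('country_US', {'num_examples': 2})]
```

The point of the sketch is that the slice key travels with every record: each input element carries its slice key, and the output contains one statistics record per slice key rather than a single record for the whole dataset.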

__init__

__init__(
    name,
    ptransform,
    schema=None
)

Initializes a statistics generator.

Args:

  • name: A unique name associated with the statistics generator.
  • ptransform: The Beam PTransform used to generate the statistics.
  • schema: An optional schema for the dataset.

Properties

name

ptransform

schema