Apply to speak at TensorFlow World. Deadline April 23rd. Propose talk

tfdv.GenerateStatistics

Class GenerateStatistics

API for generating data statistics.

Example:

  with beam.Pipeline(runner=...) as p:
    _ = (p
         | 'ReadData' >> beam.io.ReadFromTFRecord(data_location)
         | 'DecodeData' >> beam.Map(TFExampleDecoder().decode)
         | 'GenerateStatistics' >> GenerateStatistics()
         | 'WriteStatsOutput' >> beam.io.WriteToTFRecord(
             output_path, shard_name_template='',
             coder=beam.coders.ProtoCoder(
                 statistics_pb2.DatasetFeatureStatisticsList)))

__init__

__init__(options=stats_options.StatsOptions())

Initializes the transform.

Args:

Raises:

  • TypeError: If options is not of the expected type.

Methods

expand

expand(dataset)