tfdv.GenerateStatistics

View source on GitHub

API for generating data statistics.

Example:

  with beam.Pipeline(runner=...) as p:
    _ = (p
         | 'ReadData' >> beam.io.ReadFromTFRecord(data_location)
         | 'DecodeData' >> tfdv.DecodeTFExample()
         | 'GenerateStatistics' >> GenerateStatistics()
         | 'WriteStatsOutput' >> tfdv.WriteStatisticsToTFRecord(output_path))

options tfdv.StatsOptions for generating data statistics.

TypeError If options is not of the expected type.

Class Variables

  • pipeline = None