Module: tfdv

Init module for TensorFlow Data Validation.

Classes

class CombinerStatsGenerator: Generate statistics using combiner function.

class DecodeCSV: Decodes CSV records into an in-memory dict representation.

class GenerateStatistics: API for generating data statistics.

class StatsOptions: Options for generating statistics.

class TFExampleDecoder: A decoder for decoding TF examples into tf data validation datasets.

class TransformStatsGenerator: Generate statistics using a Beam PTransform.

Functions

display_anomalies(...): Displays the input anomalies.

display_schema(...): Displays the input schema.

generate_statistics_from_csv(...): Compute data statistics from CSV files.

generate_statistics_from_tfrecord(...): Compute data statistics from TFRecord files containing TFExamples.

get_domain(...): Get the domain associated with the input feature from the schema.

get_feature(...): Get a feature from the schema.

infer_schema(...): Infer schema from the input statistics.

load_schema_text(...): Loads the schema stored in text format in the input path.

load_statistics(...): Loads data statistics proto from file.

set_domain(...): Sets the domain for the input feature in the schema.

validate_statistics(...): Validate the input statistics against the provided input schema.

visualize_statistics(...): Visualize the input statistics using Facets.

write_schema_text(...): Writes input schema to a file in text format.