TF 2.0 is out! Get hands-on practice at TF World, Oct 28-31. Use code TF20 for 20% off select passes. Register now

Module: tfdv

Init module for TensorFlow Data Validation.

Classes

class CombinerStatsGenerator: Generate statistics using combiner function.

class DecodeCSV: Decodes CSV records into Arrow tables.

class FeaturePath: Represents the path to a feature in an input example.

class GenerateStatistics: API for generating data statistics.

class StatsOptions: Options for generating statistics.

class TFExampleDecoder: A decoder for decoding TF examples into tf data validation datasets.

class TransformStatsGenerator: Generate statistics using a Beam PTransform.

Functions

DecodeTFExample(...): Decodes serialized TF examples into Arrow tables.

display_anomalies(...): Displays the input anomalies.

display_schema(...): Displays the input schema.

generate_statistics_from_csv(...): Compute data statistics from CSV files.

generate_statistics_from_dataframe(...): Compute data statistics for the input pandas DataFrame.

generate_statistics_from_tfrecord(...): Compute data statistics from TFRecord files containing TFExamples.

get_domain(...): Get the domain associated with the input feature from the schema.

get_feature(...): Get a feature from the schema.

infer_schema(...): Infers schema from the input statistics.

load_anomalies_text(...): Loads the Anomalies proto stored in text format in the input path.

load_schema_text(...): Loads the schema stored in text format in the input path.

load_statistics(...): Loads data statistics proto from file.

set_domain(...): Sets the domain for the input feature in the schema.

update_schema(...): Updates input schema to conform to the input statistics.

validate_examples_in_csv(...): Validates examples in csv files.

validate_examples_in_tfrecord(...): Validates TFExamples in TFRecord files.

validate_instance(...): Validates a batch of examples against the schema provided in options.

validate_statistics(...): Validates the input statistics against the provided input schema.

visualize_statistics(...): Visualize the input statistics using Facets.

write_anomalies_text(...): Writes the Anomalies proto to a file in text format.

write_schema_text(...): Writes input schema to a file in text format.