Compute data statistics for the input pandas DataFrame.

This is a utility method for users with in-memory data represented as a pandas DataFrame.

dataframe: Input pandas DataFrame.
stats_options: tfdv.StatsOptions for generating data statistics.
n_jobs: Number of processes to run (defaults to 1). If -1 is provided, uses the same number of processes as the number of CPU cores.
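
The n_jobs convention described above can be illustrated with a small sketch. The helper `resolve_n_jobs` is hypothetical and is not part of TFDV; it only shows one plausible way to map the argument to a process count. Rejecting other non-positive values is an assumption made for this example.

```python
import multiprocessing


def resolve_n_jobs(n_jobs: int = 1) -> int:
    """Hypothetical helper: map an n_jobs argument to a process count."""
    if n_jobs == -1:
        # -1 means "use as many processes as there are CPU cores".
        return multiprocessing.cpu_count()
    if n_jobs < 1:
        # Assumption for this sketch: other non-positive values are invalid.
        raise ValueError(f"n_jobs must be -1 or a positive integer, got {n_jobs}")
    return n_jobs


default_jobs = resolve_n_jobs()      # the documented default of 1
all_core_jobs = resolve_n_jobs(-1)   # one process per CPU core on this machine
```

On small DataFrames a single process is usually sufficient; raising n_jobs mainly pays off when the data is large enough to outweigh the cost of starting extra processes.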

Returns a DatasetFeatureStatisticsList proto.