Announcing the TensorFlow Dev Summit 2020 Learn more

tfx.components.StatisticsGen

View source on GitHub

Class StatisticsGen

Official TFX StatisticsGen component.

Inherits From: BaseComponent

Aliases: tfx.components.statistics_gen.component.StatisticsGen

Used in the tutorials:

The StatisticsGen component generates features statistics and random samples over training data, which can be used for visualization and validation. StatisticsGen uses Apache Beam and approximate algorithms to scale to large datasets.

Please see https://www.tensorflow.org/tfx/data_validation for more details.

Example

  # Computes statistics over data for visualization and example validation.
  statistics_gen = StatisticsGen(examples=example_gen.outputs['examples'])

__init__

View source

__init__(
    examples=None,
    output=None,
    input_data=None,
    instance_name=None
)

Construct a StatisticsGen component.

Args:

  • examples: A Channel of ExamplesPath type, likely generated by the ExampleGen component. This needs to contain two splits labeled train and eval. required
  • output: ExampleStatisticsPath channel for statistics of each split provided in the input examples.
  • input_data: Backwards compatibility alias for the examples argument.
  • instance_name: Optional name assigned to this specific instance of StatisticsGen. Required only if multiple StatisticsGen components are declared in the same pipeline.

Child Classes

class DRIVER_CLASS

class SPEC_CLASS

Properties

component_id

DEPRECATED FUNCTION

component_type

DEPRECATED FUNCTION

downstream_nodes

exec_properties

id

Node id, unique across all TFX nodes in a pipeline.

If instance name is available, node_id will be: . otherwise, node_id will be:

Returns:

node id.

inputs

outputs

type

upstream_nodes

Methods

add_downstream_node

View source

add_downstream_node(downstream_node)

add_upstream_node

View source

add_upstream_node(upstream_node)

from_json_dict

View source

from_json_dict(
    cls,
    dict_data
)

Convert from dictionary data to an object.

to_json_dict

View source

to_json_dict()

Convert from an object to a JSON serializable dictionary.

Class Members

  • EXECUTOR_SPEC