tfx.components.example_gen.utils.calculate_splits_fingerprint_and_span

Calculates the fingerprint of files in a URI matching split patterns.

If a pattern contains the {SPAN} placeholder, the function attempts to find a single span value, identical across all splits, for which every split has its most recently updated files.

Args

input_base_uri: The base path from which files will be searched.
splits: An iterable collection of example_gen_pb2.Input.Split objects.

Returns

A tuple of (fingerprint, select_span), where select_span is either the value matched by the {SPAN} placeholder, or None if the placeholder wasn't specified.
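
The behavior described above can be illustrated with a minimal standalone sketch. It does not call TFX and is not the library's implementation. The helper names (`_latest_span`, `splits_fingerprint_and_span`), the use of a plain dict of split name to pattern in place of example_gen_pb2.Input.Split objects, the SHA-256 fingerprint format, and the choice to raise when splits disagree on the latest span are all assumptions made for illustration.

```python
import glob
import hashlib
import os
import re
import tempfile
from typing import Dict, Optional, Tuple

SPAN_SPEC = "{SPAN}"  # placeholder named in the description above


def _latest_span(base_uri: str, pattern: str) -> Optional[int]:
    """Largest integer that {SPAN} matches among existing paths, else None."""
    if SPAN_SPEC not in pattern:
        return None
    full = os.path.join(base_uri, pattern)
    # Escape the path, then turn {SPAN} into a digit group and "*" into a
    # wildcard that stays within one path segment.
    regex = re.escape(full)
    regex = regex.replace(re.escape(SPAN_SPEC), r"(\d+)")
    regex = regex.replace(re.escape("*"), "[^/]*")
    spans = []
    for path in glob.glob(full.replace(SPAN_SPEC, "*")):
        match = re.fullmatch(regex, path)
        if match:
            spans.append(int(match.group(1)))
    return max(spans) if spans else None


def splits_fingerprint_and_span(
    base_uri: str, splits: Dict[str, str]
) -> Tuple[str, Optional[int]]:
    """Hypothetical stand-in: returns (fingerprint, select_span)."""
    spans = {_latest_span(base_uri, p) for p in splits.values()}
    if len(spans) > 1:
        # Assumption: all splits must agree on one span value.
        raise ValueError(f"Splits disagree on the latest span: {spans}")
    span = spans.pop() if spans else None

    digest = hashlib.sha256()
    for name in sorted(splits):
        pattern = splits[name]
        if span is not None:
            # Assumes span directories are not zero-padded.
            pattern = pattern.replace(SPAN_SPEC, str(span))
        for path in sorted(glob.glob(os.path.join(base_uri, pattern))):
            stat = os.stat(path)
            rel = os.path.relpath(path, base_uri)
            record = f"{name}:{rel}:{stat.st_size}:{int(stat.st_mtime)}"
            digest.update(record.encode("utf-8"))
    return digest.hexdigest(), span


# Usage: two spans on disk, each holding a train and an eval split.
with tempfile.TemporaryDirectory() as base:
    for span_dir in ("span1", "span2"):
        for split in ("train", "eval"):
            os.makedirs(os.path.join(base, span_dir, split))
            with open(os.path.join(base, span_dir, split, "data.txt"), "w") as f:
                f.write(f"{span_dir}-{split}")

    demo_splits = {"train": "span{SPAN}/train/*", "eval": "span{SPAN}/eval/*"}
    fingerprint, select_span = splits_fingerprint_and_span(base, demo_splits)
    # A pattern without the placeholder yields no span.
    _, no_span = splits_fingerprint_and_span(base, {"all": "span1/*/*"})
```

In this sketch the fingerprint changes whenever a matched file's path, size, or modification time changes, which is one plausible way to detect updated input. The fields that the actual function hashes may differ.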