Module: tft

Init module for TF.Transform.

Modules

coders module: Module level imports for tensorflow_transform.coders.

Classes

class TFTransformOutput: A wrapper around the output of tf.Transform.

Functions

apply_buckets(...): Returns a bucketized column, with a bucket index assigned to each input.

apply_buckets_with_interpolation(...): Interpolates within the provided buckets and then normalizes to the range [0, 1].

apply_function(...): Equivalent to fn(*args). (deprecated)

apply_function_with_checkpoint(...): Applies a tensor-in-tensor-out function with variables to some Tensors.

apply_pyfunc(...): Applies a Python function to some Tensors.

apply_saved_model(...): Applies a SavedModel to some Tensors.

apply_vocab(...): See tft.apply_vocabulary. (deprecated)

apply_vocabulary(...): Maps x to a vocabulary specified by the deferred tensor.

bag_of_words(...): Computes a bag of "words" based on the specified ngram configuration.

bucketize(...): Returns a bucketized column, with a bucket index assigned to each input.

bucketize_per_key(...): Returns a bucketized column, with a bucket index assigned to each input on a per-key basis.

compute_and_apply_vocabulary(...): Generates a vocabulary for x and maps each value of x to an integer using that vocabulary.

covariance(...): Computes the covariance matrix over the whole dataset.

deduplicate_tensor_per_row(...): Deduplicates each row (0-th dimension) of the provided tensor.

get_analyze_input_columns(...): Returns the columns that are required inputs of AnalyzeDataset.

get_transform_input_columns(...): Returns the columns that are required inputs of TransformDataset.

hash_strings(...): Hashes strings into buckets.

max(...): Computes the maximum of the values of a Tensor over the whole dataset.

mean(...): Computes the mean of the values of a Tensor over the whole dataset.

min(...): Computes the minimum of the values of a Tensor over the whole dataset.

ngrams(...): Creates a SparseTensor of n-grams.

pca(...): Computes PCA on the dataset using biased covariance.

ptransform_analyzer(...): Applies a user-provided PTransform over the whole dataset.

quantiles(...): Computes the quantile boundaries of a Tensor over the whole dataset.

scale_by_min_max(...): Scales a numerical column into the range [output_min, output_max].

scale_by_min_max_per_key(...): Scales a numerical column into a predefined range on a per-key basis.

scale_to_0_1(...): Returns a column which is the input column scaled to have range [0,1].

scale_to_0_1_per_key(...): Returns a column which is the input column scaled to have range [0,1] on a per-key basis.

scale_to_z_score(...): Returns a standardized column with mean 0 and variance 1.

scale_to_z_score_per_key(...): Returns a standardized column with mean 0 and variance 1, grouped per key.

segment_indices(...): Returns a Tensor of indices within each segment.

size(...): Computes the total size of instances in a Tensor over the whole dataset.

sparse_tensor_to_dense_with_shape(...): Converts a SparseTensor into a dense tensor and sets its shape.

string_to_int(...): See tft.compute_and_apply_vocabulary. (deprecated)

sum(...): Computes the sum of the values of a Tensor over the whole dataset.

tfidf(...): Maps the terms in x to their term frequency * inverse document frequency.

uniques(...): See tft.vocabulary. (deprecated)

var(...): Computes the variance of the values of a Tensor over the whole dataset.

vocabulary(...): Computes the unique values of a Tensor over the whole dataset.

word_count(...): Finds the token count of each document/row.
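
Several of the functions listed above share one pattern: a statistic is computed over the whole dataset (min, max, mean, var), and that statistic is then applied to each individual value (scale_to_0_1, scale_to_z_score). The following is a minimal plain-Python sketch of that two-phase idea. It does not call tf.Transform; the names `analyze`, `scale_to_0_1_sketch`, and `scale_to_z_score_sketch` are hypothetical helpers introduced only for illustration, and the use of the biased (population) variance is an assumption of this sketch.

```python
import math

# Toy single-column "dataset". In a real pipeline this would be a Tensor
# processed by tf.Transform; a plain list is used here for illustration.
values = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]


def analyze(column):
    """Phase 1: one full pass over the data to collect statistics.

    Hypothetical helper. Variance is the biased (population) variance,
    i.e. it divides by n rather than n - 1 (an assumption of this sketch).
    """
    n = len(column)
    mean = sum(column) / n
    var = sum((v - mean) ** 2 for v in column) / n
    return {"min": min(column), "max": max(column), "mean": mean, "var": var}


def scale_to_0_1_sketch(x, stats):
    """Phase 2: map one value into [0, 1] using the dataset min and max.

    Does not handle a constant column (max == min), which would divide by zero.
    """
    return (x - stats["min"]) / (stats["max"] - stats["min"])


def scale_to_z_score_sketch(x, stats):
    """Phase 2: standardize one value using the dataset mean and variance.

    Does not handle a zero-variance column.
    """
    return (x - stats["mean"]) / math.sqrt(stats["var"])


stats = analyze(values)
scaled = [scale_to_0_1_sketch(v, stats) for v in values]
z_scores = [scale_to_z_score_sketch(v, stats) for v in values]

print(stats)
print(scaled)
print(z_scores)
```

The point of the split is that the statistics are fixed once the full pass is done, so the per-value step can later be applied to new inputs without revisiting the dataset.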