Module: tf.strings

Operations for working with string Tensors.

Functions

as_string(...): Converts each entry in the given tensor to strings.

bytes_split(...): Split string elements of input into bytes.

format(...): Formats a string template using a list of tensors.

join(...): Perform element-wise concatenation of a list of string tensors.

length(...): String lengths of input.

lower(...): Converts all uppercase characters into their respective lowercase replacements.

ngrams(...): Create a tensor of n-grams based on data.

reduce_join(...): Joins all strings into a single string, or joins along an axis.

regex_full_match(...): Check if the input matches the regex pattern.

regex_replace(...): Replace elements of input matching regex pattern with rewrite.

split(...): Split elements of input based on sep into a RaggedTensor.

strip(...): Strip leading and trailing whitespaces from the Tensor.

substr(...): Return substrings from Tensor of strings.

to_hash_bucket(...): Converts each string in the input Tensor to its hash mod by a number of buckets.

to_hash_bucket_fast(...): Converts each string in the input Tensor to its hash mod by a number of buckets.

to_hash_bucket_strong(...): Converts each string in the input Tensor to its hash mod by a number of buckets.

to_number(...): Converts each string in the input Tensor to the specified numeric type.

unicode_decode(...): Decodes each string in input into a sequence of Unicode code points.

unicode_decode_with_offsets(...): Decodes each string into a sequence of code points with start offsets.

unicode_encode(...): Encodes each sequence of Unicode code points in input into a string.

unicode_script(...): Determine the script codes of a given tensor of Unicode integer code points.

unicode_split(...): Splits each string in input into a sequence of Unicode code points.

unicode_split_with_offsets(...): Splits each string into a sequence of code points with start offsets.

unicode_transcode(...): Transcode the input text from a source encoding to a destination encoding.

unsorted_segment_join(...): Joins the elements of inputs based on segment_ids.

upper(...): Converts all lowercase characters into their respective uppercase replacements.