Operations for working with string Tensors.
Functions
as_string(...)
: Converts each entry in the given tensor to strings.
bytes_split(...)
: Split string elements of input
into bytes.
format(...)
: Formats a string template using a list of tensors.
join(...)
: Perform element-wise concatenation of a list of string tensors.
length(...)
: String lengths of input
.
lower(...)
: Converts all uppercase characters into their respective lowercase replacements.
ngrams(...)
: Create a tensor of n-grams based on data
.
reduce_join(...)
: Joins all strings into a single string, or joins along an axis.
regex_full_match(...)
: Check if the input matches the regex pattern.
regex_replace(...)
: Replace elements of input
matching regex pattern
with rewrite
.
split(...)
: Split elements of input
based on sep
into a RaggedTensor
.
strip(...)
: Strip leading and trailing whitespaces from the Tensor.
substr(...)
: Return substrings from Tensor
of strings.
to_hash_bucket(...)
: Converts each string in the input Tensor to its hash mod by a number of buckets.
to_hash_bucket_fast(...)
: Converts each string in the input Tensor to its hash mod by a number of buckets.
to_hash_bucket_strong(...)
: Converts each string in the input Tensor to its hash mod by a number of buckets.
to_number(...)
: Converts each string in the input Tensor to the specified numeric type.
unicode_decode(...)
: Decodes each string in input
into a sequence of Unicode code points.
unicode_decode_with_offsets(...)
: Decodes each string into a sequence of code points with start offsets.
unicode_encode(...)
: Encodes each sequence of Unicode code points in input
into a string.
unicode_script(...)
: Determine the script codes of a given tensor of Unicode integer code points.
unicode_split(...)
: Splits each string in input
into a sequence of Unicode code points.
unicode_split_with_offsets(...)
: Splits each string into a sequence of code points with start offsets.
unicode_transcode(...)
: Transcode the input text from a source encoding to a destination encoding.
unsorted_segment_join(...)
: Joins the elements of inputs
based on segment_ids
.
upper(...)
: Converts all lowercase characters into their respective uppercase replacements.