Maximum length of the string to encode. Note that this
is measured in bytes and some unicode characters may take more than 1
byte. In the case that string_max_length does not divide
self._dtype_size_bytes (calculated below), it is rounded up to the
smallest integer that divides it.
Maximum value in each chunk. Defaults to the maximum
possible value in dtype.
encoded_strings: A tf.Tensor of shape
(num_strings, self._num_chunks) containing encoded input_strings
trimmed_input_strings: A tf.Tensor of shape (num_strings,)
containing trimmed input_strings that the length of each string in it
is no more than self._max_length bytes.
Note that a utf-8 character might take morethan one byte, so both the
encoded and trimmed strings could contain characters that are cut in the
middle. The caller needs to be aware of this when decoding these strings,
i.g. decode a byte string s by s.decode('utf-8', 'ignore') to avoid