Encode a tensor of ints into unicode strings.
Returns a vector of strings, where `output[i]` is constructed by encoding the Unicode codepoints in `input_values[input_splits[i]:input_splits[i+1]]` using `output_encoding`.
---
Example:
input_values = [72, 101, 108, 108, 111, 87, 111, 114, 108, 100]
input_splits = [0, 5, 10]
output_encoding = 'UTF-8'
output = ['Hello', 'World']
Nested Classes
class | UnicodeEncode.Options |
Optional attributes for
UnicodeEncode
|
Public Methods
Output <String> |
asOutput
()
Returns the symbolic handle of a tensor.
|
static <T extends Number> UnicodeEncode |
create
(
Scope
scope,
Operand
<Integer> inputValues,
Operand
<T> inputSplits, String outputEncoding,
Options...
options)
Factory method to create a class wrapping a new UnicodeEncode operation.
|
static UnicodeEncode.Options |
errors
(String errors)
|
Output <String> |
output
()
The 1-D Tensor of strings encoded from the provided unicode codepoints.
|
static UnicodeEncode.Options |
replacementChar
(Long replacementChar)
|
Inherited Methods
Public Methods
public Output <String> asOutput ()
Returns the symbolic handle of a tensor.
Inputs to TensorFlow operations are outputs of another TensorFlow operation. This method is used to obtain a symbolic handle that represents the computation of the input.
public static UnicodeEncode create ( Scope scope, Operand <Integer> inputValues, Operand <T> inputSplits, String outputEncoding, Options... options)
Factory method to create a class wrapping a new UnicodeEncode operation.
Parameters
scope | current scope |
---|---|
inputValues | A 1D tensor containing the unicode codepoints that should be encoded. |
inputSplits | A 1D tensor specifying how the unicode codepoints should be split into strings. In particular, `output[i]` is constructed by encoding the codepoints in the slice `input_values[input_splits[i]:input_splits[i+1]]`. |
outputEncoding | Unicode encoding of the output strings. Valid encodings are: `"UTF-8", "UTF-16-BE", and "UTF-32-BE"`. |
options | carries optional attributes values |
Returns
- a new instance of UnicodeEncode
public static UnicodeEncode.Options errors (String errors)
Parameters
errors | Error handling policy when there is invalid formatting found in the input. The value of 'strict' will cause the operation to produce a InvalidArgument error on any invalid input formatting. A value of 'replace' (the default) will cause the operation to replace any invalid formatting in the input with the `replacement_char` codepoint. A value of 'ignore' will cause the operation to skip any invalid formatting in the input and produce no corresponding output character. |
---|
public Output <String> output ()
The 1-D Tensor of strings encoded from the provided unicode codepoints.
public static UnicodeEncode.Options replacementChar (Long replacementChar)
Parameters
replacementChar | The replacement character codepoint to be used in place of any invalid formatting in the input when `errors='replace'`. Any valid unicode codepoint may be used. The default value is the default unicode replacement character is 0xFFFD (U+65533). |
---|