tf.feature_column.sequence_categorical_column_with_hash_bucket
Stay organized with collections
Save and categorize content based on your preferences.
A sequence of categorical terms where ids are set by hashing.
tf.feature_column.sequence_categorical_column_with_hash_bucket(
key, hash_bucket_size, dtype=tf.dtypes.string
)
Pass this to embedding_column
or indicator_column
to convert sequence
categorical data into dense representation for input to sequence NN, such as
RNN.
Example:
tokens = sequence_categorical_column_with_hash_bucket(
'tokens', hash_bucket_size=1000)
tokens_embedding = embedding_column(tokens, dimension=10)
columns = [tokens_embedding]
features = tf.io.parse_example(..., features=make_parse_example_spec(columns))
sequence_feature_layer = SequenceFeatures(columns)
sequence_input, sequence_length = sequence_feature_layer(features)
sequence_length_mask = tf.sequence_mask(sequence_length)
rnn_cell = tf.keras.layers.SimpleRNNCell(hidden_size)
rnn_layer = tf.keras.layers.RNN(rnn_cell)
outputs, state = rnn_layer(sequence_input, mask=sequence_length_mask)
Args |
key
|
A unique string identifying the input feature.
|
hash_bucket_size
|
An int > 1. The number of buckets.
|
dtype
|
The type of features. Only string and integer types are supported.
|
Returns |
A SequenceCategoricalColumn .
|
Raises |
ValueError
|
hash_bucket_size is not greater than 1.
|
ValueError
|
dtype is neither string nor integer.
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.
Last updated 2021-05-14 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2021-05-14 UTC."],[],[]]