Missed TensorFlow World? Check out the recap. Learn more

tf.keras.preprocessing.text.text_to_word_sequence

TensorFlow 2 version

Converts a text to a sequence of words (or tokens).

Aliases:

tf.keras.preprocessing.text.text_to_word_sequence(
    text,
    filters='!"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n',
    lower=True,
    split=' '
)

Arguments

text: Input text (string).
filters: list (or concatenation) of characters to filter out, such as
    punctuation. Default: ``!"#$%&()*+,-./:;<=>?@[\]^_`{|}~\t\n``,
    includes basic punctuation, tabs, and newlines.
lower: boolean. Whether to convert the input to lowercase.
split: str. Separator for word splitting.

Returns

A list of words (or tokens).