Help protect the Great Barrier Reef with TensorFlow on Kaggle Join Challenge


Converts a text to a sequence of words (or tokens).

This function transforms a string of text into a list of words while ignoring filters which include punctuations by default.

sample_text = 'This is a sample sentence.'
['this', 'is', 'a', 'sample', 'sentence']

input_text Input text (string).
filters list (or concatenation) of characters to filter out, such as punctuation. Default: '!"#$%&()*+,-./:;<=>?@[\]^_`{|}~\t\n', includes basic punctuation, tabs, and newlines.
lower boolean. Whether to convert the input to lowercase.
split str. Separator for word splitting.

A list of words (or tokens).