tokenize

tokenize(text)

Lowercase, extract words of 3+ chars, filter stop words.