...

We gathered the 30,000 most common words for each of ~120 languages, drawing on either Wikipedia or the Leipzig corpus. These word lists are available in the common_tokens directory.
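As a rough illustration of how these lists might be consumed, the sketch below reads every file in a common_tokens directory into a set of words. The file layout is an assumption, not something stated here: one file per language named by its language code, UTF-8 encoded, one word per line, with an optional tab-separated count and optional comment lines starting with "#". The function names parse_word_list and load_common_tokens are hypothetical.

```python
from pathlib import Path
from typing import Dict, Iterable, Set


def parse_word_list(lines: Iterable[str]) -> Set[str]:
    """Parse one word list (assumed format: one word per line)."""
    words = set()
    for line in lines:
        line = line.strip()
        # Skip blank lines and, by assumption, "#" comment lines.
        if not line or line.startswith("#"):
            continue
        # Keep only the word if a tab-separated count follows it.
        words.add(line.split("\t", 1)[0])
    return words


def load_common_tokens(directory: str = "common_tokens") -> Dict[str, Set[str]]:
    """Map each file name (assumed to be a language code) to its word set."""
    lists = {}
    for path in sorted(Path(directory).iterdir()):
        if path.is_file():
            with path.open(encoding="utf-8") as handle:
                lists[path.name] = parse_word_list(handle)
    return lists
```

Calling load_common_tokens() from the directory that contains common_tokens would then return one set per language file, which can be used for fast membership checks. If the actual files use a different layout, only parse_word_list needs to change.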

...