DraganSr

web-links (blinks) web-log (blog) by Dragan Sretenovic

Thursday, February 04, 2021

Google 10000 English words

GitHub - first20hours/google-10000-english: This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of Google's Trillion Word Corpus.
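
A frequency-ordered list like this is easy to turn into a rank lookup. Here is a minimal sketch in Python, assuming the list is plain text with one word per line and the most frequent word first. The function name `build_rank_index`, the sample words, and the local file path in the comment are illustrative assumptions, not something taken from the repo.

```python
from typing import Dict, Iterable


def build_rank_index(lines: Iterable[str]) -> Dict[str, int]:
    """Map each word to its 1-based rank.

    Assumes one word per line, most frequent first (hypothetical layout).
    Blank lines and repeated words are skipped.
    """
    ranks: Dict[str, int] = {}
    for line in lines:
        word = line.strip().lower()
        if word and word not in ranks:
            ranks[word] = len(ranks) + 1
    return ranks


# Small stand-in sample; with a downloaded copy you could pass an open
# file instead, e.g. open("words.txt") (hypothetical local path).
sample_lines = ["the", "of", "and", "", "to", "the"]
ranks = build_rank_index(sample_lines)

# Look up a word's rank; words outside the list return None.
rank_of_and = ranks.get("and")
rank_of_missing = ranks.get("zyzzyva")
```

A dictionary keeps lookups constant-time, which helps when checking many words against the list, for example to flag uncommon vocabulary in a text.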

first20hours (Josh Kaufman) · GitHub

Natural Language Corpus Data: Beautiful Data

Posted by Dragan at 9:55 PM