23 Best Free NLP Datasets for Machine Learning
Iguazio
SEPTEMBER 20, 2023
The Blog Authorship Corpus: A dataset comprising more than 680,000 blog posts (over 140 million words) from more than 19,000 bloggers, gathered in August 2004. … The dataset includes nearly 7 million reviews, more than 150,000 businesses, more than 900,000 tips by nearly 2 million, and more than 1.2 … Get the dataset here.