Releases: AndyTheFactory/FakeNewsDataset

Initial release

25 Oct 08:00
17ed9f5

A consolidated and cleaned-up version of the opensources Fake News dataset, classified into 12 classes: reliable, unreliable, political, bias, fake, conspiracy, rumor, clickbait, junk science (labeled junksci), satire, hate, and unknown. The articles were scraped between the end of 2017 and the beginning of 2018 from various news websites, totaling 647 distinct sources.

The extracted file is 20 GB in size.

Label Nr Records
reliable 1,807,323
political 968,205
bias 769,874
fake 762,178
conspiracy 494,184
rumor 375,963
unknown 230,532
clickbait 174,176
unreliable 104,537
satire 84,735
junksci 79,099
hate 64,763
--- ----
total 5,915,569
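
The class sizes are heavily skewed, which matters when splitting or sampling the data. As a minimal sketch, the counts from the table can be checked against the stated total and turned into per-class shares. The variable names here are illustrative and not part of the release, and the political count is read as 968,205, the value that makes the rows add up to the stated total.

```python
# Record counts per label, copied from the table in this release note.
# "political" is read as 968,205 (assumption: the only reading that
# makes the rows sum to the stated total of 5,915,569).
counts = {
    "reliable": 1_807_323,
    "political": 968_205,
    "bias": 769_874,
    "fake": 762_178,
    "conspiracy": 494_184,
    "rumor": 375_963,
    "unknown": 230_532,
    "clickbait": 174_176,
    "unreliable": 104_537,
    "satire": 84_735,
    "junksci": 79_099,
    "hate": 64_763,
}

stated_total = 5_915_569
total = sum(counts.values())

# Share of the whole dataset held by each class.
shares = {label: n / total for label, n in counts.items()}

# Ratio between the largest and smallest class, a rough imbalance measure.
imbalance = max(counts.values()) / min(counts.values())

for label, n in counts.items():
    print(f"{label:<12}{n:>12,}{shares[label]:>8.1%}")
print(f"{'total':<12}{total:>12,}")
print(f"matches stated total: {total == stated_total}")
print(f"largest/smallest class ratio: {imbalance:.1f}")
```

Shares like these can be used, for example, to derive class weights or to decide how much to downsample the reliable class before training.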