Skip to content

pigeonburger/theonion-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

The Onion Article Scraper

Scrapes all articles and their headlines from the satirical news website https://www.theonion.com

Also see Clickhole Article Scraper

Requirements:

  • Python 3.6 or higher
  • pip install beautifulsoup4
  • pip install requests

This script writes all the articles and their headlines to the file theonioncontent.txt. The start of each article is denoted by <|startoftext|> and the end by <|endoftext|>.

To run, simply download the file, install the above requirements and then run the following command:

python scraper.py

The program will display its progress as it scrapes each article.

About

Scrapes all articles and their headlines from theonion.com

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages