Parse feeds in Python

Last update: Dec 30, 2022

Related tags

Web Crawling feedparser

Overview

feedparser - Parse Atom and RSS feeds in Python.

feedparser is open source. See the LICENSE file for more information.

Installation

feedparser can be installed by running pip:

$ pip install feedparser

Documentation

The feedparser documentation is available on the web at:

https://feedparser.readthedocs.io/en/latest/

It is also included in its source format, ReST, in the docs/ directory. To build the documentation you'll need the Sphinx package, which is available at:

https://www.sphinx-doc.org/

You can then build HTML pages using a command similar to:

$ sphinx-build -b html docs/ fpdocs

This will produce HTML documentation in the fpdocs/ directory.

Testing

Feedparser has an extensive test suite, powered by tox. To run it, type this:

$ python -m venv venv
$ source venv/bin/activate  # or "venv\bin\activate.ps1" on Windows
(venv) $ python -m pip install --upgrade pip
(venv) $ python -m pip install poetry
(venv) $ poetry update
(venv) $ tox

This will spawn an HTTP server that will listen on port 8097. The tests will fail if that port is in use.

Parse feeds in Python

Related tags

Overview

Installation

Documentation

Testing

Owner

Kurt McKee

Web3 Pancakeswap Sniper bot written in python3

SkyScrapers: A collection of variety of Scraping Apps

WebScraper - A script that prints out a list of all EXTERNAL references in the HTML response to an HTTP/S request

Telegram group scraper tool

This code will be able to scrape movies from a movie website and also provide download links to newly uploaded movies.

Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.

A Happy and lightweight Python Package that searches Google News RSS Feed and returns a usable JSON response and scrap complete article - No need to write scrappers for articles fetching anymore

A powerful annex BUBT, BUBT Soft, and BUBT website scraping script.

Google Maps crawler using Selenium

A repository with scraping code and soccer dataset from understat.com.

High available distributed ip proxy pool, powerd by Scrapy and Redis

A Python Covid-19 cases tracker that scrapes data off the web and presents the number of Cases, Recovered Cases, and Deaths that occurred because of the pandemic.

Scrapes Every Email Address of Every Society in Every University

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

Screenhook is a script that captures an image of a web page and send it to a discord webhook.

Automatically download and crop key information from the arxiv daily paper.

WebScraping - Scrapes Job website for python developer jobs and exports the data to a csv file

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

A scrapy pipeline that provides an easy way to store files and images using various folder structures.