Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

Last update: Jan 02, 2022

Overview

NewsScraper

A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

🔧 Installation

Clone the repo locally.
Use the package manager pip to install the requirements.

pip install -r requirements.txt

✨ Basic Usage

import NewsScraper

all_data = NewsScraper.fetch_all()
news_data = NewsScraper.fetch_news_data()
crypto_data = NewsScraper.fetch_crypto_data()

fetch_all()

Returns a set of NewsScraper.Result containing fetched results from all available RSS feeds

Can include categories: GLOBAL, US, EU, CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

fetch_news_data()

Returns a set of NewsScraper.Result containing fetched results from CNN, ABC News, Yahoo News, Fox News RSS feeds

Can include categories: GLOBAL, US, EU.

fetch_crypto_data()

Returns a set of NewsScraper.Result containing fetched results from CoinJournal, Crypto Currency News RSS feeds.

Can include categories: CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

🔨 Advanced Usage

NewsScraper.Result class

A class used to represent a returned article.

Attributes

context : str

A string describing the category of the article.

ex. "GLOBAL", "US", "BLOCKCHAIN", "BTC".
title : str

A string containing the name of the article.
summary : str

A string containing the summary of the article.

NOTE: sometimes it can have the value of "", because the RSS feed didn't provide a summary.
content : str

A string containing the content of the article.

Methods

Result.json()

Returns a dictionary with the attributes of the class formatted in JSON.

ex.

{
  "context": "global",
  "title": "title of the article",
  "summary": "summary of the article",
  "content": "content of the article"
}

News RSS Feeds

All of these functions return a set of NewsScraper.Result containing fetched results of the described RSS feeds.

fetch_abc()
fetch_cnn()
fetch_yahoo()
fetch_fox_news()

Can include categories: GLOBAL, US, EU.

Alternatively, you can use fetch_news_data() to receive results from all of them.

Crypto RSS Feeds

All of these functions return a set of NewsScraper.Result containing fetched results of the described RSS feeds.

fetch_coinjournal()
fetch_cryptocurrencynews()

Can include categories: CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

Alternatively, you can use fetch_news_data() to receive results from all of them.

🤝 Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

📝 License

This project is licensed under the MIT license.

Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

Related tags

Overview

NewsScraper

🔧 Installation

✨ Basic Usage

🔨 Advanced Usage

NewsScraper.Result class

context : str

title : str

summary : str

content : str

Result.json()

News RSS Feeds

Crypto RSS Feeds

🤝 Contributing

📝 License

Owner

Rokas

Web Scraping Practica With Python

Crawl BookCorpus

A crawler of doubamovie

Current Antarctic large iceberg positions derived from ASCAT and OSCAT-2

Bulk download tool for the MyMedia platform

Python web scrapper

Python scraper to check for earlier appointments in Clalit Health Services

Pyrics is a tool to scrape lyrics, get rhymes, generate relevant lyrics with rhymes.

A python tool to scrape NFT's off of OpenSea

Web-Scraping using Selenium Master

A Python web scraper to scrape latest posts from official Coinbase's Blog.

News, full-text, and article metadata extraction in Python 3. Advanced docs:

WebScraper - A script that prints out a list of all EXTERNAL references in the HTML response to an HTTP/S request

A high-level distributed crawling framework.

Scrap the 42 Intranet's elearning videos in a single click

Creating Scrapy scrapers via the Django admin interface

A tool can scrape product in aliexpress: Title, Price, and URL Product.

A simple python web scraper.

Scrapping the data from each page of biocides listed on the BAUA website into a csv file

Get-web-images - A python code that get images from any site