A tool for scraping and organizing data from NewsBank API searches

Last update: Jun 17, 2021

Overview

nbscraper

Overview

This simple tool automates the process of copying, pasting, and organizing data from NewsBank API searches. Curerntly, nbscrape only searches print sources in the USA.

Requirements

Access to NewsBank (e.g. via your institution's library)
Python 3

Basic Usage

Call nbscrape function
- Arguments include "search", "date_from", and "date_to"
Output is a pandas dataframe, with all available metadata for each source

Disclaimer

This tool is to be used in compliance with terms of service outlined by your institution and NewsBank. As such, it is suggested that you use this tool for research purposes only, once you have settled on your final search terms. This is not an exploratory tool. The purpose of nbscraper is to alleviate the tedium of having to click through 50 pages one by one and to manually save sources' metadata.

Owner

GitHub Repository

A tool for scraping and organizing data from NewsBank API searches

Related tags

Overview

nbscraper

Overview

Requirements

Basic Usage

Disclaimer

Owner

Web Crawlers for Data Labelling of Malicious Domain Detection & IP Reputation Evaluation

Screenhook is a script that captures an image of a web page and send it to a discord webhook.

python+selenium实现的web端自动打卡 + 每日邮件发送 + 金山词霸每日一句 + 毒鸡汤（从2月份稳定运行至今）

A Simple Web Scraper made to Extract Download Links from Todaytvseries2.com

Web Scraping images using Selenium and Python

Scrapes Every Email Address of Every Society in Every University

Crawl BookCorpus

simple http & https proxy scraper and checker

Free-Game-Scraper is a useful script that allows you to track down free games and DLCs on many platforms.

This is a python api to scrape search results from a url.

A simple Discord scraper for discord bots

LSpider 一个为被动扫描器定制的前端爬虫

This was supposed to be a web scraping project, but somehow I've turned it into a spamming project

Web Scraping COVID 19 Meta Portal with Python

Example of scraping a paginated API endpoint and dumping the data into a DB

Amazon scraper using scrapy, a python framework for crawling websites.

Using Selenium with Python to Web Scrap Popular Youtube Tech Channels.

A pure-python HTML screen-scraping library

优化版本的京东茅台抢购神器

Open Crawl Vietnamese Text

A tool for scraping and organizing data from NewsBank API searches

Related tags

Overview

nbscraper

Overview

Requirements

Basic Usage

Disclaimer

Owner

Web Crawlers for Data Labelling of Malicious Domain Detection & IP Reputation Evaluation

Screenhook is a script that captures an image of a web page and send it to a discord webhook.

python+selenium实现的web端自动打卡 + 每日邮件发送 + 金山词霸 每日一句 + 毒鸡汤（从2月份稳定运行至今）

A Simple Web Scraper made to Extract Download Links from Todaytvseries2.com

Web Scraping images using Selenium and Python

Scrapes Every Email Address of Every Society in Every University

Crawl BookCorpus

simple http & https proxy scraper and checker

Free-Game-Scraper is a useful script that allows you to track down free games and DLCs on many platforms.

This is a python api to scrape search results from a url.

A simple Discord scraper for discord bots

LSpider 一个为被动扫描器定制的前端爬虫

This was supposed to be a web scraping project, but somehow I've turned it into a spamming project

Web Scraping COVID 19 Meta Portal with Python

Example of scraping a paginated API endpoint and dumping the data into a DB

Amazon scraper using scrapy, a python framework for crawling websites.

Using Selenium with Python to Web Scrap Popular Youtube Tech Channels.

A pure-python HTML screen-scraping library

优化版本的京东茅台抢购神器

Open Crawl Vietnamese Text

python+selenium实现的web端自动打卡 + 每日邮件发送 + 金山词霸每日一句 + 毒鸡汤（从2月份稳定运行至今）