Scrape scientific name information for plants from the Agroforestry Species Switchboard 2.0.

Last update: Dec 23, 2021

Overview

Agroforestry Species Switchboard 2.0 Scraper

Scrape scientific name information for plants from the Agroforestry Species Switchboard 2.0.

Requirements

Python >= 3.10 (pyenv can make managing Python versions easier)
pipenv

How to run

Install dependencies

cp env.sample .env
pipenv --python 3
pipenv install

Run
```
pipenv run python main.py
```
The result will be placed in a file named result.*.csv
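
To inspect the output after a run, a small helper along these lines could load the newest result file. This is only a sketch: `latest_result` and `read_rows` are hypothetical helper names, not part of this repo, and the sketch assumes the CSV has a header row (the actual column names are not documented here).

```python
import csv
from pathlib import Path


def latest_result(directory="."):
    """Return the most recently modified result.*.csv file, or None if there is none."""
    candidates = sorted(
        Path(directory).glob("result.*.csv"),
        key=lambda p: p.stat().st_mtime,
    )
    return candidates[-1] if candidates else None


def read_rows(path):
    """Read a CSV file into a list of dicts keyed by its header row (assumed present)."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


if __name__ == "__main__":
    path = latest_result()
    if path is None:
        print("No result.*.csv file found; run the scraper first.")
    else:
        print(f"{path}: {len(read_rows(path))} rows")
```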

Test Shell

pipenv run scrapy shell 'http://apps.worldagroforestry.org/products/switchboard/index.php/species_search/Acacia%20abyssinica'
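
The shell command above uses a URL with the species name percent-encoded at the end. To try other species, the URL could be built as in the following sketch. The function name `species_search_url` is hypothetical, and the URL pattern is inferred only from the single example above.

```python
from urllib.parse import quote

# Base path taken from the Test Shell example; assumed to be the same for every species.
BASE_URL = (
    "http://apps.worldagroforestry.org/products/switchboard/"
    "index.php/species_search/"
)


def species_search_url(scientific_name):
    """Build a species search URL, percent-encoding spaces as %20."""
    # Collapse repeated or surrounding whitespace before encoding.
    normalized = " ".join(scientific_name.split())
    return BASE_URL + quote(normalized)


if __name__ == "__main__":
    print(species_search_url("Acacia abyssinica"))
```

The printed URL can then be passed to `pipenv run scrapy shell` in single quotes.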

Cleanup All Outputs

rm -f result.* log.*

Special Cases

| Case | Link | Note |
| --- | --- | --- |
| ICRAF Databases Not Found | Engelhardia spicata | |
| Genus Found | Forficula | What to do next? |
| Multiple Species Found | Alstonia spectabilis | Get the matched species right? |
| Species Variant Found | Engelhardtia spicata | Need human to check |
| Similar Species Found | Costus speciosus | Need human to check |
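
One way to keep track of these cases when post-processing results is an enumeration plus a set of the cases noted as needing a human check. This is a sketch only: `SpecialCase` and `needs_human_check` are hypothetical names, and nothing here reflects how the scraper itself detects or handles these cases.

```python
from enum import Enum


class SpecialCase(Enum):
    """Special cases listed above (hypothetical labels for post-processing)."""

    DATABASES_NOT_FOUND = "ICRAF Databases Not Found"
    GENUS_FOUND = "Genus Found"
    MULTIPLE_SPECIES_FOUND = "Multiple Species Found"
    SPECIES_VARIANT_FOUND = "Species Variant Found"
    SIMILAR_SPECIES_FOUND = "Similar Species Found"


# Cases whose note above says "Need human to check".
NEEDS_HUMAN_CHECK = frozenset(
    {SpecialCase.SPECIES_VARIANT_FOUND, SpecialCase.SIMILAR_SPECIES_FOUND}
)


def needs_human_check(case):
    """Return True if the given case is flagged for manual review."""
    return case in NEEDS_HUMAN_CHECK


if __name__ == "__main__":
    for case in SpecialCase:
        flag = "manual review" if needs_human_check(case) else "open question or no note"
        print(f"{case.value}: {flag}")
```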

Contributing

Fork this repo
Develop
Create pull request
Tag @rizqirizqi for review
Merge

License

GPL-3.0


Owner

Mgs. M. Rizqi Fadhlurrahman
