Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

Last update: Dec 23, 2021

Overview

Agroforestry Species Switchboard 2.0 Scraper

Scrape plants scientific name information from Species Switchboard 2.0.

Requirements

python >= 3.10 (you can use pyenv for easier python version management)
pipenv

How to run

Install dependencies

cp env.sample .env
pipenv --python 3
pipenv install

Run
```
pipenv run python main.py
```
The result will be placed in a file named result.*.csv

Test Shell

pipenv run scrapy shell 'http://apps.worldagroforestry.org/products/switchboard/index.php/species_search/Acacia%20abyssinica'

Cleanup All Outputs

rm result.* && rm log.*

Special Cases

Case	Link	Note
ICRAF Databases Not Found	Engelhardia spicata
Genus Found	Forficula	What to do next?
Multiple Species Found	Alstonia spectabilis	Get the matched species right?
Species Variant Found	Engelhardtia spicata	Need human to check
Similar Species Found	Costus speciosus	Need human to check

Contributing

Fork this repo
Develop
Create pull request
Tag @rizqirizqi for review
Merge~~

License

GPL-3.0

Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

Related tags

Overview

Agroforestry Species Switchboard 2.0 Scraper

Requirements

How to run

Test Shell

Cleanup All Outputs

Special Cases

Contributing

License

Owner

Mgs. M. Rizqi Fadhlurrahman

a way to scrape a database of all of the isef projects

12306抢票脚本

Crawler job that scrapes comments from social media posts and saves them in a S3 bucket.

Telegram Group Scrapper

ChromiumJniGenerator - Jni Generator module extracted from Chromium project

Unja is a fast & light tool for fetching known URLs from Wayback Machine

This is a script that scrapes the longitude and latitude on food.grab.com

Google Developer Profile Badge Scraper

Crawl BookCorpus

Subscrape - A Python scraper for substrate chains

The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker.

This code will be able to scrape movies from a movie website and also provide download links to newly uploaded movies.

Works very well and you can ask for the type of image you want the scrapper to collect.

Libextract: extract data from websites

A distributed crawler for weibo, building with celery and requests.

UsernameScraperTool - Username Scraper Tool With Python

Rottentomatoes, Goodreads and IMDB sites crawler. Semantic Web final project.

:arrow_double_down: Dumb downloader that scrapes the web

PyQuery-based scraping micro-framework.

Google Maps crawler using Selenium