Scrapping the data from each page of biocides listed on the BAUA website into a csv file

Last update: Nov 30, 2021

Related tags

Overview

Baua Biocides Scraper

Scrapping the data from each page of biocides listed on the BAUA website (https://www.baua.de/DE/Biozid-Meldeverordnung/Offen/offen.html) into a csv file.
A windows standalone client is avalaible in the dist folder

About the project

What's the problem?

Baua website contains many usefull data for biocides domain, but the website only allows you to search product by product and it is not easy to find and get some informations with over 80,000 products listed

The idea

Facilitate the data manipulation with providing a csv file with all data scraped from Baua website.

How does it work ?

The user start the program.
The program extract data from Baua website.
A csv file containing data are created.

Roadmap

This project was created after a request and is not intended to evolve. Nevertheless you can fork the project to improve it by yourself and propose them via the project pull requests. or make a suggestion via the project issues.

Build with

Programming language : Python 3.10.0
Scraping Framework : Scrapy 2.5.1
HTTP library : Requests 2.26.0
Standalone Builder : PyInstaller 4.7

Demo

You can use the windows standalone client in the dist folder

Version management

We use a semantic version management, that is a version number MAJOR.MINOR.CORRECTIVE :

the MAJOR version number when there are non backward compatible changes,
the MINOR version number when there are backward compatible feature additions,
the FIX version number when there are backwards compatible bug fixes.

See SignMail tags For more info: semver.org

Authors

Eric De Maria - Numio - Initial work

License

This project is licensed under the GNU GPL 3 license - See the LICENSE file for more details.

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

Instagram_scrapper This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or exce

5 Oct 17, 2022

Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX)

mcc-mnc.com-webscraper Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX) A Python script for web scraping mcc-mnc.com Link: mcc

1 Nov 7, 2021

AssistScraper - program for /r/nba to use to find list of all players a player assisted and how many assists each player recieved

5 Nov 25, 2021

EBay-email-tracker - Scapes an entire search page of a particular item on eBay and sends regular updates to an email address

Introduction This is a project I built with the sole intent to learn more about

1 Jan 14, 2022

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

Web-Scrapping-1 An application that on a given url, crowls a web page and gets all words, sorts and counts them. Installation Using the package manage

1 Jan 16, 2022

Releases(v0.1.0)

v0.1.0(Nov 30, 2021)

The windows standalone client for the first public version of Baua Biocides Scraper
Source code(tar.gz)
Source code(zip)
Baua_Biocides_Scraper_Windows.zip(16.02 MB)

Scrapping the data from each page of biocides listed on the BAUA website into a csv file

Related tags

Overview

Baua Biocides Scraper

About the project

What's the problem?

The idea

How does it work ?

Roadmap

Build with

Demo

Version management

Authors

License

You might also like...

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX)

AssistScraper - program for /r/nba to use to find list of all players a player assisted and how many assists each player recieved

A Python module to bypass Cloudflare's anti-bot page.

Screenhook is a script that captures an image of a web page and send it to a discord webhook.

A Python module to bypass Cloudflare's anti-bot page.

Python script who crawl first shodan page and check DBLTEK vulnerability

EBay-email-tracker - Scapes an entire search page of a particular item on eBay and sends regular updates to an email address

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

Releases(v0.1.0)

v0.1.0(Nov 30, 2021)

Owner

Eric DE MARIA

Create crawler get some new products with maximum discount in banimode website

Find thumbnails and original images from URL or HTML file.

feapder 是一款简单、快速、轻量级的爬虫框架。以开发快速、抓取快速、使用简单、功能强大为宗旨。支持分布式爬虫、批次爬虫、多模板爬虫，以及完善的爬虫报警机制。

Facebook Group Scraping Using Beautiful Soup & Selenium

a small library for extracting rich content from urls

A web Scraper for CSrankings.com that scrapes University and Faculty list for a particular country

TarkovScrappy - A nifty little bot that lets you know if a queried item might be required for a quest at some point in the land of Tarkov!

Scrap-mtg-top-8 - A top 8 mtg scraper using python

Html Content / Article Extractor, web scrapping lib in Python

Telegram Group Scrapper

A Web Scraping Program.

A simple app to scrap data from Twitter.

Google Developer Profile Badge Scraper

This program scrapes information and images for movies and TV shows.

Web crawling framework based on asyncio.

Bigdata - This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster

Scrapes all articles and their headlines from theonion.com

A command-line program to download media, like and unlike posts, and more from creators on OnlyFans.

A database scraper created with mechanical soup and sqlite

12306抢票脚本