Scraping the data from each page of biocides listed on the BAuA website into a CSV file

Last update: Nov 30, 2021

Baua Biocides Scraper

Scrapes the data from each biocide page listed on the BAuA website (https://www.baua.de/DE/Biozid-Meldeverordnung/Offen/offen.html) into a CSV file.
A Windows standalone client is available in the dist folder.

About the project

What's the problem?

The BAuA website contains a lot of useful data for the biocides domain, but it only lets you search product by product. With over 80,000 products listed, finding and collecting information is not easy.

The idea

Make the data easier to work with by providing a CSV file containing all the data scraped from the BAuA website.

How does it work?

The user starts the program.
The program extracts the data from the BAuA website.
A CSV file containing the data is created.
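
The extract-then-export steps above can be sketched as follows. This is a minimal illustration, not the project's actual code: the project itself is built on Scrapy, whereas this sketch uses only the Python standard library so that it runs without a network connection. The HTML layout (label/value table rows), the field names, and the helper names `ProductPageParser`, `parse_product`, `write_csv` and `scrape_to_csv_text` are all assumptions made for the example.

```python
import csv
import io
from html.parser import HTMLParser


class ProductPageParser(HTMLParser):
    """Collect label/value pairs from <tr><th>label</th><td>value</td></tr> rows.

    This row layout is a hypothetical stand-in, not the real BAuA markup.
    """

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._cell = None  # type of the cell currently open: "th", "td" or None
        self._label = ""
        self._value = ""

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            # A new row starts: reset the pending label and value.
            self._label, self._value = "", ""
        elif tag in ("th", "td"):
            self._cell = tag

    def handle_data(self, data):
        if self._cell == "th":
            self._label += data
        elif self._cell == "td":
            self._value += data

    def handle_endtag(self, tag):
        if tag in ("th", "td"):
            self._cell = None
        elif tag == "tr" and self._label.strip():
            # The row is complete: store it as one field of the product.
            self.fields[self._label.strip()] = self._value.strip()


def parse_product(html):
    """Turn one product page into a dict of field name -> value."""
    parser = ProductPageParser()
    parser.feed(html)
    parser.close()
    return parser.fields


def write_csv(products, out):
    """Write the product dicts to `out`, one row per product."""
    # Use the union of all field names, in first-seen order, as the header.
    fieldnames = []
    for product in products:
        for key in product:
            if key not in fieldnames:
                fieldnames.append(key)
    writer = csv.DictWriter(out, fieldnames=fieldnames, restval="")
    writer.writeheader()
    writer.writerows(products)


# Invented sample pages standing in for downloaded product pages.
SAMPLE_PAGES = [
    "<table><tr><th>Name</th><td>Example Product A</td></tr>"
    "<tr><th>Registration number</th><td>N-00001</td></tr></table>",
    "<table><tr><th>Name</th><td>Example Product B</td></tr>"
    "<tr><th>Registration number</th><td>N-00002</td></tr></table>",
]


def scrape_to_csv_text(pages):
    """Parse every page and return the resulting CSV as a string."""
    products = [parse_product(page) for page in pages]
    buffer = io.StringIO(newline="")
    write_csv(products, buffer)
    return buffer.getvalue()


if __name__ == "__main__":
    print(scrape_to_csv_text(SAMPLE_PAGES))
```

In a real run, the sample pages would be replaced by the pages downloaded from the website, and the output would be written to a file opened with `newline=""` instead of an in-memory buffer.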

Roadmap

This project was created in response to a request and is not intended to evolve. Nevertheless, you can fork the project to improve it yourself and propose your changes via pull requests, or make a suggestion via the project issues.

Built with

Programming language: Python 3.10.0
Scraping framework: Scrapy 2.5.1
HTTP library: Requests 2.26.0
Standalone builder: PyInstaller 4.7

Demo

You can use the Windows standalone client in the dist folder.

Version management

We use semantic versioning, that is, a version number of the form MAJOR.MINOR.FIX:

the MAJOR version number is incremented when there are non-backward-compatible changes,
the MINOR version number is incremented when there are backward-compatible feature additions,
the FIX version number is incremented when there are backward-compatible bug fixes.

See SignMail tags. For more info: semver.org
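
As a rough illustration of the numbering rules above, the sketch below parses a version string and increments one of its parts. The `Version` class and its `parse` and `bump` methods are hypothetical names made up for this example. They are not part of the project, and the sketch ignores the pre-release and build suffixes that semver.org also defines.

```python
from typing import NamedTuple


class Version(NamedTuple):
    """A three-part version number; tuple ordering gives version ordering."""

    major: int
    minor: int
    fix: int

    @classmethod
    def parse(cls, text):
        # Accept both "0.1.0" and a tag-style "v0.1.0".
        major, minor, fix = (int(part) for part in text.lstrip("v").split("."))
        return cls(major, minor, fix)

    def bump(self, part):
        # Incrementing a part resets every less significant part to zero.
        if part == "major":
            return Version(self.major + 1, 0, 0)
        if part == "minor":
            return Version(self.major, self.minor + 1, 0)
        if part == "fix":
            return Version(self.major, self.minor, self.fix + 1)
        raise ValueError(f"unknown version part: {part!r}")

    def __str__(self):
        return f"{self.major}.{self.minor}.{self.fix}"


if __name__ == "__main__":
    current = Version.parse("v0.1.0")
    print(current.bump("fix"))    # backward-compatible bug fix
    print(current.bump("minor"))  # backward-compatible feature addition
    print(current.bump("major"))  # non-backward-compatible change
```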

Authors

Eric De Maria - Numio - Initial work

License

This project is licensed under the GNU GPL 3 license - See the LICENSE file for more details.

Releases(v0.1.0)

v0.1.0(Nov 30, 2021)

The Windows standalone client for the first public version of Baua Biocides Scraper
Source code(tar.gz)
Source code(zip)
Baua_Biocides_Scraper_Windows.zip(16.02 MB)
