This is a simple website crawler which asks for a website link from the user to crawl and find specific data from the given website address.

Last update: Jan 10, 2022

Related tags

Web Crawling Website-Crawler-Python-

Overview

Website-Crawler-Python

This is a simple website crawler which asks for a website link from the user to crawl and find specific data from the given website address. After getting the website address, it asks for how much crawling depth the user wants in between the number of links has been found after providing the website address.

Website Crawler takes 3 inputs:

A website address
Integer value for the crawling depth
A user specified regular expression to find user specific data

General tasks:

Find all the Nowgegian mobile numbers and saves into a text file.
Find all the sub-links inside the given website and saves into a text file.
Saves the website's raw HTML code into a text file.
Find all email addresses and save into a text file.
Find all the comments used in the website and saves it into a text file.
Find five most used words and print it into the terminal.

This is a Python based project and used some dependent libraries to execute the functionalities.

RegEx
Urllib3
BeautifulSoup 4
Counter in Collections

This is a simple website crawler which asks for a website link from the user to crawl and find specific data from the given website address.

Related tags

Overview

Website-Crawler-Python

Owner

Faisal Ahmed

抢京东茅台脚本，定时自动触发，自动预约，自动停止

This is a script that scrapes the longitude and latitude on food.grab.com

🕷 Phone Crawler with multi-thread functionality

京东茅台抢购最新优化版本，京东茅台秒杀，优化了茅台抢购进程队列

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster

A scrapy pipeline that provides an easy way to store files and images using various folder structures.

Web-Scrapper using Python and Flask

Dex-scrapper - Hobby project for scrapping dex data on VeChain

Grab the changelog from releases on Github

Minimal set of tools to conduct stealthy scraping.

A simple Discord scraper for discord bots

中国大学生在线四史自动答题刷分(现仅支持英雄篇)

Web-scraping - Program that scrapes a website for a collection of quotes, picks one at random and displays it

This project was created using Python technology and flask tools to scrape a music site

Scraping Thailand COVID-19 data from the DDC's tableau dashboard

Screen scraping and web crawling framework

Discord webhook spammer with proxy support and proxy scraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

一款利用Python来自动获取QQ音乐上某个歌手所有歌曲歌词的爬虫软件

A Very simple free proxy list scraper.

This is a simple website crawler which asks for a website link from the user to crawl and find specific data from the given website address.

Related tags

Overview

Website-Crawler-Python

Owner

Faisal Ahmed

抢京东茅台脚本，定时自动触发，自动预约，自动停止

This is a script that scrapes the longitude and latitude on food.grab.com

🕷 Phone Crawler with multi-thread functionality

京东茅台抢购最新优化版本，京东茅台秒杀，优化了茅台抢购进程队列

This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster

A scrapy pipeline that provides an easy way to store files and images using various folder structures.

Web-Scrapper using Python and Flask

Dex-scrapper - Hobby project for scrapping dex data on VeChain

Grab the changelog from releases on Github

Minimal set of tools to conduct stealthy scraping.

A simple Discord scraper for discord bots

中国大学生在线 四史自动答题刷分(现仅支持英雄篇)

Web-scraping - Program that scrapes a website for a collection of quotes, picks one at random and displays it

This project was created using Python technology and flask tools to scrape a music site

Scraping Thailand COVID-19 data from the DDC's tableau dashboard

Screen scraping and web crawling framework

Discord webhook spammer with proxy support and proxy scraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

一款利用Python来自动获取QQ音乐上某个歌手所有歌曲歌词的爬虫软件

A Very simple free proxy list scraper.

中国大学生在线四史自动答题刷分(现仅支持英雄篇)