👁️ Tool for Data Extraction and Web Requests.

Last update: Dec 05, 2021

Overview

httpmapper 👁️

Project • Technologies • Installation • How it works • License

Project 🚧

For educational purposes.

This is a project I developed: a web crawler that navigates the web, extracting source code, links, cookies and more. I also built it to learn more about requests and data extraction.
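As a rough illustration of the link-extraction idea, here is a minimal sketch that uses only the Python standard library. It is not httpmapper's own code: the names LinkCollector and extract_links are hypothetical, and the HTML is assumed to have been downloaded already.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Only anchor tags carry the links we care about here.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative paths such as "/about" into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return all links found in `html`, in document order (hypothetical helper)."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.links
```

Passing a page's HTML and its URL to extract_links returns a list of absolute URLs, which a crawler could then visit in turn.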

Technologies 🛠️

This project was developed with the following technologies:

Python

Installation 🚀

# clone the repository
git clone https://github.com/vLeeH/httpmapper

# enter the folder
cd httpmapper

# update the package lists
sudo apt update

# run
python install.py

How it works 🔧

# usage
python install.py

Example

# website for this example: https://github.com 

   python install.py 

   Choice: 5
   Website: https://github.com 

   [+] Cookie Name = _octo - Cookie Value = GH1.1.413278149.1633841686
   [+] Cookie Name = logged_in - Cookie Value = no
   [+] Cookie Name = _gh_sess - Cookie Value = ejqBvu%2BSIjM68y7f8niePF8U%2FyrwbGVoKc8iW6FWLil8%2BtsOtGcYSaxw52b%2FhCg%2F275eqHG18jSe4wZ7TFzvlD5Xx6tqvddoSy%2BEdOUlooL7gEpchhK1W8i0Y%2Fg1ARBhrK3saX43%2FjlBEMJX45km%2BPHf39gxk1fO8fc6ytX%2Fp7uX2F1z3hMIep76ooxirYuFzSwBefa3EZU5fZq2OQoV2is8xjiInY72lDSxErMjPKKS6%2B1cjUp9NW7bS5G63%2B9AJCPMwjpdg15qa8aulJ%2FLZg%3D%3D--qTvdBCfTpQiV75Hr--liTEY8bhr%2B0QHWrLVyJZ8w%3D%3D
   [-] Cookie extracter finished!

Note: The header variable must identify the browser being used (typically through its User-Agent string), so set it to match your browser.
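The cookie example above can be approximated with the standard library alone. The sketch below is not httpmapper's actual code: the function names fetch_cookies and format_cookies, the 10-second timeout and the default User-Agent value are assumptions made for illustration.

```python
import http.cookiejar
import urllib.request


def format_cookies(cookies):
    """Render (name, value) pairs in the output style shown in the example."""
    return [
        f"[+] Cookie Name = {name} - Cookie Value = {value}"
        for name, value in cookies
    ]


def fetch_cookies(url, user_agent="Mozilla/5.0"):
    """Request `url` and return the cookies the server set as (name, value) pairs.

    `user_agent` is a placeholder; replace it with your browser's real string.
    """
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    # The response body is not needed; the cookie jar is filled from the headers.
    with opener.open(request, timeout=10):
        pass
    return [(cookie.name, cookie.value) for cookie in jar]
```

Printing each line of format_cookies(fetch_cookies("https://github.com")) should produce output similar to the example, although the cookies a site sets can differ between requests.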

Contributing 🔨

How can I contribute to the project?

1. Fork the httpmapper repository.
2. git clone https://github.com/vLeeH/httpmapper.git
3. cd httpmapper/
4. Make your changes.
5. Commit and push your changes.
6. Open a pull request.

License 📝

This project is under the MIT License.

