This is a web scraper, using Python framework Scrapy, built to extract data from the Deals of the Day section on Mercado Livre website.

Last update: Jan 12, 2022

Related tags

Web Crawling mercadolivre-scraper

Overview

Deals of the Day

This is a web scraper, using the Python framework Scrapy, built to extract data such as price and product name from the Deals of the Day section on Mercado Livre website.

What Data Do We Want to Scrape?

Product Name
Original Price
Current Price
Product Url
Data Extraction Date

Note: The scraper handles pagination and extracts the aforementioned data throughout the entire Deals of the Day section.

💻 Requirements

Before you start, please check if you have met these few basic requirements:

Installed the latest stable python version (Python 3.7 or later).
Created a virtual enviroment to run the ScraPy framework on your machine.
Installed Scrapy 1.6 or a later stable version.

Note: It is strongly recommended that you install Scrapy in a dedicated virtualenv, to avoid conflicting with your system packages.

Getting Started

From terminal

Create an Enviroment:

mkdir virtual-enviroments
$ cd virtual-enviroments
$ python3 -m venv venv

Activate it:
Linux/macOS

$ source venv/bin/activate

Install the Scrapy framework:

$ pip install Scrapy

🚀 How to Use:

Clone this repository into your workspace:

$ git clone https://github.com/david-adds/mercadolivre-scraper.git

Once you have cloned the repository, open it up so you can run the scraper.

$ cd mercadolivre-scraper

Then, run the spider to scrape the data:

$ scrapy crawl deals

This is a web scraper, using Python framework Scrapy, built to extract data from the Deals of the Day section on Mercado Livre website.

Related tags

Overview

Deals of the Day

What Data Do We Want to Scrape?

Note: The scraper handles pagination and extracts the aforementioned data throughout the entire Deals of the Day section.

💻 Requirements

Note: It is strongly recommended that you install Scrapy in a dedicated virtualenv, to avoid conflicting with your system packages.

Getting Started

🚀 How to Use:

Owner

David Souza

Google Developer Profile Badge Scraper

CRI Scrape is a tool for get general info about Italian Red Cross in GAIA Platform

Twitter Eye is a Twitter Information Gathering Tool With Twitter Eye

fork huanghyw/jd_seckill

Automatically download and crop key information from the arxiv daily paper.

京东茅台抢购

Python based Web Scraper which can discover javascript files and parse them for juicy information (API keys, IP's, Hidden Paths etc)

A scrapy pipeline that provides an easy way to store files and images using various folder structures.

Libextract: extract data from websites

Python script to check if there is any differences in responses of an application when the request comes from a search engine's crawler.

A Python web scraper to scrape latest posts from official Coinbase's Blog.

Automatically scrapes all menu items from the Taco Bell website

Comment Webpage Screenshot is a GitHub Action that captures screenshots of web pages and HTML files located in the repository

A tool for scraping and organizing data from NewsBank API searches

A python module to parse the Open Graph Protocol

Scraping web pages to get data

Example of scraping a paginated API endpoint and dumping the data into a DB

A Python module to bypass Cloudflare's anti-bot page.

A database scraper created with mechanical soup and sqlite

Web scrapping