This is a web scraper, built with the Python framework Scrapy, that extracts data from the Deals of the Day section of the Mercado Livre website.

Last update: Jan 12, 2022


Overview

Deals of the Day

This is a web scraper, built with the Python framework Scrapy, that extracts data such as price and product name from the Deals of the Day section of the Mercado Livre website.

What Data Do We Want to Scrape?

Product Name
Original Price
Current Price
Product URL
Data Extraction Date
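
As a rough illustration of the fields listed above, one scraped record could be modelled as a small Python data class. This is only a sketch: the class name `Deal`, the field names, the types, and the example values are assumptions made for illustration, not the names used in this repository.

```python
from dataclasses import dataclass, asdict
from datetime import date


@dataclass
class Deal:
    """One scraped deal (hypothetical field names and types)."""
    product_name: str
    original_price: float   # price before the discount
    current_price: float    # discounted price shown in the deal
    product_url: str
    extraction_date: date   # day on which the data was scraped


# Made-up example values, only to show the shape of a record.
example = Deal(
    product_name="Example product",
    original_price=200.0,
    current_price=150.0,
    product_url="https://example.com/product",
    extraction_date=date(2022, 1, 12),
)

# Convert the record to a plain dict, e.g. before writing it to JSON or CSV.
record = asdict(example)
print(record)
```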

Note: The scraper handles pagination and extracts the aforementioned data throughout the entire Deals of the Day section.
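
The pagination idea can be sketched in plain Python: keep following the "next page" link until there is none, collecting the items from each page along the way. The function `crawl_all_pages`, the `fetch` callback, and the fake three-page site below are all hypothetical and exist only to make the sketch runnable. In a Scrapy spider the same pattern is usually written by yielding `response.follow(next_page, callback=self.parse)` from the parse callback, and the spider in this repository may do it differently.

```python
from typing import Callable, Dict, Iterator, List, Optional, Tuple

# A "page" is modelled as (items_on_page, next_page_url_or_None).
Page = Tuple[List[str], Optional[str]]


def crawl_all_pages(start_url: str, fetch: Callable[[str], Page]) -> Iterator[str]:
    """Yield every item, following next-page links until there are none."""
    url: Optional[str] = start_url
    seen = set()
    while url is not None and url not in seen:
        seen.add(url)                 # guard against a page that links back to itself
        items, next_url = fetch(url)  # hypothetical fetch-and-parse step
        yield from items
        url = next_url


# Fake three-page site standing in for the real section (assumption).
FAKE_SITE: Dict[str, Page] = {
    "page-1": (["deal A", "deal B"], "page-2"),
    "page-2": (["deal C"], "page-3"),
    "page-3": (["deal D"], None),
}

all_deals = list(crawl_all_pages("page-1", FAKE_SITE.__getitem__))
print(all_deals)  # ['deal A', 'deal B', 'deal C', 'deal D']
```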

💻 Requirements

Before you start, make sure you have:

Installed a stable Python version (Python 3.7 or later).
Created a virtual environment in which to run the Scrapy framework.
Installed Scrapy 1.6 or a later stable version.

Note: It is strongly recommended that you install Scrapy in a dedicated virtualenv, to avoid conflicting with your system packages.
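
If you want to check these requirements from Python, a small script along the following lines could help. It is a minimal sketch, not part of this repository: the helper names `version_tuple`, `in_virtual_env`, and `check_requirements` are made up for illustration, and the virtual-environment test assumes an environment created with `python3 -m venv`.

```python
import sys
from itertools import takewhile

MIN_PYTHON = (3, 7)   # from the requirements above
MIN_SCRAPY = (1, 6)   # from the requirements above


def version_tuple(text):
    """Turn a version string such as '2.11.0' into (2, 11)."""
    parts = []
    for piece in text.split(".")[:2]:
        digits = "".join(takewhile(str.isdigit, piece))  # leading digits only
        parts.append(int(digits) if digits else 0)
    return tuple(parts)


def in_virtual_env():
    """Inside a venv, sys.prefix differs from sys.base_prefix."""
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def check_requirements():
    """Return a list of human-readable problems (empty if none were found)."""
    problems = []
    if sys.version_info[:2] < MIN_PYTHON:
        problems.append("Python 3.7 or later is required")
    if not in_virtual_env():
        problems.append("not running inside a virtual environment")
    try:
        import scrapy
    except ImportError:
        problems.append("Scrapy is not installed in this environment")
    else:
        if version_tuple(scrapy.__version__) < MIN_SCRAPY:
            problems.append("Scrapy 1.6 or later is required")
    return problems


if __name__ == "__main__":
    for line in check_requirements() or ["all requirements look satisfied"]:
        print(line)
```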

Getting Started

From terminal

Create an environment:

$ mkdir virtual-environments
$ cd virtual-environments
$ python3 -m venv venv

Activate it:
Linux/macOS

$ source venv/bin/activate

Install the Scrapy framework:

$ pip install Scrapy

🚀 How to Use:

Clone this repository into your workspace:

$ git clone https://github.com/david-adds/mercadolivre-scraper.git

Once you have cloned the repository, change into its directory so you can run the scraper:

$ cd mercadolivre-scraper

Then, run the spider to scrape the data:

$ scrapy crawl deals
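
Once the spider has run, the scraped records can be post-processed in Python. The sketch below assumes the results were exported to a JSON feed (for example with Scrapy's `-o` option, as in `scrapy crawl deals -o deals.json`), that each record uses the hypothetical keys `product_name`, `original_price`, and `current_price`, and that prices are stored as numbers. None of this is confirmed by the repository, so adjust the names to match the actual output.

```python
import json


def discount_percent(original_price, current_price):
    """Percentage saved relative to the original price, rounded to one decimal."""
    if original_price <= 0:
        return 0.0  # avoid dividing by zero on malformed records
    return round((original_price - current_price) / original_price * 100, 1)


def summarize(feed_text):
    """Return (product name, discount %) pairs, biggest discount first."""
    rows = json.loads(feed_text)
    pairs = [
        (row["product_name"],
         discount_percent(row["original_price"], row["current_price"]))
        for row in rows
    ]
    return sorted(pairs, key=lambda pair: pair[1], reverse=True)


# Made-up sample standing in for the contents of an exported feed file.
sample_feed = json.dumps([
    {"product_name": "Example B", "original_price": 100.0, "current_price": 90.0},
    {"product_name": "Example A", "original_price": 200.0, "current_price": 150.0},
])

summary = summarize(sample_feed)
print(summary)  # [('Example A', 25.0), ('Example B', 10.0)]
```

With a real export, the sample string would be replaced by the text read from the feed file.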

Owner

David Souza
