Crawler do site Fundamentus.com com o uso do framework scrapy, tanto da aba detalhada como a de resumo.

Last update: Oct 04, 2022

Overview

Fundamentus com framework scrapy

Crawler do site Fundamentus.com com o uso do framework scrapy, tanto da aba detalhada como a de resumo.

Baixa informacões que os outros scrapys do fundamentus não realizam.

Para iniciar, dentro da pasta fundamentus digite: scrapy crawl detalhes -O nomedoarquivocriado.csv ou scrapy crawl resultado -O nomedoarquivocriado.csv

Não é um codigo elegante, mas funcional, realiza o scrapy de forma rapida.

As informacões baixadas são:

       columns = ['Papel', 'Cotação', 'Tipo', 'Data ult cot', 'Empresa', 'Min 52 sem',
                  'Setor', 'Max 52 sem', 'Subsetor', 'Vol $ méd (2m)', 'Valor de mercado',
                  'Últ balanço processado', 'Valor da firma', 'Nro. Ações',

                  'Dia', 'P/L',
                  'LPA', 'Mês', 'P/VP', 'VPA', '30 dias', 'P/EBIT', 'Marg. Bruta',
                  '12 meses', 'PSR', 'Marg. EBIT', '2021', 'P/Ativos', 'Marg. Líquida',
                  '2020', 'P/Cap. Giro', 'EBIT / Ativo', '2019', 'P/Ativ Circ Liq',
                  'ROIC', '2018', 'Div. Yield', 'ROE', '2017', 'EV / EBITDA',
                  'Liquidez Corr', '2016', 'EV / EBIT', 'Div Br/ Patrim', '2015',
                  'Cres. Rec (5a)', 'Giro Ativos',

                  'Ativo',
                  'Dív. Bruta',
                  'Disponibilidades',
                  'Dív. Líquida',
                  'Ativo Circulante',               
                  'Depósitos',
                  'Cart. de Crédito',
                  'Patrim. Líq',

                  'Receita Líquida_12meses',         
                  'Receita Líquida_3meses', 'EBIT_12meses', 'EBIT_3meses',
                  'Lucro Líquido_12meses', 'Lucro Líquido_3meses']
                  
                  e mais algumas informações...

Realizei este projeto com o fim de aprendizado e por não encontrar no github nenhum scrapy que pegue todas as informaçoes que eu precisava como setores e subsetores para realizar modelos KNN e KMC de machine learning.

Crawler do site Fundamentus.com com o uso do framework scrapy, tanto da aba detalhada como a de resumo.

Related tags

Overview

Fundamentus com framework scrapy

Owner

Guilherme Silva Uchoa

This repo has the source code for the crawler and data crawled from auto-data.net

An IpVanish Proxies Scraper

Web-Scrapper using Python and Flask

Python script that reads Aliexpress offers urls from a Excel filename (.csv) and post then in a Telegram channel using a bot

WebScrapping Project - G1 Latest News

Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments)

一款利用Python来自动获取QQ音乐上某个歌手所有歌曲歌词的爬虫软件

This program will help you to properly scrape all data from a specific website

A tool to easily scrape youtube data using the Google API

A web scraping pipeline project that retrieves TV and movie data from two sources, then transforms and stores data in a MySQL database.

腾讯课堂，模拟登陆，获取课程信息，视频下载，视频解密。

Nekopoi scraper using python3

京东茅台抢购最新优化版本，京东茅台秒杀，优化了茅台抢购进程队列

This script is intended to crawl license information of repositories through the GitHub API.

Minecraft Item Scraper

Ebay Webscraper for Getting Average Product Price

AssistScraper - program for /r/nba to use to find list of all players a player assisted and how many assists each player recieved

A webdriver-based script for reserving Tsinghua badminton courts.

Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

A Telegram crawler to search groups and channels automatically and collect any type of data from them.