Twitter Scraper

Last update: Dec 30, 2022

tweety

Twitter's API is annoying to work with and has lots of limitations. Luckily, their frontend (JavaScript) has its own API, which I reverse-engineered. No API rate limits. No restrictions. Extremely fast.

Prerequisites

Before you begin, ensure you have met the following requirements:

Internet Connection
Python 3.6+
BeautifulSoup (Python Module)
Requests (Python Module)

All Functions

get_tweets()
get_user_info()
get_trends() (can be used without username)
search() (can be used without username)
tweet_detail() (can be used without username)

Using tweety

Getting Tweets:

Description:

Get 20 Tweets of a Twitter User

Required Parameter:

Username or User profile URL while initiating the Twitter Object

Optional Parameter:

pages : int (default is 1, starts from 2) -> Get the given number of pages of tweets
include_extras : boolean (default is False) -> Get extras shown on the page, such as Topics

Output:

Type -> dictionary

Structure

    {
      "p-1" : {
        "result": {
            "tweets": []
        }
      },
      "p-2":{
        "result": {
            "tweets": []
        }
      }
    }

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> all_tweet = Twitter("Username or URL").get_tweets(pages=2)
>>> for i in all_tweet:
...   print(all_tweet[i])
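
Because the result is keyed by page ("p-1", "p-2", and so on), it can be convenient to flatten it into a single list. The following is a minimal sketch that works on any dictionary shaped like the structure above. The helper name `flatten_pages` and the sample data are hypothetical and are not part of tweety.

```python
def flatten_pages(pages):
    """Merge the "tweets" lists of every page into one list, in page order."""
    def page_number(key):
        # Keys look like "p-1", "p-2", ...; sort numerically so "p-10" follows "p-9".
        return int(key.split("-", 1)[1])

    tweets = []
    for key in sorted(pages, key=page_number):
        tweets.extend(pages[key]["result"]["tweets"])
    return tweets


# Hypothetical sample shaped like the documented output (tweet contents are made up).
sample = {
    "p-2": {"result": {"tweets": ["third"]}},
    "p-1": {"result": {"tweets": ["first", "second"]}},
}
print(flatten_pages(sample))  # ['first', 'second', 'third']
```

With real data, the same helper could be applied to the value returned by get_tweets(pages=2), assuming it follows the structure shown above.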

Getting Trends:

Description:

Get 20 Local Trends

Output:

Type -> dictionary

Structure

  {
    "trends":[
      {
        "name":"",
        "url":""
      },
      {
        "name":"",
        "url":""
      }
    ]
  }

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> trends = Twitter().get_trends()
>>> for i in trends['trends']:
...   print(i['name'])
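
If you need to look up a trend's URL by its name, the list under "trends" can be turned into a dictionary. This is a small sketch that assumes the structure documented above. The helper name `trends_by_name` and the sample values are hypothetical placeholders, not real output.

```python
def trends_by_name(trends):
    """Map each trend name to its URL."""
    return {item["name"]: item["url"] for item in trends["trends"]}


# Hypothetical sample shaped like the documented output (names and URLs are placeholders).
sample = {
    "trends": [
        {"name": "TrendA", "url": "https://example.com/a"},
        {"name": "TrendB", "url": "https://example.com/b"},
    ]
}
lookup = trends_by_name(sample)
print(lookup["TrendB"])  # https://example.com/b
```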

Searching a keyword:

Description:

Get 20 Tweets for a specific Keyword or Hashtag

Required Parameter:

keyword : str -> Keyword to search for

Optional Parameter:

latest : boolean (Default is False) -> Get the latest tweets

Output:

Type -> list

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> tweets = Twitter().search("Pakistan")

Getting USER Info:

Description:

Get the information about the user

Required Parameter:

Username or User profile URL while initiating the Twitter Object

Optional Parameter:

banner_extensions : boolean (Default is False) -> get more information about user banner image
image_extensions : boolean (Default is False) -> get more information about user profile image

Output:

Type -> dict

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> user_info = Twitter("Username or URL").get_user_info()
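
The Twitter object accepts either a username or a profile URL. How tweety resolves the two internally is not shown here. The sketch below is only one plausible way to reduce both forms to a bare username, using the standard library. The function name `normalize_username` is hypothetical.

```python
from urllib.parse import urlparse


def normalize_username(value):
    """Return the bare username from either a username or a profile URL."""
    value = value.strip()
    if "://" in value:
        # Profile URL: take the first path segment, ignoring any query string.
        path = urlparse(value).path
        value = path.strip("/").split("/")[0]
    return value


print(normalize_username("https://twitter.com/Microsoft"))  # Microsoft
print(normalize_username("Microsoft"))                      # Microsoft
```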

Getting a Tweet Detail:

Description:

Get the details of a tweet, including its replies

Required Parameter:

Identifier of the Tweet -> Either Tweet URL OR Tweet ID

Output:

Type -> dict

Structure

  {
    "conversation_threads":[],
    "tweet": {}
  }

Example:

python
Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from tweet import Twitter
>>> detail = Twitter().tweet_detail("https://twitter.com/Microsoft/status/1442542812197801985")
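
Since the identifier may be either a tweet URL or a tweet ID, it can help to see how the two forms relate. The following sketch extracts the numeric ID from either form. It is an illustration under the assumption that tweet URLs contain "/status/<id>", as in the example above, and it is not tweety's own implementation. The function name `extract_tweet_id` is hypothetical.

```python
import re


def extract_tweet_id(identifier):
    """Return the numeric tweet ID (as a string) from a tweet URL or a bare ID."""
    identifier = str(identifier).strip()
    if identifier.isdigit():
        # Already a bare ID.
        return identifier
    match = re.search(r"/status/(\d+)", identifier)
    if match is None:
        raise ValueError(f"not a tweet URL or ID: {identifier!r}")
    return match.group(1)


url = "https://twitter.com/Microsoft/status/1442542812197801985"
print(extract_tweet_id(url))  # 1442542812197801985
```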

Updates:

Update 0.1:

Get multiple pages of tweets using the pages parameter of the get_tweets() function
Output of get_tweets() has been reworked

Update 0.2:

Again reworked and simplified tweets in get_tweets function 😜
Added tweet_detail function for getting details about a tweet including replies to it

Update 0.2.1:

Fixed Hashtag Search

Owner

Tayyab Kharl
