A multithreaded tool for searching and downloading images from popular search engines. It is straightforward to set up and run!

Last update: Dec 31, 2022

🕳️ CygnusX1

Code by Trong-Dat Ngo.

Overview

🕳️ CygnusX1 is a multithreaded tool 🛠️ for searching and downloading images from popular search engines 🔎. It is straightforward to set up and run!

Key features

🥰 No prior knowledge is required to get up and running.
🚀 Download images using a customizable number of threads.
⛏️ Crawl all available images (search results and recommendations).
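
To illustrate the "customizable number of threads" feature, here is a minimal sketch of a thread-pool downloader built on the standard library. It is not CygnusX1's actual implementation: the names `download_all` and `fetch_bytes`, the `image_00000.bin` file naming, and the injectable `fetch` parameter are all assumptions made for this example.

```python
import os
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen


def fetch_bytes(url, timeout=10):
    """Default fetcher (hypothetical helper): return the raw bytes at `url`."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def download_all(urls, out_dir, workers=2, fetch=fetch_bytes):
    """Download every URL into `out_dir` using at most `workers` threads.

    Returns a list of (url, saved_path_or_None) in the same order as `urls`.
    """
    os.makedirs(out_dir, exist_ok=True)

    def save(indexed_url):
        index, url = indexed_url
        # File naming is an assumption for this sketch, not the tool's scheme.
        path = os.path.join(out_dir, "image_{:05d}.bin".format(index))
        try:
            data = fetch(url)
        except Exception:
            # One failed download should not stop the others.
            return url, None
        with open(path, "wb") as handle:
            handle.write(data)
        return url, path

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order even though the work runs concurrently.
        return list(pool.map(save, enumerate(urls)))
```

Raising `workers` lets more downloads wait on the network at the same time, which is where threads help most for this kind of I/O-bound work.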

Installation

This repository is tested on Python 3.6+ and selenium 3.141.0+, and works on macOS, Windows, and Linux.

You should set up and run 🕳️ CygnusX1 in a virtual environment. If you're unfamiliar with Python virtual environments, check out the user guide here.

First, create a virtual environment with the version of Python you're going to use and activate it. (You can skip this step if you prefer to install directly into the system environment.)

python3 -m venv venv
source venv/bin/activate

Then download 🕳️ CygnusX1 from GitHub:

git clone https://github.com/dat821168/CygnusX1.git
cd CygnusX1

Finally, install the dependencies listed in requirements.txt:

pip install -r requirements.txt

Run

Use run.py to start the script:

python run.py --keywords "keyword 1, keyword 2" --workers 8 --use_suggestions --headless

Argument details:

--keywords: The keywords or key phrases to search for. Separate multiple keywords with commas.
--out_dir: Directory where results are saved. Default = './IMAGES'.
--workers: The maximum number of workers used to crawl images. Default = 2.
--use_suggestions: Also crawl the search engine's suggestions/recommendations. Default = False.
--headless: Hide the browser window during scraping. Default = False.
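
The arguments above could be declared with `argparse` roughly as follows. This is a sketch that mirrors the documented flags and defaults, not a copy of the real run.py: the helper names `build_parser` and `split_keywords` are hypothetical, and treating `--keywords` as required is an assumption.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the documented flags and defaults."""
    parser = argparse.ArgumentParser(description="Sketch of the documented CLI flags.")
    # Assumption: --keywords is mandatory; the README does not say either way.
    parser.add_argument("--keywords", required=True,
                        help="Comma-separated keywords or key phrases to search for.")
    parser.add_argument("--out_dir", default="./IMAGES",
                        help="Directory where results are saved.")
    parser.add_argument("--workers", type=int, default=2,
                        help="Maximum number of workers used to crawl images.")
    # store_true flags default to False and become True when present.
    parser.add_argument("--use_suggestions", action="store_true",
                        help="Also crawl search engine suggestions.")
    parser.add_argument("--headless", action="store_true",
                        help="Hide the browser window during scraping.")
    return parser


def split_keywords(raw):
    """Split a comma-separated string into trimmed, non-empty keywords."""
    return [part.strip() for part in raw.split(",") if part.strip()]


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(split_keywords(args.keywords), args.out_dir, args.workers)
```

Keeping `--use_suggestions` and `--headless` as boolean switches matches the example command, where both appear without a value.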
