Iptvcrawl - A scrapy project for crawl IPTV playlist

Related tags

Web Crawlingiptvcrawl
Overview

iptvcrawl

a scrapy project for crawl IPTV playlist.

Dependency

  • Python3
  • pip install scrapy

Usage

scrapy crawl ejatv

Output

Output playlist file is playlist.m3u. You should note that this file will be overwritten every time when you run spider.

Customize

You can customer the filter condition. Just edit the start_urls in ejatv.py

Example:

this url

https://eja.tv/?limit=0&country=js&language=Chinese&category=&level=0&search=

means channel from Japan, language is Chinese, and any category

Avaliable parameters value are follow:

Category

Animation => Animation
Auto => Auto
Business => Business
Classic => Classic
Comedy => Comedy
Cooking => Cooking
Culture => Culture
Documentary => Documentary
Education => Education
Entertainment => Entertainment
Family => Family
Fashion => Fashion
General => General
Kids => Kids
Legislative => Legislative
Lifestyle => Lifestyle
Local => Local
Movies => Movies
Music => Music
News => News
Outdoor => Outdoor
Relax => Relax
Religious => Religious
Science => Science
Series => Series
Shop => Shop
Sport => Sport
Sports => Sports
Travel => Travel
Weather => Weather
XXX => XXX
Youtube => Youtube
VOD => VOD

Language

Akan => Akan
Albanian => Albanian
Amharic => Amharic
Arabic => Arabic
Armenian => Armenian
Azerbaijani => Azerbaijani
Bosnian => Bosnian
Bulgarian => Bulgarian
Catalan => Catalan
Chinese => Chinese
Croatian => Croatian
Czech => Czech
Danish => Danish
Divehi => Divehi
Dutch => Dutch
English => English
Estonian => Estonian
Faroese => Faroese
Finnish => Finnish
French => French
Galician => Galician
Georgian => Georgian
German => German
Greek => Greek
Hebrew => Hebrew
Hindi => Hindi
Hungarian => Hungarian
Icelandic => Icelandic
Ignota => Ignota
Indonesian => Indonesian
Italian => Italian
Japanese => Japanese
Javanese => Javanese
Kannada => Kannada
Kazakh => Kazakh
Khmer => Khmer
Kinyarwanda => Kinyarwanda
Korean => Korean
Kurdish => Kurdish
Lao => Lao
Latvian => Latvian
Lithuanian => Lithuanian
Luxembourgish => Luxembourgish
Macedonian => Macedonian
Malay => Malay
Malay => Malay 
Malayalam => Malayalam
Maltese => Maltese
Mandarin%20Chinese => Mandarin Chinese
Min%20Nan%20Chinese => Min Nan Chinese
Modern%20Greek => Modern Greek 
Montenegrin => Montenegrin
Music => Music
Māori => Māori
Norwegian%20Bokmål => Norwegian Bokmål
Persian => Persian
Polish => Polish
Portuguese => Portuguese
Punjabi => Punjabi
Pushto => Pushto
Romanian => Romanian
Russian => Russian
Serbian => Serbian
Sinhala => Sinhala
Slovak => Slovak
Slovenian => Slovenian
Somali => Somali
Spanish => Spanish
Sundanese => Sundanese
Swahili => Swahili
Swedish => Swedish
Tagalog => Tagalog
Tamil => Tamil
Telugu => Telugu
Thai => Thai
Turkish => Turkish
Ukrainian => Ukrainian
Urdu => Urdu
Uzbek => Uzbek
Vietnamese => Vietnamese
Western%20Frisian => Western Frisian
Yue%20Chinese => Yue Chinese

Country

af => Afghanistan
al => Albania
dz => Algeria
ar => Argentina
am => Armenia
aw => Aruba
au => Australia
at => Austria
az => Azerbaijan
bh => Bahrain
bb => Barbados
by => Belarus
be => Belgium
bo => Bolivia
ba => Bosnia and Herzegovina
br => Brazil
bg => Bulgaria
bf => Burkina Faso
kh => Cambodia
cm => Cameroon
ca => Canada
cl => Chile
cn => China
co => Colombia
cr => Costa Rica
hr => Croatia
cw => Curacao
cy => Cyprus
cz => Czechia
cd => Democratic Republic of the Congo
dk => Denmark
do => Dominican Republic
ec => Ecuador
eg => Egypt
sv => El Salvador
gq => Equatorial Guinea
ee => Estonia
et => Ethiopia
fo => Faroe Islands
fj => Fiji
fi => Finland
fr => France
ge => Georgia
de => Germany
gh => Ghana
gr => Greece
gt => Guatemala
gy => Guyana
ht => Haiti
hn => Honduras
hk => Hong Kong
hu => Hungary
is => Iceland
in => India
id => Indonesia
int => International
ir => Iran
iq => Iraq
ie => Ireland
il => Israel
it => Italy
jm => Jamaica
jp => Japan
jo => Jordan
kz => Kazakhstan
ke => Kenya
xk => Kosovo
kw => Kuwait
kg => Kyrgyzstan
la => Laos
lv => Latvia
lb => Lebanon
ly => Libya
li => Liechtenstein
lt => Lithuania
lu => Luxembourg
mo => Macao
my => Malaysia
mv => Maldives
mt => Malta
mx => Mexico
md => Moldova
mc => Monaco
me => Montenegro
ma => Morocco
mz => Mozambique
nl => Netherlands
an => Netherlands Antilles
nz => New Zealand
ni => Nicaragua
ng => Nigeria
mk => North Macedonia
no => Norway
om => Oman
pk => Pakistan
ps => Palestinian Territory
pa => Panama
py => Paraguay
pe => Peru
ph => Philippines
pl => Poland
pt => Portugal
pr => Puerto Rico
qa => Qatar
cg => Republic of the Congo
ro => Romania
ru => Russia
rw => Rwanda
sm => San Marino
sa => Saudi Arabia
sn => Senegal
rs => Serbia
sl => Sierra Leone
sg => Singapore
sk => Slovakia
si => Slovenia
so => Somalia
kr => South Korea
es => Spain
lk => Sri Lanka
sd => Sudan
se => Sweden
ch => Switzerland
sy => Syria
tw => Taiwan
tj => Tajikistan
tz => Tanzania
th => Thailand
tn => Tunisia
tr => Turkey
vi => U.S. Virgin Islands
ug => Uganda
ua => Ukraine
ae => United Arab Emirates
gb => United Kingdom
us => United States
uy => Uruguay
uz => Uzbekistan
va => Vatican
ve => Venezuela
vn => Vietnam
ye => Yemen
zm => Zambia
Owner
Zhijun
Zhijun
An application that on a given url, crowls a web page and gets all words, sorts and counts them.

Web-Scrapping-1 An application that on a given url, crowls a web page and gets all words, sorts and counts them. Installation Using the package manage

adriano atambo 1 Jan 16, 2022
Twitter Eye is a Twitter Information Gathering Tool With Twitter Eye

Twitter Eye is a Twitter Information Gathering Tool With Twitter Eye, you can search with various keywords and usernames on Twitter.

Jolanda de Koff 19 Dec 12, 2022
Find thumbnails and original images from URL or HTML file.

Haul Find thumbnails and original images from URL or HTML file. Demo Hauler on Heroku Installation on Ubuntu $ sudo apt-get install build-essential py

Vinta Chen 150 Oct 15, 2022
Web scrapping

Project Setup Table of Contents Project Setup Table of Contents Run project locally Install Requirements Run script Run project locally Install Requir

Charles 3 Feb 04, 2022
河南工业大学 完美校园 自动校外打卡

HAUT-checkin 河南工业大学自动校外打卡 由于github actions存在明显延迟,建议直接使用腾讯云函数 特点 多人打卡 使用简单,仅需账号密码以及用于微信推送的uid 自动获取上一次打卡信息用于打卡 向所有成员微信单独推送打卡状态 完美校园服务器繁忙时造成打卡失败会自动重新打卡

36 Oct 27, 2022
Scrapes Every Email Address of Every Society in Every University

society-email-scrape Site Live at https://kcsoc.github.io/society-email-scrape/ How to automatically generate new data Go to unis.yml Add your uni Cre

Krishna Consciousness Society 18 Dec 14, 2022
Scrape all the media from an OnlyFans account - Updated regularly

Scrape all the media from an OnlyFans account - Updated regularly

CRIMINAL 3.2k Dec 29, 2022
Simple python tool for the purpose of swapping latinic letters with cirilic ones and vice versa in txt, docx and pdf files in Serbian language

Alpha Swap English This is a simple python tool for the purpose of swapping latinic letters with cirylic ones and vice versa, in txt, docx and pdf fil

Aleksandar Damnjanovic 3 May 31, 2022
Scraping script for stats on covid19 pandemic status in Chiba prefecture, Japan

About 千葉県の地域別の詳細感染者統計(Excelファイル) をCSVに変換し、かつ地域別の日時感染者集計値を出力するスクリプトです。 Requirement POSIX互換なシェル, e.g. GNU Bash (1) curl (1) python = 3.8 pandas = 1.1.

Conv4Japan 1 Nov 29, 2021
Grab the changelog from releases on Github

release-notes-scraper This simple script can be used to grab the release notes for projects from github that do not keep a CHANGELOG, but publish thei

Dan Čermák 4 Apr 01, 2022
A spider for Universal Online Judge(UOJ) system, converting problem pages to PDFs.

Universal Online Judge Spider Introduction This is a spider for Universal Online Judge (UOJ) system (https://uoj.ac/). It also works for all other Onl

TriNitroTofu 1 Dec 07, 2021
Scraping Top Repositories for Topics on GitHub,

0.-Webscrapping-using-python Scraping Top Repositories for Topics on GitHub, Web scraping is the process of extracting and parsing data from websites

Dev Aravind D Satprem 2 Mar 18, 2022
A web service for scanning media hosted by a Matrix media repository

Matrix Content Scanner A web service for scanning media hosted by a Matrix media repository Installation TODO Development In a virtual environment wit

Brendan Abolivier 5 Dec 01, 2022
Web-scraping - A bot using Python with BeautifulSoup that scraps IRS website by form number and returns the results as json

Web-scraping - A bot using Python with BeautifulSoup that scraps IRS website (prior form publication) by form number and returns the results as json. It provides the option to download pdfs over a ra

1 Jan 04, 2022
mlscraper: Scrape data from HTML pages automatically with Machine Learning

🤖 Scrape data from HTML websites automatically with Machine Learning

Karl Lorey 798 Dec 29, 2022
News, full-text, and article metadata extraction in Python 3. Advanced docs:

Newspaper3k: Article scraping & curation Inspired by requests for its simplicity and powered by lxml for its speed: "Newspaper is an amazing python li

Lucas Ou-Yang 12.3k Jan 07, 2023
Web Crawlers for Data Labelling of Malicious Domain Detection & IP Reputation Evaluation

Web Crawlers for Data Labelling of Malicious Domain Detection & IP Reputation Evaluation This repository provides two web crawlers to label domain nam

1 Nov 05, 2021
薅薅乐 - JD 测试脚本

薅薅乐 安裝 使用docker docker一键安装: docker run -d --name jd classmatelin/hhl:latest. 使用 进入容器: docker exec -it jd bash 获取JD_COOKIES: python get_jd_cookies.py,

ClassmateLin 575 Dec 28, 2022
Simple library for exploring/scraping the web or testing a website you’re developing

Robox is a simple library with a clean interface for exploring/scraping the web or testing a website you’re developing. Robox can fetch a page, click on links and buttons, and fill out and submit for

Dan Claudiu Pop 79 Nov 27, 2022
热搜榜-python爬虫+正则re+beautifulsoup+xpath

仓库简介 微博热搜榜, 参数wb 百度热搜榜, 参数bd 360热点榜, 参数360 csdn热榜接口, 下方查看 其他热搜待加入 如何使用? 注册vercel fork到你的仓库, 右上角 点击这里完成部署(一键部署) 请求参数 vercel配置好的地址+api?tit=+参数(仓库简介有参数信息

Harry 3 Jul 08, 2022