A Python module to parse the Open Graph Protocol

Last update: Nov 12, 2022

Overview

OpenGraph is a Python module for parsing the Open Graph Protocol. You can read more about the specification at http://ogp.me/
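To make the idea concrete, here is a minimal sketch of what parsing Open Graph metadata involves: collecting every `<meta property="og:..." content="...">` tag from a page into a dict. It uses only the standard library and is not this module's actual implementation; the names `OGMetaParser` and `parse_open_graph` are hypothetical and exist only for illustration.

```python
from html.parser import HTMLParser


class OGMetaParser(HTMLParser):
    """Collect <meta property="og:..." content="..."> pairs into a dict.

    Hypothetical helper for illustration; not part of the opengraph module.
    """

    def __init__(self):
        super().__init__()
        self.properties = {}

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <meta ... /> also end up here by default.
        if tag != "meta":
            return
        attr_map = dict(attrs)
        prop = attr_map.get("property") or ""
        content = attr_map.get("content")
        if prop.startswith("og:") and content is not None:
            # Strip the "og:" prefix so "og:title" is stored as "title".
            self.properties[prop[3:]] = content


def parse_open_graph(html_text):
    """Return a dict of Open Graph properties found in html_text."""
    parser = OGMetaParser()
    parser.feed(html_text)
    return parser.properties
```

Tags without the `og:` prefix, such as a plain `<meta name="description">`, are ignored by this sketch.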

Installation

$ pip install opengraph

Features

Use it as a Python dict
Fetch and parse a page from a given URL
Parse previously extracted HTML
HTML output
JSON output

Usage

From a URL

>>> import opengraph
>>> video = opengraph.OpenGraph(url="http://www.youtube.com/watch?v=q3ixBmDzylQ")
>>> video.is_valid()
True
>>> for x, y in video.items():
...     print("%-15s => %s" % (x, y))
...
site_name       => YouTube
description     => Eric Clapton and Paul McCartney perform George Harrison's "While My Guitar Gently Weeps" at the...
title           => While My Guitar Gently Weeps
url             => http://www.youtube.com/watch?v=q3ixBmDzylQ
image           => http://i2.ytimg.com/vi/q3ixBmDzylQ/default.jpg
video:type      => application/x-shockwave-flash
video:height    => 224
video           => http://www.youtube.com/v/q3ixBmDzylQ?version=3&autohide=1
video:width     => 398
type            => video

From HTML

>>> HTML = """
... <html xmlns:og="http://ogp.me/ns#">
... <head>
... <title>The Rock (1996)</title>
... <meta property="og:title" content="The Rock" />
... <meta property="og:type" content="movie" />
... <meta property="og:url" content="http://www.imdb.com/title/tt0117500/" />
... <meta property="og:image" content="http://ia.media-imdb.com/images/rock.jpg" />
... </head>
... </html>
... """
>>> movie = opengraph.OpenGraph()  # or pass the markup directly: opengraph.OpenGraph(html=HTML)
>>> movie.parser(HTML)
>>> movie.is_valid()
True
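
The Open Graph specification lists four required properties for every page: og:title, og:type, og:image and og:url. The sketch below shows one way a validity check along those lines could be written against a plain dict of parsed properties. It is an assumption that `is_valid()` checks exactly these four; the names `REQUIRED_PROPERTIES`, `missing_required` and `looks_valid` are hypothetical and not part of the module.

```python
# Required properties according to the Open Graph specification (http://ogp.me/).
# Assumption: a page counts as valid when all four are present and non-empty.
REQUIRED_PROPERTIES = ("title", "type", "image", "url")


def missing_required(properties):
    """Return the required property names that are absent or empty."""
    return [name for name in REQUIRED_PROPERTIES if not properties.get(name)]


def looks_valid(properties):
    """Return True when every required property has a value."""
    return not missing_required(properties)


movie_properties = {
    "title": "The Rock",
    "type": "movie",
    "url": "http://www.imdb.com/title/tt0117500/",
    "image": "http://ia.media-imdb.com/images/rock.jpg",
}
```

Reporting the missing names, rather than only a boolean, makes it easier to see why a page fails the check.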

Generate JSON or HTML

>>> ogp = opengraph.OpenGraph("http://ogp.me/")
>>> print(ogp.to_json())
{"image:type": "image/png", "title": "Open Graph protocol", "url": "http://ogp.me/", "image": "http://ogp.me/logo.png", "scrape": false, "_url": "http://ogp.me/", "image:height": "300", "type": "website", "image:width": "300", "description": "The Open Graph protocol enables any web page to become a rich object in a social graph."}
>>> print(ogp.to_html())

<meta property="og:image:type" content="image/png" />
<meta property="og:title" content="Open Graph protocol" />
<meta property="og:url" content="http://ogp.me/" />
<meta property="og:image" content="http://ogp.me/logo.png" />
<meta property="og:scrape" content="False" />
<meta property="og:_url" content="http://ogp.me/" />
<meta property="og:image:height" content="300" />
<meta property="og:type" content="website" />
<meta property="og:image:width" content="300" />
<meta property="og:description" content="The Open Graph protocol enables any web page to become a rich object in a social graph." />
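
For readers curious how output like the above can be produced, here is a minimal standard-library sketch that serializes a dict of properties to JSON and to `<meta>` tags. It is not the module's own code: the function names `properties_to_json` and `properties_to_meta_tags` are hypothetical, and sorting the keys and HTML-escaping the values are choices made for this example, not documented behavior of `to_json()` or `to_html()`.

```python
import json
from html import escape


def properties_to_json(properties):
    """Serialize the properties dict to a JSON string with stable key order."""
    return json.dumps(properties, sort_keys=True)


def properties_to_meta_tags(properties):
    """Render one <meta property="og:..."> tag per property, one per line."""
    lines = []
    for name, value in sorted(properties.items()):
        # Escape quotes and angle brackets so values cannot break the markup.
        lines.append(
            '<meta property="og:%s" content="%s" />'
            % (escape(name, quote=True), escape(str(value), quote=True))
        )
    return "\n".join(lines)


site = {"title": "Open Graph protocol", "type": "website", "url": "http://ogp.me/"}
print(properties_to_json(site))
print(properties_to_meta_tags(site))
```

Escaping matters as soon as a title or description contains a double quote, which would otherwise end the `content` attribute early.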

Owner

Erik Rivera
