Zotero2Readwise - A Python Library to retrieve annotations and notes from Zotero and upload them to your Readwise

Last update: Dec 20, 2022

Related tags

Overview

Zotero ➡️ Readwise

zotero2readwise is a Python library that retrieves all Zotero annotations† and notes. Then, It automatically uploads them to your Readwise§§.

This is particularly useful for the new Zotero PDF Reader that stores all highlights in the Zotero database. The new Zotero, also available for iOS app (currently in beta). In the new Zotero, the annotations are NOT saved in the PDF file unless you export the highlights in order to save them.

If you annotate your files outside the new Zotero PDF reader, this library may not work with your PDF annotations as those are not retrievable from Zotero API.

This library is for you if you annotate (highlight + note) using the Zotero's PDF reader (including the Zotero iOS)

👉 Updating an existing Zotero annotation or note and re-running this library will update the corresponding Readwise highlight without creating a duplicate!

† Annotations made in the new Zotero PDF reader and note editor.

§ Readwise is a paid service/software that integrates your highlights from almost everywhere (Pocket, Instapaper, Twitter, Medium, Apple Books, and many more). It even has an amazing OCR for directly importing your highlights on a physical book/article into Readwise and allowing you to export all your highlights to Obsidian, Notion, Roam, Markdown, etc. Moreover, It has an automated Spaced Repition and Active Recall. You can use the the link here to get an extra free month (Disclaimer: I will get a free month too!)

Installation

You can install the library by running

pip install zotero2readwise

Note: If you do not have pip installed on your system, you can follow the instructions here.

Usage

Since we have to retrieve the notes from Zotero API and then upload them to the Readwise, the minimum requirements are:

Readwise access token [Required]: You can get your access token from https://readwise.io/access_token
Zotero API key [Required]: Create a new Zotero Key from your Zotero settings
Zotero personal or group ID [Required]:
- Your personal library ID (aka userID) can be found here next to Your userID for use in API calls is XXXXXX.
- If you're using a group library, you can find the library ID by
  1. Go to https://www.zotero.org/groups/
  2. Click on the interested group.
  3. You can find the library ID from the URL link that has format like https://www.zotero.org/groups/<group_id>/group_name. The number between /groups/ and /group_name is the libarry ID.
Zotero library type [Optional]: "user" (default) if using personal library and "group" if using group library.

Note that if you want to retrieve annotations and notes from a group, you should provide the group ID (zotero_library_id=<group_id>) and set the library type to group (zotero_library_type="group").

Approach 1 (Recommended)

from zotero2readwise.zt2rw import Zotero2Readwise

zt_rw = Zotero2Readwise(
    readwise_token="your_readwise_access_token",  # Visit https://readwise.io/access_token)
    zotero_key="your_zotero_key",  # Visit https://www.zotero.org/settings/keys
    zotero_library_id="your_zotero_id", # Visit https://www.zotero.org/settings/keys
    zotero_library_type="user", # "user" (default) or "group"
    include_annotations=True, # Include Zotero annotations -> Default: True
    include_notes=False, # Include Zotero notes -> Default: False
)
zt_rw.run()

Just to make sure that all files are created, you can run save_failed_items_to_json() from readwise attribute of the class object to save any highlight that failed to upload to Readwise. If a file or more failed to create, the filename (item title) and the corresponding Zotero item key will be saved to a txt file.

zt_rw.readwise.save_failed_items_to_json("failed_readwise_highlights.json")

Approach 2

You can use the run.py script. Run python run.py -h to get more information about all options. You can simply run the script as the following:

python run.py <readwise_token> <zotero_key> <zotero_id>

Request a new feature or report a bug

Feel free to request a new feature or report a bug in GitHub issue here.

📫 How to reach me:

Comments

Update README.md

Link to Zotero Settings changed from https://www.zotero.org/settings/key to https://www.zotero.org/settings/keys

I also added /new to directly link to generating a new key, maybe you could explain which settings are needed for a new key (read/write).

opened by floriankilian 1
Fix invalid link to Zotero Settings page

Thank you so much for your great work! While setting up my forked repo, I noticed a broken link, so I fixed it.

"https://www.zotero.org/settings/key" to "https://www.zotero.org/settings/keys"

opened by nobuyukioishi 0
Send case law and other types of documents to Readwise?

Would it be possible to send other types of documents other than books and articles to Readwise? For example I annotate a lot of case law and laws and reports. I’m fine if these are categorised as articles if this means they are also sent to Readwise.

But if it’s possible to categorise them correctly and get them into Readwise that’d be wonderful! Is this a possibility?

opened by ABeehive 0
Partial push of highlights to Readwise

Currently, the library fetches all Zotero highlights/notes and pushes them to Readwise each time.

For efficiency and also in order to avoid any potential duplicated highlights due to either library changes or changes in Readwise de-duplication algorithm, the library should be able to push only latest highlights.

This is related to the issue #31.

opened by e-alizadeh 0
After the last release all my articles from Zottero got duplicated

As in the title. I have set the cronjob for 3 am every day. And on 20 Oct (after the new release on 19 Oct) all the articles got pushed to the Zotero second time. I tried to simply remove them, but they got pushed again. Can we somehow fix this issue?

opened by piojanu 6
Z2R.zt2rw Approach 2 (through python terminal) stopped working in July 2022

It goes through the normal sequence in python, exactly as before but highlights just no longer appear in Readwise. I recreated my zotero key and readwise token and still not appearing. No errors, zotero seems to be collating and pushing the data to Readwise.

Anyone else experiencing this? Did Readwise change something on their end?

opened by bcmorrison3 6
Only sync pre-specified color(s)?

It would be convenient to have a highlight color which signifies "I want this to be synced to Readwise". In my case, most of my highlights aren't review-worthy in a generalized context outside of whatever research I'm doing. Some small amount are.

I imagine most people don't use all of the available options anyway.

opened by deklanw 0

Releases(v0.2.6)

v0.2.6(Oct 31, 2022)
Fix

Merge pull request #32 from e-alizadeh/better-logging (0788e4c)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.2.6-py3-none-any.whl(11.27 KB)
zotero2readwise-0.2.6.tar.gz(11.97 KB)
v0.2.5(Oct 19, 2022)
Fix

Merge pull request #28 from stefanku/master (ea16ffa)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.2.5-py3-none-any.whl(11.25 KB)
zotero2readwise-0.2.5.tar.gz(11.94 KB)
v0.2.4(Apr 24, 2022)
Fix

Update iPython to resolve a security bug. (12b1908)

Remove category from Readwise source_url (0ed6118)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.2.4-py3-none-any.whl(11.25 KB)
zotero2readwise-0.2.4.tar.gz(11.98 KB)
v0.2.3(Jan 7, 2022)
Fix

Use alternate link Zotero (https://www.zotero.org/username/items/<itemKey>) that has a html content instead of self link (https://api.zotero.org/users/<userID>/items/<itemKey>) that contains a JSON content and calls the API. (3310ad1)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.2.3-py3-none-any.whl(11.25 KB)
zotero2readwise-0.2.3.tar.gz(11.98 KB)
v0.2.2(Jan 3, 2022)
Fix

An oversight in Zotero2Readwise class method run() (previously run_all()) (e2b1336)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.2.2-py3-none-any.whl(11.06 KB)
zotero2readwise-0.2.2.tar.gz(11.58 KB)
v0.2.1(Jan 3, 2022)
Fix

Get non-empty objects from ZoteroItem (so that we have a JSON serializable object) (6b79fc9)

Ignore highlights more than 8191 characters (readwise limit for a highlight.) (7503324)

Documentation

Improve printouts for both Zotero and Readwise operations (5a22717)

Define Zotero2ReadwiseError exception object. (7d5022a)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.2.1-py3-none-any.whl(11.06 KB)
zotero2readwise-0.2.1.tar.gz(11.59 KB)
v0.2.0(Jan 1, 2022)
Feature

Refactor Zotero2Readwise.run() to pass a custom number of Zotero annotations and notes instead of running all. (7c8a337)

Fix

Remove filtering zotero items upto 5 items. (4f3e5e0)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.2.0-py3-none-any.whl(10.33 KB)
zotero2readwise-0.2.0.tar.gz(10.84 KB)
v0.1.1(Jan 1, 2022)
Fix

Project descriptions (502806c)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.1.1-py3-none-any.whl(10.26 KB)
zotero2readwise-0.1.1.tar.gz(10.68 KB)
v0.1.0(Jan 1, 2022)
Feature

Define Zotero2Readwise class that runs everything. (8361426)

Documentation

Add instructions to README. (925ecf9)

Source code(tar.gz)
Source code(zip)
zotero2readwise-0.1.0-py3-none-any.whl(8.07 KB)
zotero2readwise-0.1.0.tar.gz(6.25 KB)

Owner

Essi Alizadeh

Engineer & Data Scientist in Permanent Beta: Learning, Improving, Evolving ...

GitHub Repository

Zotero2Readwise - A Python Library to retrieve annotations and notes from Zotero and upload them to your Readwise

Related tags

Overview

Zotero ➡️ Readwise

Installation

Usage

Approach 1 (Recommended)

Approach 2

Request a new feature or report a bug

📫 How to reach me:

Comments

Update README.md

Fix invalid link to Zotero Settings page

Send case law and other types of documents to Readwise?

Partial push of highlights to Readwise

After the last release all my articles from Zottero got duplicated

Z2R.zt2rw Approach 2 (through python terminal) stopped working in July 2022

Only sync pre-specified color(s)?

Releases(v0.2.6)

v0.2.6(Oct 31, 2022)

Fix

v0.2.5(Oct 19, 2022)

Fix

v0.2.4(Apr 24, 2022)

Fix

v0.2.3(Jan 7, 2022)

Fix

v0.2.2(Jan 3, 2022)

Fix

v0.2.1(Jan 3, 2022)

Fix

Documentation

v0.2.0(Jan 1, 2022)

Feature

Fix

v0.1.1(Jan 1, 2022)

Fix

v0.1.0(Jan 1, 2022)

Feature

Documentation

Owner

Essi Alizadeh

News, full-text, and article metadata extraction in Python 3. Advanced docs:

Brownant is a web data extracting framework.

Pythonic HTML Parsing for Humans™

Github Actions采集RSS, 打造无广告内容优质的头版头条超赞宝藏页

Open clone of OpenAI's unreleased WebText dataset scraper.

Combine XPath, CSS Selectors and JSONPath for Web data extracting.

Every web site provides APIs.

Module for automatic summarization of text documents and HTML pages.

RSS feed generator website with user friendly interface

Export your data from Xiami

Web-Extractor - Simple Tool To Extract IP-Adress From Website

Web Content Retrieval for Humans™

Zotero2Readwise - A Python Library to retrieve annotations and notes from Zotero and upload them to your Readwise

fast python port of arc90's readability tool, updated to match latest readability.js!

a small library for extracting rich content from urls

Fast and robust date extraction from web pages, with Python or on the command-line

Convert HTML to Markdown-formatted text.