pubmex.py - a script to get a fancy paper title based on given DOI or PMID

Last update: Nov 20, 2022

Overview

Pu(b)mex

pubmex.py is a script to get a fancy paper title based on given DOI or PMID (can be also combined with macOS Finder)

Format of the title:

a first author . a last author - (title("dotted") or your customed title) . PMID . journal . year . pdf
e.g.
  Kelley.Scott.The.evolution.biology.shift.towards.engineering.prediction-generating.tools.away.traditional.research.practice.EMBORep.2008.pdf

Nowadays, it’s not a big issue, with all Mendeley and other tools, however...

I don’t want to put any PDF file collected on the way into my library, because then it gets super big (and then it’s hard to sync it for example with Dropbox). So now I can keep these PDF files into pdf-icebox and re-name them niecely automatically:

$ ls
Hnisz.Sharp.Phase.Separation.Model.Transcriptional.Control.Cell.2017.pdf
Sharp.Hockfield.Convergence.The.future.health.Science.2017.pdf

Usage:

./Balas.Johnson.Establishing.RNA-RNA.interactions.remodels.lncRNA.structure.promotes.PRC2.activity.SciAdv.2021.pdf ">

$ pubmex.py sharp2017.pdf
Sharp.Hockfield.Convergence.The.future.health.Science.2017.pdf
mv sharp2017.pdf --> ./Sharp.Hockfield.Convergence.The.future.health.Science.2017.pdf

$ pubmex.py Query.Konarska.pdf
mv Query.Konarska.pdf --> ./Smith.Konarska."Nought.may.endure.but.mutability".spliceosome.dynamics.regulation.splicing.MolCell.2008.pdf
    
$ pubmex.py eabc9191.full.pdf
mv  eabc9191.full.pdf --> ./Balas.Johnson.Establishing.RNA-RNA.interactions.remodels.lncRNA.structure.promotes.PRC2.activity.SciAdv.2021.pdf

DEPENDENCIES

biopython (http://biopython.org/wiki/Biopython)
pdftotext (http://linux.die.net/man/1/pdftotext)

INSTALLATION

pip install pubmex
# Ubuntu (Debian-based system)
apt-get install xclip python-biopython pdftotext
# macOS
brew install poppler biopython # or "sudo port install poppler biopython"

HISTORY

1.4 Add osx-automator
1.3 Fixed #4 #5
1.2 Fixed #2
1.1 Simplify input, pubmex.py *.pdf
1.0 With recent bugfixes 2021
0.3 OSX installation
0.2 Small changes
0.1 Init version in 2010! :-)

Comments

Automator not working
It seems that when using the automator installations that come with the pubmex the pubmex.py can not be found.

for f in "$@" do pubmex.py $f done

The following error is displayed:

The action “Run Shell Script” encountered an error: “zsh:3: command not found: pubmex.py”

When specifying the direct location of just the pubmex.py file another error occures.

for f in "$@" do /users/suntim/miniforge3/bin/pubmex.py $f done

The following error is displayed:

The action “Run Shell Script” encountered an error: “”

When specifying the direct location of python and the pubmex.py file another error occures.

for f in "$@" do /usr/local/bin/python3 /users/suntim/miniforge3/bin/pubmex.py $f done

The following error is displayed:

The action “Run Shell Script” encountered an error: “Traceback (most recent call last): File "/users/suntim/miniforge3/bin/pubmex.py", line 27, in <module> from Bio import Entrez ModuleNotFoundError: No module named 'Bio'”

I have all dependencies installed pip3 install pubmex, pip3 install biopython, brew install poppler. As it says in the readme.md that biopython should be isntalled via brew I assume that was a mistake. I instead installed it via pip3.

The same error messages occure regardless of using the zsh or bash version.
opened by LinusKaiser 2
Not found in PubMed, although DOI (.ORG/10.1016/J.BBAGRM.2015.08.009) was detected

[email protected]:~/Desktop/pdfs$ pubmex.py -a -r -f 1-s2.0-S1874939915001868-main.pdf ERROR: Not found in PubMed, although DOI (.ORG/10.1016/J.BBAGRM.2015.08.009) was detected in the pdf! Traceback (most recent call last): File "/home/magnus/bin/pubmex.py", line 472, in main() File "/home/magnus/bin/pubmex.py", line 451, in main title = get_title_auto_from_text(text, OPTIONS.debug, False, OPTIONS.keywords) File "/home/magnus/bin/pubmex.py", line 239, in get_title_auto_from_text return get_title_via_doi(doi, debug, reference, customed_title) File "/home/magnus/bin/pubmex.py", line 359, in get_title_via_doi pmid = get_pmid_via_doi_net(doi) File "/home/magnus/bin/pubmex.py", line 333, in get_pmid_via_doi_net return get_value('citation_pmid', content) TypeError: get_value() takes exactly 3 arguments (2 given)

opened by mmagnus 2
Invalid git clone (edit: on windows machines)

The colon in 'demo/10.1261:rna.418407.pdf' causes problems in cloning from windows machines.

Cloning into 'pubmex'... remote: Enumerating objects: 426, done. remote: Counting objects: 100% (9/9), done. remote: Total 426 (delta 8), reused 8 (delta 8), pack-reused 417 eceiving obj Receiving objects: 100% (426/426), 3.79 MiB | 2.86 MiB/s, done. Resolving deltas: 100% (252/252), done. error: invalid path 'demo/10.1261:rna.418407.pdf' fatal: unable to checkout working tree warning: Clone succeeded, but checkout failed. You can inspect what was checked out with 'git status' and retry with 'git restore --source=HEAD :/'

opened by gcasale 1

ct200162x.pdf

(py37) [mx] rna$ pubmex.py ct200162x.pdf --debug
filename: .......... ct200162x.pdf
filename: .......... ct200162x.pdf
doi: ............... ct200162x
IdList.............. []
pmid: .............. False
ERROR: 		Not found in PubMed, although DOI (ct200162x) was detected in the pdf!
generate ./temp.....[OK]
out:
err:
temp is going to be opened
doi_line: .......... DX.DOI.ORG/10.1021/CT200162X | J. CHEM. THEORY COMPUT. 2011, 7, 28862902
doi is found: ...... 10.1021/CT200162X
doi: ............... 10.1021/CT200162X
IdList.............. ['21921995']
pmid: .............. 21921995
summary_dict........ {'Item': [], 'Id': '21921995', 'PubDate': '2011 Sep 13', 'EPubDate': '2011 Aug 2', 'Source': 'J Chem Theory Comput', 'AuthorList': ['Zgarbová M', 'Otyepka M', 'Sponer J', 'Mládek A', 'Banáš P', 'Cheatham TE 3rd', 'Jurečka P'], 'LastAuthor': 'Jurečka P', 'Title': 'Refinement of the Cornell et al. Nucleic Acids Force Field Based on Reference Quantum Chemical Calculations of Glycosidic Torsion Profiles.', 'Volume': '7', 'Issue': '9', 'Pages': '2886-2902', 'LangList': ['English'], 'NlmUniqueID': '101232704', 'ISSN': '1549-9618', 'ESSN': '1549-9626', 'PubTypeList': ['Journal Article'], 'RecordStatus': 'PubMed', 'PubStatus': 'ppublish+epublish', 'ArticleIds': {'pubmed': ['21921995'], 'medline': [], 'doi': '10.1021/ct200162x', 'pmc': 'PMC3171997', 'rid': '21921995', 'eid': '21921995', 'pmcid': 'pmc-id: PMC3171997;'}, 'DOI': '10.1021/ct200162x', 'History': {'pubmed': ['2011/09/17 06:00'], 'medline': ['2011/09/17 06:01'], 'received': '2011/03/08 00:00', 'entrez': '2011/09/17 06:00'}, 'References': [], 'HasAbstract': IntegerElement(1, attributes={}), 'PmcRefCount': IntegerElement(242, attributes={}), 'FullJournalName': 'Journal of chemical theory and computation', 'ELocationID': '', 'SO': '2011 Sep 13;7(9):2886-2902'}
ERROR: 		Problem! The pubmex could not find automatically a title for the pdf file! Sorry!

opened by mmagnus 0

gkz1184.pdf

(py37) [mx] rna$ pubmex.py gkz1184.pdf --debug
filename: .......... gkz1184.pdf
filename: .......... gkz1184.pdf
doi: ............... gkz1184
IdList.............. []
pmid: .............. False
ERROR: 		Not found in PubMed, although DOI (gkz1184) was detected in the pdf!
generate ./temp.....[OK]
out:
err:
temp is going to be opened
doi_line: .......... 11641174 NUCLEIC ACIDS RESEARCH, 2020, VOL. 48, NO. 3 DOI: 10.1093/NAR/GKZ1184
doi is found: ...... 10.1093/NAR/GKZ1184
doi: ............... 10.1093/NAR/GKZ1184
IdList.............. ['31889193']
pmid: .............. 31889193
summary_dict........ {'Item': [], 'Id': '31889193', 'PubDate': '2020 Feb 20', 'EPubDate': '', 'Source': 'Nucleic Acids Res', 'AuthorList': ['Reißer S', 'Zucchelli S', 'Gustincich S', 'Bussi G'], 'LastAuthor': 'Bussi G', 'Title': 'Conformational ensembles of an RNA hairpin using molecular dynamics and sparse NMR data.', 'Volume': '48', 'Issue': '3', 'Pages': '1164-1174', 'LangList': ['English'], 'NlmUniqueID': '0411011', 'ISSN': '0305-1048', 'ESSN': '1362-4962', 'PubTypeList': ['Journal Article'], 'RecordStatus': 'PubMed - indexed for MEDLINE', 'PubStatus': 'ppublish', 'ArticleIds': {'pubmed': ['31889193'], 'medline': [], 'pii': '5691221', 'doi': '10.1093/nar/gkz1184', 'pmc': 'PMC7026608', 'rid': '31889193', 'eid': '31889193', 'pmcid': 'pmc-id: PMC7026608;'}, 'DOI': '10.1093/nar/gkz1184', 'History': {'pubmed': ['2020/01/01 06:00'], 'medline': ['2020/03/20 06:00'], 'accepted': '2019/12/09 00:00', 'revised': '2019/12/05 00:00', 'received': '2019/10/14 00:00', 'entrez': '2020/01/01 06:00'}, 'References': [], 'HasAbstract': IntegerElement(1, attributes={}), 'PmcRefCount': IntegerElement(3, attributes={}), 'FullJournalName': 'Nucleic acids research', 'ELocationID': 'doi: 10.1093/nar/gkz1184', 'SO': '2020 Feb 20;48(3):1164-1174'}
ERROR: 		Problem! The pubmex could not find automatically a title for the pdf file! Sorry!

opened by mmagnus 0

some problem when I removed some prints to make the script quite

(py37) [mx] d$ pubmex -p 10.1016/j.molcel.2020.11.004
(py37) [mx] d$ pubmex -p 10.1016/j.molcel.2020.11.004 -d
doi: ............... 10.1016/j.molcel.2020.11.004
IdList.............. ['33259809']
pmid: .............. 33259809
summary_dict........ {'Item': [], 'Id': '33259809', 'PubDate': '2020 Dec 17', 'EPubDate': '2020 Nov 5', 'Source': 'Mol Cell', 'AuthorList': ['Ziv O', 'Price J', 'Shalamova L', 'Kamenova T', 'Goodfellow I', 'Weber F', 'Miska EA'], 'LastAuthor': 'Miska EA', 'Title': 'The Short- and Long-Range RNA-RNA Interactome of SARS-CoV-2.', 'Volume': '80', 'Issue': '6', 'Pages': '1067-1077.e5', 'LangList': ['English'], 'NlmUniqueID': '9802571', 'ISSN': '1097-2765', 'ESSN': '1097-4164', 'PubTypeList': ['Journal Article'], 'RecordStatus': 'PubMed - indexed for MEDLINE', 'PubStatus': 'ppublish+epublish', 'ArticleIds': {'pubmed': ['33259809'], 'medline': [], 'pii': 'S1097-2765(20)30782-6', 'doi': '10.1016/j.molcel.2020.11.004', 'pmc': 'PMC7643667', 'rid': '33259809', 'eid': '33259809', 'pmcid': 'pmc-id: PMC7643667;'}, 'DOI': '10.1016/j.molcel.2020.11.004', 'History': {'pubmed': ['2020/12/02 06:00'], 'medline': ['2021/01/12 06:00'], 'received': '2020/07/20 00:00', 'revised': '2020/10/05 00:00', 'accepted': '2020/10/29 00:00', 'entrez': '2020/12/01 20:08'}, 'References': [], 'HasAbstract': IntegerElement(1, attributes={}), 'PmcRefCount': IntegerElement(10, attributes={}), 'FullJournalName': 'Molecular cell', 'ELocationID': 'doi: 10.1016/j.molcel.2020.11.004', 'SO': '2020 Dec 17;80(6):1067-1077.e5'}
Ziv.Miska.The.Short-Long-Range.RNA-RNA.Interactome.SARS-CoV-2.MolCell.2020.pdf

bug

opened by mmagnus 0

Releases(1.4.2)

1.4.2(Mar 15, 2022)

Now you can see in Finder QuickAction pubmex to quick run it on a number of PDFs files.

Install pubmex_zsh.workflow from pubmex/osx-automator/ for if you default shell is zsh, or pubmex_bash.workflow for bash.

Source code(tar.gz)
Source code(zip)
1.4.1(Mar 12, 2022)

Add source .bashrc or .zshrc to fix a problem with missing pubmex.py (#6)
Source code(tar.gz)
Source code(zip)
1.4(Sep 27, 2021)

Now you can see in Finder QuickAction pubmex to quick run it on a number of PDFs files.

Install pubmex_zsh.workflow from pubmex/osx-automator/ for if you default shell is zsh, or pubmex_bash.workflow for bash.

Source code(tar.gz)
Source code(zip)
1.3(Sep 26, 2021)

Fixes for #4 & #5
Source code(tar.gz)
Source code(zip)
1.2(Sep 14, 2021)

Update the licence to MIT and add the tool to PyPI https://pypi.org/project/pubmex/1.2/
Source code(tar.gz)
Source code(zip)

1.1(Aug 18, 2021)

Simplify input to pubmex.py *.pdf. Fixed #2

Now, usage:

$ pubmex.py sharp2017.pdf
mv  sharp2017.pdf --> ./Sharp.Hockfield.Convergence.The.future.health.Science.2017.pdf

$ pubmex.py  Query.Konarska.pdf
mv  Query.Konarska.pdf --> Smith.Konarska."Nought.may.endure.but.mutability".spliceosome.dynamics.regulation.splicing.MolCell.2008.pdf

$ pubmex.py eabc9191.full.pdf
mv  eabc9191.full.pdf --> ./Balas.Johnson.Establishing.RNA-RNA.interactions.remodels.lncRNA.structure.promotes.PRC2.activity.SciAdv.2021.pdf

Source code(tar.gz)
Source code(zip)

1.0(Jun 23, 2021)

Usage:

$ pubmex.py -a -f sharp2017.pdf -r
mv  sharp2017.pdf --> ./Sharp.Hockfield.Convergence.The.future.health.Science.2017.pdf

$ pubmex.py -a -f Query.Konarska.pdf -r
mv  Query.Konarska.pdf --> Smith.Konarska."Nought.may.endure.but.mutability".spliceosome.dynamics.regulation.splicing.MolCell.2008.pdf

$ pubmex.py -a -f eabc9191.full.pdf -r
mv  eabc9191.full.pdf --> ./Balas.Johnson.Establishing.RNA-RNA.interactions.remodels.lncRNA.structure.promotes.PRC2.activity.SciAdv.2021.pdf

.. and we get a file:

Smith.Konarska."Nought.may.endure.but.mutability".spliceosome.dynamics.regulation.splicing.MolCell.2008.pdf

Source code(tar.gz)
Source code(zip)

Owner

Marcin Magnus

Ph.D., molecular biologist & bioinformatician, uses Pen & Paper and Emacs for notes, coding & RNA!

GitHub Repository

Simple tool downloads public PoC (refer from nomi-sec)

PoC Collection This is the little script to collect the proof-of-concept which is refered from nomi-sec. The repository now is only develop for linux-

2 Aug 17, 2022

Downloads data from OSM API and uploads it to the mapping sandbox.

OpenStreetMap To Sandbox This is a script to download data from OSM API and upload it to the mapping sandbox. Note that it clears all data in the sand

5 Nov 27, 2022

This is Yt Downloader. Coded with Python (my first repository)

Get Started Download & install Python first before using this software. Download Python Installing Python and Pytube Library (IMPORTANT) Installing Py

2 Oct 25, 2021

A Unit3D Mass Release Downloader

Unit3DMassDL A Unit3D Mass Release Downloader. Currently supports Aither. Installation Ensure Python 3 is installed in your system. Run the following

2 Apr 11, 2022

Youtube-music - Youtube music with python

youtube-music fzf on https://github.com/junegunn/fzf python3 ytb.py [no/yes] yes

0 Feb 03, 2022

Um projeto modesto para baixar vídeos do youtube usando tkinter como gui

Youtube Downloader Um projeto modesto para baixar vídeos do youtube usando tkinter como gui Instalação dos requirements: python3 setup.py ou python se

2 Nov 25, 2021

A python module to download ISO Standards

ISO Standards Downloader A python module to download ISO Standards from https://standards.iso.org/iso-iec/ Report Bug · Request Feature Table of conte

1 Dec 29, 2021

Download Thumbnail of YouTube Videos

Download Thumbnail of YouTube Videos in High Quality Variables: API_ID : Get From my.telegram.org API_HASH : Get from my.telegram.org BOT_TOKEN : Your

6 Jun 08, 2022

Tool To download 4KHDR DV SDR from AppleTV

# APPLE-TV 4K Downloader Tool To download 4K HDR DV SDR from AppleTV Hello Fellow Developers/ ! Hi! My name is WVDUMP. I am Leaking the scripts to

5 Dec 25, 2021

This is a simple Python Script to download Imgur Pictures with the short url!

Imgur Downloader This is a simple Python Script that runs a process with progress bar that downloads an Imgur Picture! Code Example Features Progress

1 Nov 18, 2021

A Quick demo of how to use the youtube_dl module in python.

youtube_dl python module demo A Quick demo of how to use the youtube_dl module in python. Whole documentation for the youtube_dl Installation git

7 Aug 27, 2021

1Fichier Download Manager.

1fichier-dl 1Fichier Download Manager. Features ⭐ Manage your downloads ⭐ Bypass time limits Credits All icons, including the app icon, were provided

470 Oct 08, 2022

Can automatically download mods from a Curseforge modpack

Curseforge-Modpack-Downloader A Python script which automatically downloads mods from a Curseforge modpack. Installing Dependencies ⚠ Make sure you ha

1 Sep 20, 2022

抖音批量下载助手

303 Jan 05, 2023

Simple Youtube Video Downloader

Simple Youtube Video Downloader Download Youtube video using link and Will output result in D:/ (You can change the path in main.py file) Installation

1 Oct 28, 2021

Youtube list to mp3 - Youtube list to mp3 downloader

Youtube list to mp3 downloader Tiny script to convert a list of youtube videos t

3 Feb 11, 2022

YoutubeDownloader - Download any public Playlist from Youtube

YoutubeDownloader Download any public Youtube Channel / Playlist Features Bulk d

17 Nov 12, 2022

Download Apple Music Cover Artwork in the best Quality by providing an Apple Music Link. It downloads the jpg, png and webp version since they often differ from another.

amogus.py - Version 0.0.5 amogus - Apple Music Hi-Res Artwork Fetcher this is my first real python tool so sorry if its bad amogus is a Python script

46 Jan 09, 2023

The lyrics module of the repository apple-playlist-downloader

This is the lyrics module of the repository apple-playlist-downloader. With this code you can download the .lrc file (time synced lyrics) from yours t

6 Oct 07, 2022

PyDownloader - Downloads files and folders at high speed (based on your interent speed).

4 Feb 24, 2022

pubmex.py - a script to get a fancy paper title based on given DOI or PMID

Related tags

Overview

Pu(b)mex

DEPENDENCIES

INSTALLATION

HISTORY

Comments

Automator not working

It seems that when using the automator installations that come with the pubmex the pubmex.py can not be found.

The following error is displayed:

When specifying the direct location of just the pubmex.py file another error occures.

The following error is displayed:

When specifying the direct location of python and the pubmex.py file another error occures.

The following error is displayed:

Not found in PubMed, although DOI (.ORG/10.1016/J.BBAGRM.2015.08.009) was detected

Invalid git clone (edit: on windows machines)

The colon in 'demo/10.1261:rna.418407.pdf' causes problems in cloning from windows machines.

ct200162x.pdf

gkz1184.pdf

some problem when I removed some prints to make the script quite

Releases(1.4.2)

1.4.2(Mar 15, 2022)

1.4.1(Mar 12, 2022)

1.4(Sep 27, 2021)

1.3(Sep 26, 2021)

1.2(Sep 14, 2021)

1.1(Aug 18, 2021)

1.0(Jun 23, 2021)

Owner

Marcin Magnus

Simple tool downloads public PoC (refer from nomi-sec)

Downloads data from OSM API and uploads it to the mapping sandbox.

This is Yt Downloader. Coded with Python (my first repository)

A Unit3D Mass Release Downloader

Youtube-music - Youtube music with python

Um projeto modesto para baixar vídeos do youtube usando tkinter como gui

A python module to download ISO Standards

Download Thumbnail of YouTube Videos

Tool To download 4KHDR DV SDR from AppleTV

This is a simple Python Script to download Imgur Pictures with the short url!

A Quick demo of how to use the youtube_dl module in python.

1Fichier Download Manager.

Can automatically download mods from a Curseforge modpack

抖音批量下载助手

Simple Youtube Video Downloader

Youtube list to mp3 - Youtube list to mp3 downloader

YoutubeDownloader - Download any public Playlist from Youtube

Download Apple Music Cover Artwork in the best Quality by providing an Apple Music Link. It downloads the jpg, png and webp version since they often differ from another.

The lyrics module of the repository apple-playlist-downloader

PyDownloader - Downloads files and folders at high speed (based on your interent speed).