Unsub is a collection analysis tool that assists libraries in analyzing their journal subscriptions.

Last update: Nov 16, 2022

Overview

About

Unsub is a collection analysis tool that assists libraries in analyzing their journal subscriptions. The tool provides rich data and a summary graph, but more detailed analysis tends to take place off the site in an exported .csv file that allows for filtering, notes, and additional visualization.

This project, Unsub Extender, is a Python script that takes an Unsub data export file and automates useful plots and visualizations for a collection analysis team to explore. The graphs are built with Altair and are interactive, supporting zoom, pan, and hover. Filters in the left sidebar set parameters that quickly narrow in on obvious titles to KEEP or CANCEL. The Python code is turned into a web app using Streamlit.
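The idea of narrowing a title list with filters can be illustrated with a minimal sketch. This is not the app's own code: the function name `narrow`, the two thresholds, and the sample rows are all hypothetical, and the actual sidebar filters may work differently. It only assumes the `cpu` and `usage` columns listed under Requirements.

```python
def narrow(rows, max_cpu=None, min_usage=None):
    """Keep rows whose cpu is at most max_cpu and whose usage is at least min_usage.

    A threshold left as None is not applied.
    """
    kept = []
    for row in rows:
        cpu = float(row["cpu"])
        usage = float(row["usage"])
        if max_cpu is not None and cpu > max_cpu:
            continue
        if min_usage is not None and usage < min_usage:
            continue
        kept.append(row)
    return kept


# Made-up rows for illustration only; not real Unsub data.
rows = [
    {"title": "Journal A", "cpu": "2.50", "usage": "1200"},
    {"title": "Journal B", "cpu": "85.00", "usage": "15"},
]

cheap_and_used = narrow(rows, max_cpu=10, min_usage=100)
print([r["title"] for r in cheap_and_used])
```

Titles that pass a low cost-per-use and high usage filter like this one would be natural candidates to keep; inverting the thresholds would surface candidates to cancel.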

Hosting provided by Iowa State University.

Requirements

An export .csv file - from an Unsub project, choose "Export - Download as spreadsheet".

A .csv file will be saved, which is the input to Unsub Extender.

The .csv file must contain the following columns, in any order, named exactly as shown:

title
downloads
citations
authorships
usage
subscription_cost
subscribed
cpu
cpu_rank
use_ill_percent
use_oa_percent
use_other_delayed_percent

These should already be the default column names assigned by Unsub in the file export.
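Before uploading, a file can be checked against the column list above. The following is a minimal sketch using only the Python standard library; the helper name `missing_columns` and the sample header are hypothetical and are not part of Unsub Extender itself.

```python
import csv
import io

# Column names copied from the list above; matching is exact and case-sensitive.
REQUIRED_COLUMNS = {
    "title", "downloads", "citations", "authorships", "usage",
    "subscription_cost", "subscribed", "cpu", "cpu_rank",
    "use_ill_percent", "use_oa_percent", "use_other_delayed_percent",
}


def missing_columns(csv_file):
    """Return a sorted list of required column names absent from the header row."""
    reader = csv.reader(csv_file)
    header = next(reader, [])  # an empty file yields an empty header
    return sorted(REQUIRED_COLUMNS - set(header))


# In-memory stand-in for an export file (made-up header, not real Unsub data).
sample = io.StringIO("title,downloads,usage,subscribed,cpu\n")
print(missing_columns(sample))
```

With a real export, the same function can be given an open file, for example `open("export.csv", newline="")`, where the file name is a placeholder. An empty result means every required column is present.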

subscribed column

The subscribed column is especially important, as it determines the color-coding of data points in several of the graphs. TRUE and FALSE are conventions carried over from Unsub, MAYBE is supported as a third option for future consideration, and leaving the cell blank colors that journal's data point grey. The column accepts the following values:

TRUE - A title to keep, displayed in blue
FALSE - A title to cancel, displayed in red
MAYBE - A title to think more about, displayed in green
(blank) - A title with no decision yet, displayed in grey
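The value-to-color rule above can be written as a small lookup. This is an illustrative sketch, not the app's actual code: the names `SUBSCRIBED_COLORS` and `color_for` are hypothetical, and two behaviors are assumptions not stated in the source, namely ignoring case and surrounding whitespace, and falling back to grey for unrecognized values.

```python
# Mapping taken from the list above; the empty string stands for a blank cell.
SUBSCRIBED_COLORS = {
    "TRUE": "blue",
    "FALSE": "red",
    "MAYBE": "green",
    "": "grey",
}


def color_for(subscribed_value):
    """Return the display color for one value of the subscribed column."""
    # Assumption: normalize case and whitespace, and treat None as blank.
    key = (subscribed_value or "").strip().upper()
    # Assumption: anything unrecognized is treated like a blank cell.
    return SUBSCRIBED_COLORS.get(key, "grey")


for value in ["TRUE", "FALSE", "MAYBE", ""]:
    print(repr(value), "->", color_for(value))
```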

Usage

Hosted by Iowa State University

Navigate to https://unsubextender.lib.iastate.edu to run in browser

License

See LICENSE file

Credits

Eric Schares
Nick Booher
unsub

Releases (v1.3)

v1.3 (Jun 16, 2022)

Text in How to Use dropdown

Change from using 'era_subjects' to 'subjects' column, while still keeping logic to handle 'era_subjects'

Update to Streamlit 1.9.0

Add award information

v1.2 (Mar 24, 2022)

add RUSA/ETS BETA award information

add live demo webinar links

'perpetual_access_years' and '_text' conversions

Support blank Subscribed status

Logic for 'subject' and 'era_subjects' str convert

removed streamlit-analytics, was not compatible with Streamlit 1.3.1

latest pandas (1.4) was causing AssertionError, downgraded to 1.3.5

fixed trailing underscore in exported dataset filename

added dropdown to show user what they changed before exporting new data

v1.1 (Jan 25, 2022)
Upgrading to Streamlit 1.3.1, uses less memory and adds features

expander out of beta

columns out of beta

new download and exporter function

added era_subjects column to example dataset

needed to convert subject columns to strings

add 'subscription' to IF% calculation

add versioning in About

v1.0 (Aug 6, 2021)

First major official release, stable version. Get a DOI.

Owner

Collection Analysis Librarian at Iowa State University

