ICLR 2022 Paper submission trend analysis

Last update: Dec 06, 2022

Related tags

Data Analysis ICLR2022-OpenReviewData

Overview

Visualize ICLR 2022 OpenReview Data

ICLR 2022 Paper submission analysis from https://openreview.net/group?id=ICLR.cc/2022/Conference

Requirements

pip install wordcloud nltk pandas imageio selenium tqdm

download nltk packages

import nltk
nltk.download('punkt')
nltk.download('averaged_perceptron_tagger')
nltk.download('wordnet')
nltk.download('stopwords')

if you got anything wrong when calling webdriver.Edge('msedgedriver.exe'), you can

Delete msedgedriver.exe since it may only work on my computer (Windows)
Install Microsoft Edge (Chromium): Ensure you have installed Microsoft Edge (Chromium). To confirm that you have Microsoft Edge (Chromium) installed, go to edge://settings/help in the browser, and verify the version number is Version 75 or later.
Download Microsoft Edge Driver:
- Go to edge://settings/help to get the version of Edge.
Navigate to the Microsoft Edge Driver downloads page and download the driver that matches the Edge version number.

From https://stackoverflow.com/questions/63529124/how-to-open-up-microsoft-edge-using-selenium-and-python

Crawl Data

Run crawl_paperlist.py to crawl the list of papers (~0.5h).

Paper List (3,407 submission in total

crawl_paperlist.py only crawls 3,000 papers, but it has 3,407 in total. The full paper list are in follows:

Visualization

Keywords Frequency

The top 50 common keywords (uncased) and their frequency:

Keywords Cloud

The word clouds formed by keywords of submissions show the hot topics including deep learning, reinforcement learning, representation learning, graph neural network, etc.

Title Keywords Frequency

The top 50 common title keywords (uncased) and their frequency:

Title Keywords Cloud

The word clouds formed by keywords of submission titles:

Acknowledgment

Inspired by this repo: https://github.com/evanzd/ICLR2021-OpenReviewData

ICLR 2022 Paper submission trend analysis

Related tags

Overview

Visualize ICLR 2022 OpenReview Data

Requirements

Crawl Data

Paper List (3,407 submission in total

Visualization

Acknowledgment

Owner

Jintang Li

PipeChain is a utility library for creating functional pipelines.

Python script to automate the plotting and analysis of percentage depth dose and dose profile simulations in TOPAS.

bigdata_analyse 大数据分析项目

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

Multiple Pairwise Comparisons (Post Hoc) Tests in Python

Candlestick Pattern Recognition with Python and TA-Lib

An experimental project I'm undertaking for the sole purpose of increasing my Python knowledge

Demonstrate the breadth and depth of your data science skills by earning all of the Databricks Data Scientist credentials

Tools for the analysis, simulation, and presentation of Lorentz TEM data.

Data imputations library to preprocess datasets with missing data

Maximum Covariance Analysis in Python

In this tutorial, raster models of soil depth and soil water holding capacity for the United States will be sampled at random geographic coordinates within the state of Colorado.

A real-time financial data streaming pipeline and visualization platform using Apache Kafka, Cassandra, and Bokeh.

yt is an open-source, permissively-licensed Python library for analyzing and visualizing volumetric data.

INF42 - Topological Data Analysis

A Pythonic introduction to methods for scaling your data science and machine learning work to larger datasets and larger models, using the tools and APIs you know and love from the PyData stack (such as numpy, pandas, and scikit-learn).

This program analyzes a DNA sequence and outputs snippets of DNA that are likely to be protein-coding genes.

Hangar is version control for tensor data. Commit, branch, merge, revert, and collaborate in the data-defined software era.

Analysis of a dataset of 10000 passwords to find common trends and mistakes people generally make while setting up a password.

Parses data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)