Neural search engine for AI papers

Last update: Dec 24, 2022

Papers search

Neural search engine for ML papers.

Demo

Usage is simple: input an abstract and get the matching papers. The demo also showcases the finetuning functionality: a paper marked as "irrelevant" is assigned a lower score after finetuning.

Dataset

We used a stripped-down version of the Kaggle arXiv Dataset in which only the following categories are retained: cs.AI, cs.CL, cs.CV, cs.LG, cs.MA, cs.NE.
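
As an illustration of this kind of category filter, here is a minimal sketch. It is not the script used to build the project's dataset. It assumes the metadata is stored as JSON lines, that each record has a `categories` field holding space-separated category codes, and that a paper is kept if any of its categories is in the retained set. The function names and the sample records are hypothetical.

```python
import json

# Categories retained, as listed above.
KEPT_CATEGORIES = {"cs.AI", "cs.CL", "cs.CV", "cs.LG", "cs.MA", "cs.NE"}


def keep_record(record):
    """Return True if the paper has at least one retained category."""
    # Assumption: `categories` is a space-separated string, e.g. "cs.LG stat.ML".
    return bool(KEPT_CATEGORIES & set(record.get("categories", "").split()))


def filter_lines(lines):
    """Yield parsed records from JSON-lines input that match a retained category."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        if keep_record(record):
            yield record


# Hypothetical sample records standing in for lines of the metadata file.
sample = [
    '{"id": "0001", "categories": "cs.LG stat.ML", "abstract": "A"}',
    '{"id": "0002", "categories": "hep-ph", "abstract": "B"}',
    '{"id": "0003", "categories": "cs.CV", "abstract": "C"}',
]
kept_ids = [record["id"] for record in filter_lines(sample)]
print(kept_ids)
```

Because `filter_lines` accepts any iterable of strings, an open file handle can be passed in directly, so a large metadata file can be streamed line by line instead of being loaded into memory.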

Setting up the environment

Clone the repository

git clone https://github.com/fissoreg/papers-search/
cd papers-search

Run the following commands in both the frontend and backend folders

cd folder_to_go_into/ # `folder_to_go_into` is either `frontend` or `backend`

python3 -m venv env
source env/bin/activate

pip install --upgrade pip
pip install -r requirements.txt

Indexing

The app suggests papers whose abstracts are similar to the one you provide. The suggestions come from a database of published papers, which must be indexed before the system can work. Indexing is a lengthy operation, but it only needs to be performed once:

cd backend
python src/app.py --index

For testing, you can index a small number of papers by passing the --n argument:

python src/app.py --index --n 10
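
To make the "index once, query many times" idea concrete, here is a minimal sketch of similarity search over abstracts. It is not the project's implementation. The `embed` function is a toy bag-of-words stand-in for a neural sentence encoder, and all function names and sample papers are hypothetical. Only the overall shape (embed every abstract up front, then rank by similarity to the query) reflects the description above.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for a neural encoder: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(papers):
    """Embed every abstract once, up front; this is the expensive step."""
    return [(title, embed(abstract)) for title, abstract in papers]


def search(index, query_abstract, top_k=3):
    """Rank indexed papers by similarity to the query abstract."""
    query = embed(query_abstract)
    scored = [(cosine(query, vector), title) for title, vector in index]
    return sorted(scored, reverse=True)[:top_k]


# Hypothetical papers as (title, abstract) pairs.
papers = [
    ("Paper A", "Convolutional networks for image classification"),
    ("Paper B", "Transformers for machine translation of text"),
    ("Paper C", "Reinforcement learning for robot control"),
]
index = build_index(papers)
results = search(index, "image classification with deep convolutional networks", top_k=2)
print(results)
```

With a real encoder, `build_index` dominates the cost because it runs the model over every abstract, which is why indexing is slow but only needed once; each query then requires a single embedding plus a similarity ranking.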

Running the app

Run the app only after indexing is complete (see the section above).

Run the backend

cd backend
python3 src/app.py

In a new terminal, run the frontend

cd frontend
streamlit run app.py

Open http://localhost:8501/ in your favourite browser.

Formatting, linting and testing

Refer to the Makefile for the specific commands

To format code following the black standard

$ make format

Code linting with flake8

$ make lint

Testing

$ make testdeps
$ make test

Testing with coverage analysis

$ make coverage

Format, test and coverage

$ make build

Contributing

This project is in its starting phase. If you are interested in contributing, don't hesitate to get in touch! (Or go straight to the Issues ;)).

Acknowledgements

Made possible by:

Jina AI
Sentence-Transformers
arXiv: Thank you to arXiv for use of its open access interoperability.
Kaggle

Owner

Giancarlo Fissore
