Python library for parsing resumes using natural language processing and machine learning

Last update: Jul 29, 2021

Overview

CVParser

Python library for parsing resumes using natural language processing and machine learning.

Setup

Installation on Linux and Mac OS

Follow the guide here on how to clone or fork a repo
Follow the guide here on how to create virtualenv

To create a normal virtualenv (example myvenv) and activate it (see Code below).

$ virtualenv --python=python3 myvenv

$ source myvenv/bin/activate

(myvenv) $ pip install -r requirements.txt

Usage

from cvparser.parser import CVParser

CVParser.download_nlk_data()


parser = CVParser(file_path="path/to/file.[pdf|doc|docx|png|jpeg]")
parser.parse()
print(parser.json())

Re-training the Model

cd into the train folder.
Delete the folder model and the file train.json.
Copy your new training data into the train folder. The train data must be in json. This can be generated using the data annotation tool called Dataturk. The file containing the training data must be named train.json.
Then, start re-training the model by execute the python script in the train folder named manual_training.py.
Then test your new model by #usage .

Python library for parsing resumes using natural language processing and machine learning

Related tags

Overview

CVParser

Setup

Installation on Linux and Mac OS

Usage

Re-training the Model

Owner

nafiu

Smart discord chatbot integrated with Dialogflow

Fast, general, and tested differentiable structured prediction in PyTorch

CCKS-Title-based-large-scale-commodity-entity-retrieval-top1

Spacy-ginza-ner-webapi - Named Entity Recognition API with spaCy and GiNZA

Wrapper to display a script output or a text file content on the desktop in sway or other wlroots-based compositors

PUA Programming Language written in Python.

Package for controllable summarization

RIDE automatically creates the package and boilerplate OOP Python node scripts as per your needs

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x using fastT5.

Problem: Given a nepali news find the category of the news

Chinese real time voice cloning (VC) and Chinese text to speech (TTS).

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Facilitating the design, comparison and sharing of deep text matching models.

Tools, wrappers, etc... for data science with a concentration on text processing

This is Assignment1 code for the Web Data Processing System.

中文医疗信息处理基准CBLUE: A Chinese Biomedical LanguageUnderstanding Evaluation Benchmark

Nateve compiler developed with python.

AMUSE - financial summarization

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Protein Language Model