Python binding for Morfologik

Morfologik is Polish morphological analyzer. For more information see http://github.com/morfologik/morfologik-stemming/ and http://http://www.morfologik.blogspot.com/

Requirements

This binding works with Python 2 and Python 3.

Installation

Install it from pip

pip install pyMorfologik

or directly from github

git clone https://github.com/dmirecki/pyMorfologik.git

Usage

Now, only simple stems are supported:

>>> from pymorfologik import Morfologik
>>> from pymorfologik.parsing import ListParser
>>>
>>> parser = ListParser()
>>> stemmer = Morfologik()
>>> stemmer.stem(['Ala ma kota'], parser)
[(u'Ala',
  {u'Al': [u'subst:sg:acc:m1+subst:sg:gen:m1'],
   u'Ala': [u'subst:sg:nom:f'],
   u'Alo': [u'subst:sg:acc:m1+subst:sg:gen:m1']}),
 (u'ma',
  {u'mieć': [u'verb:fin:sg:ter:imperf:refl.nonrefl'],
   u'mój': [u'adj:sg:nom.voc:f:pos']}),
 (u'kota', {u'kot': [u'subst:sg:acc:m1'], u'kota': [u'subst:sg:nom:f']})]

Acknowledgements

This repo is based on Morfologik, a great contribution of Marcin Miłowski (http://marcinmilkowski.pl) and Dawid Weiss (http://www.dawidweiss.com).

Contributions

Damian Mirecki

Adrian Bohdanowicz

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Related tags

Overview

Python binding for Morfologik

Requirements

Installation

Usage

Acknowledgements

Contributions

Owner

Damian Mirecki

Graph Coloring - Weighted Vertex Coloring Problem

Facilitating the design, comparison and sharing of deep text matching models.

BiNE: Bipartite Network Embedding

The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywords

Implementation of Fast Transformer in Pytorch

Natural language computational chemistry command line interface.

BERT score for text generation

Resources for "Natural Language Processing" Coursera course.

ConvBERT-Prod

Dust model dichotomous performance analysis

A tool helps build a talk preview image by combining the given background image and talk event description

Scikit-learn style model finetuning for NLP

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

TEACh is a dataset of human-human interactive dialogues to complete tasks in a simulated household environment.

Use fastai-v2 with HuggingFace's pretrained transformers

Sentiment Analysis Project using Count Vectorizer and TF-IDF Vectorizer

Ecommerce product title recognition package

Pytorch version of BERT-whitening

SimCSE: Simple Contrastive Learning of Sentence Embeddings