Quantifiers-and-Negations-in-RE-Documents

This project was part of my work for a seminar at the Technical University of Munich (TUM) during my bachelor's studies in 2019. The Python project finds quantifiers and negations in documents and searches for problematic findings, for example sentences that combine quantifiers and negations in a way that is ambiguous, meaning that the sentence has multiple valid interpretations. "All users are not administrators", for instance, can mean that no user is an administrator or that not every user is one. The project can extract such sentences and report them.

Motivation:

Ambiguous sentences should be avoided because they can cause problems that are hard to find and possibly hard to fix. This is especially true for technical specifications and similar documents. In this project we compare two different approaches to finding ambiguous sentences:

String-based search
NLP-based search

We want to find out whether NLP-based search gives better results than standard string-based search methods, and whether the improvement justifies its computational overhead.
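One way to frame this comparison is to look at which sentences each approach flags and how long each approach takes. The following is a minimal sketch under that assumption; `compare_findings` and `timed` are hypothetical helper names and are not taken from the project.

```python
import time


def compare_findings(string_hits, nlp_hits):
    """Group flagged sentences by which approach reported them."""
    string_hits, nlp_hits = set(string_hits), set(nlp_hits)
    return {
        "both": sorted(string_hits & nlp_hits),
        "string_only": sorted(string_hits - nlp_hits),
        "nlp_only": sorted(nlp_hits - string_hits),
    }


def timed(func, *args):
    """Run func(*args) and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = func(*args)
    return result, time.perf_counter() - start
```

Sentences that appear under "string_only" or "nlp_only" are the interesting cases to inspect by hand, since they show where the two approaches disagree.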

Features:

Detect quantifiers and negations in .xml or .txt documents
Search with either a string-based search or an NLP-based search (using Stanford's CoreNLP library [1])
Extract possibly ambiguous sentences
Compare string search results with NLP search results
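To illustrate the string-based approach, here is a minimal sketch that flags every sentence containing at least one quantifier and one negation. The word lists, the naive sentence splitter, and the function names are assumptions made for this example; the project's actual lists and matching rules may differ.

```python
import re

# Illustrative word lists (assumed, not the project's actual lists).
QUANTIFIERS = {"all", "every", "each", "any", "some", "many", "few"}
NEGATIONS = {"not", "no", "never", "none", "cannot"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def find_candidates(text):
    """Return (sentence, quantifiers, negations) for sentences containing both."""
    results = []
    for sentence in split_sentences(text):
        # Keep contractions such as "don't" together as one token.
        tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", sentence.lower())
        quantifiers = sorted({t for t in tokens if t in QUANTIFIERS})
        negations = sorted(
            {t for t in tokens if t in NEGATIONS or t.endswith("n't")}
        )
        if quantifiers and negations:
            results.append((sentence, quantifiers, negations))
    return results


if __name__ == "__main__":
    sample = (
        "All requirements are not tested. "
        "The system shall log every request. "
        "Some users don't have access."
    )
    for sentence, quantifiers, negations in find_candidates(sample):
        print(sentence, quantifiers, negations)
```

A pure string match like this only detects co-occurrence within a sentence. It cannot tell whether the negation actually interacts with the quantifier, which is the kind of information an NLP-based search could add.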

Prerequisites:

Java 8 or higher
Python 3.6 or higher as project interpreter
Stanford CoreNLP library: https://stanfordnlp.github.io/CoreNLP/download.html
Environment variable "CORENLP_HOME" set to where the CoreNLP library is stored
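Before running the NLP-based search, it can help to verify that the environment variable is set up as expected. The sketch below is one possible check and is not part of the project; the function name is hypothetical, and the jar file pattern is an assumption about how the CoreNLP download is laid out.

```python
import os
from pathlib import Path


def check_corenlp_home(environ=os.environ):
    """Return (ok, message) describing the state of CORENLP_HOME."""
    value = environ.get("CORENLP_HOME")
    if not value:
        return False, "CORENLP_HOME is not set"
    path = Path(value)
    if not path.is_dir():
        return False, f"CORENLP_HOME is not a directory: {path}"
    # Assumption: the unpacked download contains stanford-corenlp-<version>.jar
    if not any(path.glob("stanford-corenlp-*.jar")):
        return False, f"No stanford-corenlp-*.jar found in {path}"
    return True, f"CoreNLP found in {path}"


if __name__ == "__main__":
    ok, message = check_corenlp_home()
    print(message)
```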

References:

[1] Christopher D. Manning, Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David McClosky. The Stanford CoreNLP natural language processing toolkit. In Association for Computational Linguistics (ACL) System Demonstrations, pages 55–60, 2014.

Owner

Nicolas Ruscher
