open-information-extraction-system, build open-knowledge-graph(SPO, subject-predicate-object) by pyltp(version==3.4.0)

Last update: Nov 02, 2022

Overview

Open-Information-Extraction-System

中文开放信息抽取系统, open-information-extraction-system, build open-knowledge-graph(SPO, subject-predicate-object) by pyltp(version==3.4.0)

码源分析

基于LTP依存句法分析(DP, dependency parsing)的中文开放信息抽取系统(rule-based)。

增加并列关系、左附加关系、右附加关系等(递归实现);
这里的依存句法分析只适合简单短句，过长句子、口语化句子dp效果不好会很影响下游抽取。

结果展示(部分)

{
    "ques": "郑州是那个省的",
    "answer": [
        "河南"
    ],
    "desc": "郑州是河南省省会城市，周边有洛阳、开封、新郑、新密、许昌等城市",
    "SPO": [
        [
            "郑州",
            "是",
            "那个省"
        ]
    ]
},
{
    "ques": "格林童话《灰姑娘》中,灰姑娘参加舞会时所做的车是由哪种植物变成的?",
    "answer": [
        "南瓜"
    ],
    "desc": "这时，有一位仙女出现了，帮助她摇身一变成为高贵的千金小姐，并将老鼠变成马夫，南瓜变成马车，又变了一套漂亮的衣服和一双水晶（玻璃）鞋给灰姑娘穿上。",
    "SPO": [
        [
            "灰姑娘",
            "参加",
            "舞会"
        ],
        [
            "灰姑娘",
            "参加",
            "舞会"
        ],
        [
            "做车",
            "是",
            "变成"
        ]
    ]
 },
 {
    "ques": "中国农历的哪个节气有着北方吃饺子、南方吃汤圆的习俗?",
    "answer": [
        "冬至"
    ],
    "desc": "在冬至节，中国北方有冬至日吃饺子的习俗，南方某些地方有冬至日吃汤圆、粉糍粑的习俗，传说在汉朝的医圣张仲景体念家乡乡民在寒冬中工作的辛苦，在冬至那天利用羊肉等祛寒的药材包在面皮中，作成耳朵的样子，给乡民们治病补身，这个药方的名字...",
    "SPO": [
        [
            "中国农历哪个节气",
            "有着",
            "吃饺子习俗"
        ],
        [
            "北方",
            "吃",
            "饺子"
        ],
        [
            "南方",
            "吃",
            "汤圆"
        ]
    ]
}

open-information-extraction-system, build open-knowledge-graph(SPO, subject-predicate-object) by pyltp(version==3.4.0)

Related tags

Overview

Open-Information-Extraction-System

码源分析

结果展示(部分)

资源&依赖

Owner

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

Findings of ACL 2021

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"

A collection of Korean Text Datasets ready to use using Tensorflow-Datasets.

Awesome Treasure of Transformers Models Collection

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Full Spectrum Bioinformatics - a free online text designed to introduce key topics in Bioinformatics using the Python

This project converts your human voice input to its text transcript and to an automated voice too.

An evaluation toolkit for voice conversion models.

HuggingTweets - Train a model to generate tweets

Beta Distribution Guided Aspect-aware Graph for Aspect Category Sentiment Analysis with Affective Knowledge. Proceedings of EMNLP 2021

Higher quality textures for the Metal Gear Solid series.

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

ChatBotProyect - This is an unfinished project about a simple chatbot.

SAVI2I: Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors

Skipgram Negative Sampling in PyTorch

code for modular summarization work published in ACL2021 by Krishna et al

Black for Python docstrings and reStructuredText (rst).

texlive expressions for documents

KR-FinBert And KR-FinBert-SC