GIANT

Code and data for the paper "GIANT: Scalable Creation of a Web-scale Ontology"

https://arxiv.org/pdf/2004.02118.pdf

Please cite our paper if this project is helpful to your work or research, thanks.

How to run

Download the required files: Stanford CoreNLP (https://stanfordnlp.github.io/CoreNLP/download.html) and the Chinese word embedding (https://ai.tencent.com/ailab/nlp/embedding.html). For the word embedding, see the note at the bottom.

Revise paths and put files in the appropriate locations. File paths are defined in common/constants.py, so edit that file to match your own setup. Do the same for the paths defined in some other source files.
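As a rough illustration of the path step above, the sketch below shows one way to keep such paths in a single place and check them before a run. It is not the content of common/constants.py: the names DATA_ROOT, CORENLP_DIR, EMBEDDING_FILE and missing_paths, the GIANT_DATA_ROOT environment variable, and the directory layout are all assumptions made for this example.

```python
import os
from pathlib import Path

# Hypothetical names and layout; the real constants in
# common/constants.py may be named and organized differently.
DATA_ROOT = Path(os.environ.get("GIANT_DATA_ROOT", "~/Datasets")).expanduser()
CORENLP_DIR = DATA_ROOT / "stanford-corenlp"
EMBEDDING_FILE = DATA_ROOT / "embedding" / "chinese_embedding.txt"


def missing_paths(paths):
    """Return the entries of `paths` that do not exist on disk."""
    return [p for p in paths if not Path(p).exists()]


# Report anything that still needs to be downloaded or re-pointed.
for path in missing_paths([CORENLP_DIR, EMBEDDING_FILE]):
    print(f"missing: {path}")
```

A check like this before training can catch a wrong path early, instead of partway through data loading.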

Test run

python3 GIANT_main.py \
    --data_type concept \
    --train_file "../../../../Datasets/original/concept/concepts.json" \
    --emb_tags \
    --task_output_dims 2 \
    --tasks "phrase" \
    --edge_types_list "seq" "dep" "contain" "synonym" \
    --d_model 32 \
    --layers 3 \
    --num_bases 5 \
    --epochs 10 \
    --mode train \
    --debug

Note: adding --processed_emb to the command above prevents re-processing the word embeddings, which is time consuming. In that case you also don't need to download the Chinese word embedding file, which is quite big. In our experience, adding word embedding features to the node features is not very helpful for our tasks, so it should be safe to ignore the word embedding features in your experiments. If you do not use word embeddings, you may need to revise data_loader.py to avoid some runtime errors. You can still try to improve results with word embeddings.
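To switch between runs with and without --processed_emb, the test-run command can be assembled in a small script. The sketch below uses only the flags listed in this README. The helper build_command is a hypothetical name for this example and is not part of the repository.

```python
import shlex


def build_command(train_file, processed_emb=False):
    """Assemble the test-run command from this README as an argument list.

    Hypothetical helper for illustration; it only repeats the flags
    shown in the README and optionally appends --processed_emb.
    """
    cmd = [
        "python3", "GIANT_main.py",
        "--data_type", "concept",
        "--train_file", train_file,
        "--emb_tags",
        "--task_output_dims", "2",
        "--tasks", "phrase",
        "--edge_types_list", "seq", "dep", "contain", "synonym",
        "--d_model", "32",
        "--layers", "3",
        "--num_bases", "5",
        "--epochs", "10",
        "--mode", "train",
        "--debug",
    ]
    if processed_emb:
        # Reuse already processed embeddings instead of re-processing them.
        cmd.append("--processed_emb")
    return cmd


# Print a shell-ready command line (shlex.join needs Python 3.8+).
print(shlex.join(build_command(
    "../../../../Datasets/original/concept/concepts.json",
    processed_emb=True,
)))
```

The returned list can also be passed directly to subprocess.run, which avoids shell quoting problems in the file path.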


