Code to accompany the paper "Finding Bipartite Components in Hypergraphs", which is published in NeurIPS'21.

Last update: May 06, 2022

Overview

Finding Bipartite Components in Hypergraphs

This repository contains code to accompany the paper "Finding Bipartite Components in Hypergraphs", published in NeurIPS 2021. It provides an implementation of the proposed algorithm based on the new hypergraph diffusion process, as well as the baseline algorithm based on the clique reduction.

Below, you can find instructions for running the code which will reproduce the results reported in the paper.

Feel free to contact me with any questions or comments at [email protected].

Set-up

The code was written to work with Python 3.6, although other versions of Python 3 should also work. We recommend that you run inside a virtual environment.

To install the dependencies of this project, run

pip install -r requirements.txt

Viewing the visualisation

In order to demonstrate our algorithm, you can view the visualisation of the 2-graph constructed at each step by running

python show_visualisation.py

This example was used to create Figure 1 in the paper.

Experiments

In this section, we give instructions for running the experiments reported in the paper.

Penn Treebank Preprocessing

We are unfortunately not able to share the data used for the Penn Treebank experiment, and so we give instructions here for how to preprocess this data for use with our code. You will need to have your own access to the Penn Treebank corpus.

Follow the instructions in this repository, passing the --task pos command line option to generate the files train.tsv, test.tsv, and dev.tsv. Copy these three files to the data/nlp/penn-treebank directory.

Running the real-world experiments

To run the experiments on real-world data, you should run

python run_experiment.py {experiment_name}

where {experiment_name} is one of 'ptb', 'dblp', 'imdb', or 'wikipedia' to run the Penn Treebank, DBLP, IMDB and Wikipedia experiments respectively.

Running the synthetic experiments

To run an experiment on a single synthetic hypergraph, run

python run_experiment_synthetic.py {n} {r} {p} {q}

where {n} is the number of vertices in the hypergraph, {r} is the rank of the hypergraph, {p} is the probability of an edge inside a cluster, and {q} is the probability of an edge between clusters. Be careful not to set p or q to be too large. See the main paper for more information about the random hypergraph model. This will construct the hypergraph if needed, and report the performance of the diffusion algorithm and the clique algorithm on the constructed hypergraph.

Results

The full results from our experiments on synthetic hypergraphs are provided in the data/sbm/results directory, along with a Mathematica notebook for viewing them, and plotting the figures shown in the paper.

Code to accompany the paper "Finding Bipartite Components in Hypergraphs", which is published in NeurIPS'21.

Related tags

Overview

Finding Bipartite Components in Hypergraphs

Set-up

Viewing the visualisation

Experiments

Penn Treebank Preprocessing

Running the real-world experiments

Running the synthetic experiments

Results

Owner

Peter Macgregor

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"

Official Pytorch implementation of C3-GAN

HuSpaCy: industrial-strength Hungarian natural language processing

Official implementation of our paper "LLA: Loss-aware Label Assignment for Dense Pedestrian Detection" in Pytorch.

To SMOTE, or not to SMOTE?

CaLiGraph Ontology as a Challenge for Semantic Reasoners ([email protected]'21)

The official code of "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Spherical Confidence Learning for Face Recognition, accepted to CVPR2021.

Personalized Transfer of User Preferences for Cross-domain Recommendation (PTUPCDR)

nn_builder lets you build neural networks with less boilerplate code

Springer Link Download Module for Python

MultiSiam: Self-supervised Multi-instance Siamese Representation Learning for Autonomous Driving

Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper

Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

ELSED: Enhanced Line SEgment Drawing

Implementation of Sequence Generative Adversarial Nets with Policy Gradient

Generate saved_model, tfjs, tf-trt, EdgeTPU, CoreML, quantized tflite and .pb from .tflite.

An elaborate and exhaustive paper list for Named Entity Recognition (NER)

A deep learning library that makes face recognition efficient and effective

Discovering Interpretable GAN Controls [NeurIPS 2020]