Machine-in-the-Loop Rewriting for Creative Image Captioning

Last update: Jul 24, 2022

Data

Annotated sources of data used in the paper:

Data Source	URL
Mohammed et al.	Link
Gordon et al.	Link
Bostan et al.	Link
Niculae et al.	Link
Steen et al.	Link

TODO: Individual data cleaning scripts

Model Training

Follow the README in the model_training directory to train a Fairseq BART model. Contact the owner to request our trained model.

Interface

Code to run the UI we used for interactive experiments. The UI runs as a server and requires a GPU on the backend for model inference during interaction. The code saves each interaction under a unique ID, which we use to match interactions to our crowdworkers for experimental analysis.
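
A minimal sketch of the ID-based interaction logging described above, not the repository's actual code. The function names, the record fields (draft, suggestion, final_text), and the one-JSON-file-per-interaction layout are all assumptions made for illustration.

```python
import json
import uuid
from datetime import datetime, timezone
from pathlib import Path


def save_interaction(log_dir, draft, suggestion, final_text):
    """Write one interaction to its own JSON file and return its unique ID.

    Hypothetical helper: the field names are illustrative, not the
    schema used by the actual interface.
    """
    interaction_id = uuid.uuid4().hex  # random 32-character hex string
    record = {
        "interaction_id": interaction_id,
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "draft": draft,            # text the writer started from
        "suggestion": suggestion,  # model output shown to the writer
        "final_text": final_text,  # text the writer submitted
    }
    log_dir = Path(log_dir)
    log_dir.mkdir(parents=True, exist_ok=True)
    path = log_dir / f"{interaction_id}.json"
    path.write_text(json.dumps(record, indent=2), encoding="utf-8")
    return interaction_id


def load_interaction(log_dir, interaction_id):
    """Read back the record saved under the given ID."""
    path = Path(log_dir) / f"{interaction_id}.json"
    return json.loads(path.read_text(encoding="utf-8"))
```

The returned ID could, for example, be shown to a crowdworker as a completion code so that saved interactions can later be joined to worker records. That is one possible matching scheme, assumed here rather than taken from the code.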

TODO: Data Processing Scripts to filter results

Owner

Vishakh P

Supplemental Code for "ImpressionNet :A Multi view Approach to Predict Socio Facial Impressions"

Supplemental Code for "ImpressionNet :A Multi view Approach to Predict Socio Facial Impressions" Environment requirement This code is based on Python

1 Dec 19, 2021

DUE: End-to-End Document Understanding Benchmark

This repository provides tools to download data, reproduce the baseline results, and run evaluation. What can you achieve with this guide Based

21 Dec 29, 2022

Code for generating the figures in the paper "Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?"

Code for running simulations for the paper "Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Lin

1 Nov 22, 2022

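2020 CCF Big Data & Computing Intelligence Contest: Privacy Information Recognition in Unstructured Business Text, 7th-place solution

2020CCF-NER 2020 CCF Big Data & Computing Intelligence Contest: Privacy Information Recognition in Unstructured Business Text, 7th-place solution. bert base + flat + crf + fgm + swa + pu learning strategy + clue dataset = 0.906 single-model score on test1. Word vectors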

67 Oct 19, 2022

Totally Versatile Miscellanea for Pytorch

Totally Versatile Miscellanea for PyTorch Thomas Viehmann Thi

428 Dec 28, 2022

Reinforcement Learning for Automated Trading

Reinforcement Learning for Automated Trading This thesis has been realized for the obtention of the Master's in Mathematical Engineering at the Polite

80 Jun 19, 2022

Instant Real-Time Example-Based Style Transfer to Facial Videos

FaceBlit: Instant Real-Time Example-Based Style Transfer to Facial Videos The official implementation of FaceBlit: Instant Real-Time Example-Based Sty

131 Dec 19, 2022

KGDet: Keypoint-Guided Fashion Detection (AAAI 2021)

KGDet: Keypoint-Guided Fashion Detection (AAAI 2021) This is an official implementation of the AAAI-2021 paper "KGDet: Keypoint-Guided Fashion Detecti

35 Dec 29, 2022

Simple (but Strong) Baselines for POMDPs

Recurrent Model-Free RL is a Strong Baseline for Many POMDPs Welcome to the POMDP world! This repo provides some simple baselines for POMDPs, specific

172 Dec 29, 2022

GraPE is a Rust/Python library for high-performance Graph Processing and Embedding.

GraPE GraPE (Graph Processing and Embedding) is a fast graph processing and embedding library, designed to scale with big graphs and to run on both of

194 Dec 29, 2022

Image-to-Image Translation with Conditional Adversarial Networks (Pix2pix) implementation in keras

pix2pix-keras Pix2pix implementation in keras. Original paper: Image-to-Image Translation with Conditional Adversarial Networks (pix2pix) Paper Author

141 Dec 30, 2022

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing Figure: Joint multi-attribute edits using DyStyle model. Great diversity

74 Dec 03, 2022

Randomizes the warps in a stock pokeemerald repo.

pokeemerald warp randomizer Randomizes the warps in a stock pokeemerald repo. Usage Instructions Install networkx and matplotlib via pip3 or similar.

6 Mar 17, 2022

Kaggle Lyft Motion Prediction for Autonomous Vehicles 4th place solution

Lyft Motion Prediction for Autonomous Vehicles Code for the 4th place solution of Lyft Motion Prediction for Autonomous Vehicles on Kaggle. Discussion

44 Jun 27, 2022

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items This repository co

3 Mar 16, 2022

Intrinsic Image Harmonization

It's a powerful version of linebot

Evolving neural network parameters in JAX.

A Multi-modal Perception Tracker (MPT) for speaker tracking using both audio and visual modalities

Code for "FPS-Net: A convolutional fusion network for large-scale LiDAR point cloud segmentation".