PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

Last update: Dec 12, 2022

Overview

Maria: A Visual Experience Powered Conversational Agent

This repository is the PyTorch implementation of our ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

In this paper, we present Maria, a neural conversation agent powered by visual world experiences retrieved from a large-scale image index. Maria consists of three flexible components: a text-to-image retriever, a visual concept detector, and a visual-knowledge-grounded response generator.
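To make the data flow between the three components concrete, here is a minimal sketch in plain Python. It is not the code from this repository: the toy image index, the function names (`retrieve_image`, `detect_concepts`, `generate_response`, `respond`), and the word-overlap and template logic are all hypothetical stand-ins for the learned retriever, detector, and generator.

```python
import re
from typing import Dict, List, Set

# Hypothetical toy "image index": image id -> keywords describing the image.
IMAGE_INDEX: Dict[str, Set[str]] = {
    "img_beach": {"beach", "sea", "sand", "surfboard"},
    "img_kitchen": {"kitchen", "stove", "pan", "pasta"},
    "img_park": {"park", "dog", "frisbee", "grass"},
}

# Hypothetical detector output: image id -> visual concepts found in the image.
DETECTED_CONCEPTS: Dict[str, List[str]] = {
    "img_beach": ["wave", "surfboard", "sand"],
    "img_kitchen": ["pan", "stove", "pasta"],
    "img_park": ["dog", "frisbee", "grass"],
}


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and split it into a set of alphabetic tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def retrieve_image(context: str) -> str:
    """Stage 1 stand-in: pick the indexed image with the largest word overlap."""
    query = tokenize(context)
    # Iterate over sorted ids so ties are broken deterministically.
    return max(sorted(IMAGE_INDEX), key=lambda i: len(query & IMAGE_INDEX[i]))


def detect_concepts(image_id: str) -> List[str]:
    """Stage 2 stand-in: look up precomputed concepts instead of running a detector."""
    return DETECTED_CONCEPTS[image_id]


def generate_response(context: str, concepts: List[str]) -> str:
    """Stage 3 stand-in: a template that ignores the context, where a real generator would condition on it."""
    return f"That reminds me of {', '.join(concepts)}."


def respond(context: str) -> str:
    """Chain the three stages: retrieve an image, extract concepts, generate a reply."""
    image_id = retrieve_image(context)
    concepts = detect_concepts(image_id)
    return generate_response(context, concepts)


if __name__ == "__main__":
    print(respond("My dog loves the park"))
```

The point of the sketch is only the interface between stages: the dialog context selects an image, the image yields visual concepts, and the generator conditions on both the context and those concepts.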

Coming soon!

Dependencies

Python 3.7
PyTorch 1.4.0
Ubuntu 18.04
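
As a convenience, the versions listed above can be compared against the local environment before running anything. The snippet below is a small sketch, not part of this repository; the helper names `version_notes` and `check_environment` are hypothetical, and the check treats the listed versions as the expected ones rather than as strict requirements.

```python
import sys
from importlib import util
from typing import List, Optional, Tuple


def version_notes(python_version: Tuple[int, int],
                  torch_version: Optional[str]) -> List[str]:
    """Return human-readable notes for versions that differ from the listed ones."""
    notes = []
    if python_version != (3, 7):
        notes.append("Python %d.%d found, 3.7 listed" % python_version)
    if torch_version is None:
        notes.append("PyTorch not installed, 1.4.0 listed")
    elif not torch_version.startswith("1.4.0"):
        notes.append("PyTorch %s found, 1.4.0 listed" % torch_version)
    return notes


def check_environment() -> List[str]:
    """Collect the local Python and PyTorch versions and compare them."""
    torch_version = None
    if util.find_spec("torch") is not None:
        import torch  # imported lazily so the check works without PyTorch
        torch_version = torch.__version__
    return version_notes(tuple(sys.version_info[:2]), torch_version)


if __name__ == "__main__":
    for note in check_environment() or ["Environment matches the listed versions."]:
        print(note)
```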

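Usage

Usage is organized around three models: the Text-to-Image Retrieval Model, the Bottom-up Detector Model, and the Dialog Generation Model.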

Citation

If you find this paper helpful for your research, please consider citing it in your publications.

@inproceedings{liang2021maria,
   title={Maria: A Visual Experience Powered Conversational Agent},
   author={Liang, Zujie and Hu, Huang and Xu, Can and Tao, Chongyang and Geng, Xiubo and Chen, Yining and Liang, Fan and Jiang, Daxin},
   booktitle={Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL)},
   year={2021}
}

Acknowledgment

Special thanks to the authors of OSCAR, vokenization, and py-bottom-up-attention.

Owner

Jokie
