A Python framework for conversational search

Last update: Oct 23, 2022

Related tags

Deep Learning conversational-search

Overview

Chatty Goose

Multi-stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting

Installation

Make sure Java 11+ and Python 3.7+ are installed
Install the chatty-goose PyPI module

pip install chatty-goose

If you are using T5 or BERT, make sure to install PyTorch 1.4.0 - 1.7.1 using your specific platform instructions. Note that PyTorch 1.8 is currently incompatible due to the transformers version we currently use. Also make sure to install the corresponding torchtext version.
Download the English model for spaCy

python -m spacy download en_core_web_sm

Quickstart Guide

The following example shows how to initialize a searcher and build a ConversationalQueryRewriter agent from scratch using HQE and T5 as first-stage retrievers, and a BERT reranker. To see a working example agent, see chatty_goose/agents/chat.py.

First, load a searcher

from pyserini.search import SimpleSearcher

# Option 1: load a prebuilt index
searcher = SimpleSearcher.from_prebuilt_index("INDEX_NAME_HERE")
# Option 2: load a local Lucene index
searcher = SimpleSearcher("PATH_TO_INDEX")

searcher.set_bm25(0.82, 0.68)

Next, initialize one or more first-stage CQR retrievers

from chatty_goose.cqr import Hqe, Ntr
from chatty_goose.settings import HqeSettings, NtrSettings

hqe = Hqe(searcher, HqeSettings())
ntr = Ntr(NtrSettings())

Load a reranker

from chatty_goose.util import build_bert_reranker

reranker = build_bert_reranker()

Create a new RetrievalPipeline

from chatty_goose.pipeline import RetrievalPipeline

rp = RetrievalPipeline(searcher, [hqe, ntr], searcher_num_hits=50, reranker=reranker)

And we're done! Simply call rp.retrieve(query) to retrieve passages, or call rp.reset_history() to reset the conversational history of the retrievers.

Running Experiments

Clone the repo and all submodules (git submodule update --init --recursive)
Clone and build Anserini for evaluation tools
Install dependencies

pip install -r requirements.txt

Follow the instructions under docs/cqr_experiments.md to run experiments using HQE, T5, or fusion.

Example Agent

To run an interactive conversational search agent with ParlAI, simply run chat.py. By default, we use the CAsT 2019 pre-built Pyserini index, but it is possible to specify other indexes using the --from_prebuilt flag. See the file for other possible arguments:

python -m chatty_goose.agents.chat

Alternatively, run the agent using ParlAI's command line interface:

python -m parlai interactive --model chatty_goose.agents.chat:ChattyGooseAgent

We also provide instructions to deploy the agent to Facebook Messenger using ParlAI under examples/messenger.

Comments

Add baselines for CAsT 2020
Need someone help to add CAsT 2020 baseline results:

[ ] Naive: CQR without canonical responses

[ ] Canonical: CQR with canonical (manual) response

CQR methods: HQE /Ntr (T5)
enhancement help wanted
opened by justram 2
Running HQE and getting the reformulated queries
Dear authors,

I am trying to use your method in some of my work. For that, I need to get the reformulated queries (instead of only the generated ranked hits).

I am trying to run the HQE experiment as indicated using:

python -m experiments.run_retrieval \ --experiment hqe \ --hits 1000 \ --sparse_index cast2019 \ --qid_queries $input_query_json \ --output ./output/hqe_bm25

However, when I print the arguments passed inside the retrieval pipeline (L101 of retrieval_pipeline.py) I get as query the raw/original/last-turn query string, and as manual_context_buffer[turn_id] simply None. If I'm not mistaken, that means that running the specific experiment equals to no reformulation being done at all. Can you check/confirm this?

Digging more into the code, it seems to me that the queries I'd like to access are inside cqr_queries, but still, it seems to me that context should be empty/None in that case - probably resulting to no reformulation done at all.
opened by littlewine 1
Query rewriting fix

Thank you with the project.

The fix to below will be hits = rp.retrieve(query, manual_context_buffer[turn_id-1] if turn_id!=0 else None), to pass the last previous canonical response.

https://github.com/castorini/chatty-goose/blob/f9c21c8b7b6194d11d7aec5b4e218174cde98418/experiments/run_retrieval.py#L100

opened by xeniaqian94 1
Update based on Pyserini==0.14.0 and fix canonical response bug
Main change:

change --dense_index from temporary one to pyserini prebuilt index name

fix canonical response bug, which previously add current response to context

since now we have --dense_index, change option name --index to --sparse_index
opened by jacklin64 0
Add chatty goose support for dense retrieval and hybrid search for T5 and CQE

New features added: (only for T5 and CQE, may consider HQE in the future) (1) Dense retrieval (2) Dense-sparse hybrid retrieval

Some arg might be confused and may be changed in the future: (1) --index, --dense_index: may change to --sparse_index and --dense_index (2) --experiment now has options (hqe,cqe,t5,fusion,cqe_t5_fusion) may change to (hqe,cqe,t5,hqe_t5fusion,cqe_t5_fusion)

opened by jacklin64 0
Add cast2020 baseline

This PR adds both naive and canonical baselines for CAsT2020 topics. The results are overall lower as compared to CAst2019 and the results from the canonical run are only slightly better for some metrics as compared to results from the naive run.

Resolves #23

opened by saileshnankani 0
Add support for canonical response

This PR adds support for using manual_canonical_result_id in the CAsT2020 data for both ntr and hqe (for #23).

For ntr, rewrite uses the passage corresponding to the canonical document in the history. We only use 1 passage in the historical context as otherwise, it exceed 512 tokens limit. For e.g., it uses q1/P1/q2 and then q1/q2/P2/q3 and so on.
enhancement

opened by saileshnankani 0
CQR Replication
Add CQR replication for Fusion BM25

Library versions used: torch==1.7.0 torchvision==0.8.1 torchtext==0.8

Results:

map all 0.2584 recall_1000 all 0.8028 ndcg_cut_1 all 0.3353 ndcg_cut_3 all 0.3247

Details and reproduction results can be found in the notebook
opened by saileshnankani 0
Rename classes and update messenger bot
Breaking changes:

Renamed several classes to follow Python conventions / be more consistent

chatty_goose.agents.cqragent -> chatty_goose.agents.chat

HQE -> Hqe

T5_NTR -> Ntr

HQESettings -> HqeSettings

T5Settings -> NtrSettings

CQRType -> CqrType

CQRSettings -> CqrSettings

CQR -> ConversationalQueryRewriter
opened by edwinzhng 0

document spaCy model dependency

With a fresh install, we get the following error if we try to run anything:

OSError: [E050] Can't find model 'en_core_web_sm'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory.

Solution is:

$ python -m spacy download en_core_web_sm

We should document this.

opened by lintool 0

PyTorch version: needs Torch 1.7 (won't work with 1.8)
With a from-scratch installation, the module pulls in Torch 1.8, which causes this error:

ImportError: cannot import name 'SAVE_STATE_WARNING' from 'torch.optim.lr_scheduler' (/anaconda3/envs/chatty-goose-test/lib/python3.7/site-packages/torch/optim/lr_scheduler.py)

Downgrading fixes the issue:

$ pip install torch==1.7.1 torchtext==0.8.1

Should we pin the version in our module dependencies? Or at the very least this needs to be documented.=
opened by lintool 0

dependency conflict

Hi,

When I install chatty-goose from github using:

python -m pip install git+https://github.com/castorini/chatty-goose.git

I met this issue:

ERROR: Cannot install chatty-goose and chatty-goose==0.2.0 because these package versions have conflicting dependencies.

The conflict is caused by:
    chatty-goose 0.2.0 depends on pyserini==0.14.0
    pygaggle 0.0.3.1 depends on pyserini==0.10.1.0

It seems that chatty-goose requires pyserini==0.14.0 as well as pygaggle 0.0.3.1. However, pygaggle 0.0.3.1 and pyserini==0.14.0 do not play nice with each other

Could someone provide some help?

Thanks!

opened by dayuyang1999 1

Expansion to new datasets
Does it make sense to expand Chatty Goose to new datasets? For example:

MANtIS - a multi-domain information seeking dialogues dataset: https://guzpenha.github.io/MANtIS/

ClariQ - Search-oriented Conversational AI (SCAI) EMNLP https://github.com/aliannejadi/ClariQ

enhancement
opened by lintool 1
Checkpoint transformation

According to @edwinzhng's replication log, we have a reranker checkpoint mismatch issue. Currently, we have diffs in our reranking model and the pygaggle's default model.

Related to this issue: I think we need a folder to put/track our tf2torch ckpt transformation/sanity check scripts?

opened by justram 0

Releases(0.2.0)

0.2.0(May 7, 2021)

Breaking changes

Renamed several classes to follow Python conventions / be more consistent

chatty_goose.agents.cqragent -> chatty_goose.agents.chat HQE -> Hqe T5_NTR -> Ntr HQESettings -> HqeSettings T5Settings -> NtrSettings CQRType -> CqrType CQRSettings -> CqrSettings CQR -> ConversationalQueryRewriter
Source code(tar.gz)
Source code(zip)
v0.1.0(Mar 8, 2021)
BREAKING CHANGES

Integrate ParlAI Facebook Messenger example for a demo by @edwinzhng

Integrate Pyserini/Pygaggle for a reference implementation of multi-stage passage retrieval by @edwinzhng

Add replication log for TREC CAsT 2019 conversational passage retrieval task by @edwinzhng

Source code(tar.gz)
Source code(zip)

Owner

Castorini

Deep learning for natural language processing and information retrieval at the University of Waterloo

GitHub Repository

Huawei Hackathon 2021 - Sweden (Stockholm)

huawei-hackathon-2021 Contributors DrakeAxelrod Challenge Requirements: python=3.8.10 Standard libraries (no importing) Important factors: Data depend

32 Nov 08, 2022

Python Multi-Agent Reinforcement Learning framework

- Please pay attention to the version of SC2 you are using for your experiments. - Performance is *not* always comparable between versions. - The re

1.3k Jan 05, 2023

Framework for abstracting Amiga debuggers and access to AmigaOS libraries and devices.

Framework for abstracting Amiga debuggers. This project provides abstration to control an Amiga remotely using a debugger. The APIs are not yet stable

39 Nov 22, 2022

Shōgun

The SHOGUN machine learning toolbox Unified and efficient Machine Learning since 1999. Latest release: Cite Shogun: Develop branch build status: Donat

2.9k Jan 04, 2023

OpenAi's gym environment wrapper to vectorize them with Ray

Ray Vector Environment Wrapper You would like to use Ray to vectorize your environment but you don't want to use RLLib ? You came to the right place !

15 Nov 10, 2022

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation YouTube | BiliBili 16X interpolation results from two input images: Introd

28 Dec 09, 2022

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

This repository holds NVIDIA-maintained utilities to streamline mixed precision and distributed training in Pytorch. Some of the code here will be included in upstream Pytorch eventually. The intenti

6.9k Jan 03, 2023

Everything you want about DP-Based Federated Learning, including Papers and Code. (Mechanism: Laplace or Gaussian, Dataset: femnist, shakespeare, mnist, cifar-10 and fashion-mnist. )

Differential Privacy (DP) Based Federated Learning (FL) Everything about DP-based FL you need is here. （所有你需要的DP-based FL的信息都在这里） Code Tip: the code o

83 Dec 24, 2022

[SIGGRAPH Asia 2021] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning.

DeepVecFont This is the homepage for "DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning". Yizhi Wang and Zhouhui Lian. WI

17 Dec 22, 2022

Implementation of the paper "Language-agnostic representation learning of source code from structure and context".

Code Transformer This is an official PyTorch implementation of the CodeTransformer model proposed in: D. Zügner, T. Kirschstein, M. Catasta, J. Leskov

131 Dec 13, 2022

Open-source code for Generic Grouping Network (GGN, CVPR 2022)

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity Pytorch implementation for "Open-World Instance Segmen

99 Dec 06, 2022

Anomaly detection in multi-agent trajectories: Code for training, evaluation and the OpenAI highway simulation.

Anomaly Detection in Multi-Agent Trajectories for Automated Driving This is the official project page including the paper, code, simulation, baseline

12 Dec 02, 2022

Embracing Single Stride 3D Object Detector with Sparse Transformer

SST: Single-stride Sparse Transformer This is the official implementation of paper: Embracing Single Stride 3D Object Detector with Sparse Transformer

385 Dec 28, 2022

Parameter Efficient Deep Probabilistic Forecasting

PEDPF Parameter Efficient Deep Probabilistic Forecasting (PEDPF) is a repository containing code to run experiments for several deep learning based pr

10 Jun 13, 2022

Real time sign language recognition

The proposed work aims at converting american sign language gestures into English that can be understood by everyone in real time.

6 Jun 13, 2022

code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"

HCV_IIRC code for our BMVC 2021 paper HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification by Kai Wang, Xialei Li

13 Oct 03, 2022

This is a repository of our model for weakly-supervised video dense anticipation.

Introduction This is a repository of our model for weakly-supervised video dense anticipation. More results on GTEA, Epic-Kitchens etc. will come soon

2 Apr 09, 2022

R-package accompanying the paper "Dynamic Factor Model for Functional Time Series: Identification, Estimation, and Prediction"

dffm The goal of dffm is to provide functionality to apply the methods developed in the paper “Dynamic Factor Model for Functional Time Series: Identi

3 Dec 09, 2022

Machine learning and Deep learning models, deploy on telegram (the best social media)

Semi Intelligent BOT The project involves : Classifying fake news Classifying objects such as aeroplane, automobile, bird, cat, deer, dog, frog, horse

5 Mar 06, 2022

General Multi-label Image Classification with Transformers

General Multi-label Image Classification with Transformers Jack Lanchantin, Tianlu Wang, Vicente Ordóñez Román, Yanjun Qi Conference on Computer Visio

154 Dec 21, 2022