Our CIKM21 Paper "Incorporating Query Reformulating Behavior into Web Search Evaluation"

Last update: Mar 05, 2022

Overview

Reformulation-Aware-Metrics

Introduction

This repository contains the source code for the Python implementation of our CIKM 2021 paper.

Chen, Jia, et al. "Incorporating Query Reformulating Behavior into Web Search Evaluation." In Proceedings of the 30th ACM International Conference on Information and Knowledge Management. 2021.

Requirements

python 2.7
sklearn
scipy

Data Preparation

Preprocess the two datasets, TianGong-SS-FSD and TianGong-Qref, into the following format:

[Reformulation Type][Click List][Usefulness List][Satisfaction Label]

Reformulation Type: A (Add), D (Delete), K (Keep), T (Transform or Change), O (Others), F (First Query).
Click List: 1 -- Clicked, 0 -- Not Clicked.
Usefulness List: Usefulness or Relevance, on a 4-point scale in TianGong-Qref and a 5-point scale in TianGong-SS-FSD.
Satisfaction Label: 5-point scale for both datasets.
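
The four fields above can be read into a small record type. The sketch below is illustrative only: the on-disk serialization is not specified here, so it assumes one query per line, tab-separated fields, and space-separated list items. The names `QueryRecord` and `parse_record` are hypothetical and do not come from this repo.

```python
from collections import namedtuple

# Hypothetical record type; field names are illustrative, not from the repo.
QueryRecord = namedtuple(
    "QueryRecord", ["reform_type", "clicks", "usefulness", "satisfaction"]
)

# Reformulation types listed in the format description.
VALID_REFORM_TYPES = {"A", "D", "K", "T", "O", "F"}


def parse_record(line, field_sep="\t", item_sep=" "):
    """Parse one line under the ASSUMED layout:
    <type> TAB <clicks> TAB <usefulness> TAB <satisfaction>
    """
    fields = line.rstrip("\n").split(field_sep)
    if len(fields) != 4:
        raise ValueError("expected 4 fields, got %d" % len(fields))
    reform_type, clicks_raw, usefulness_raw, sat_raw = fields
    if reform_type not in VALID_REFORM_TYPES:
        raise ValueError("unknown reformulation type: %r" % reform_type)
    clicks = [int(c) for c in clicks_raw.split(item_sep)]
    usefulness = [int(u) for u in usefulness_raw.split(item_sep)]
    if any(c not in (0, 1) for c in clicks):
        raise ValueError("click values must be 0 or 1")
    if len(clicks) != len(usefulness):
        raise ValueError("click and usefulness lists differ in length")
    return QueryRecord(reform_type, clicks, usefulness, int(sat_raw))
```

If your preprocessed files use different separators, pass them through `field_sep` and `item_sep` rather than changing the parsing logic.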

Then, bootstrap them into N samples and put the bootstrapped data (directories) into ./data/bootstrap_fsd and ./data/bootstrap_qref.
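
A minimal sketch of the bootstrapping step follows. It assumes resampling is done with replacement at the session level and that each sample has the same size as the original dataset; neither detail is stated above. The function name `bootstrap_samples` is hypothetical, and writing the samples into the directories named above is left out because their internal layout is not documented here.

```python
import random


def bootstrap_samples(sessions, n_samples, seed=0):
    """Return n_samples resampled datasets.

    Each sample is drawn with replacement from `sessions` and has the
    same number of items as the input (an assumption of this sketch).
    """
    rng = random.Random(seed)  # fixed seed so the samples are reproducible
    size = len(sessions)
    return [
        [sessions[rng.randrange(size)] for _ in range(size)]
        for _ in range(n_samples)
    ]
```

The index of each sample in the returned list could then serve as the bootstrapped sample id.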

Results

The results for each metric are shown in the following table:

Metric	Qref-Spearman	Qref-Pearson	Qref-MSE	FSD-Spearman	FSD-Pearson	FSD-MSE
RBP	0.4375	0.4180	N/A	0.4898	0.5222	N/A
DCG	0.4434	0.4182	N/A	0.5022	0.5290	N/A
BPM	0.4552	0.3915	N/A	0.5801	0.6052	N/A
RBP sat	0.4389	0.4170	N/A	0.5165	0.5527	N/A
DCG sat	0.4446	0.4166	N/A	0.5047	0.5344	N/A
BPM sat	0.4622	0.3674	N/A	0.5960	0.6029	N/A
rrDBN	0.4123	0.3670	1.1508	0.5908	0.5602	1.0767
rrSDBN	0.4177	0.3713	1.1412	0.5991	0.5703	1.0524
uUBM	0.4812	0.4303	1.0607	0.6242	0.5775	0.8795
uPBM	0.4827	0.4369	1.0524	0.6210	0.5846	0.8644
uSDBN	0.4837	0.4375	1.1443	0.6290	0.6081	0.8840
uDBN	0.4928	0.4458	1.0801	0.6339	0.6207	0.8322
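
The three column types in the table (Spearman, Pearson, MSE) can be computed as in the sketch below, presumably between metric scores and satisfaction labels. This is a dependency-free illustration with hypothetical function names, not the evaluation code of this repo; in practice `scipy.stats.pearsonr` and `scipy.stats.spearmanr` provide the two correlations.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / float(n)
    mean_y = sum(ys) / float(n)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while (end + 1 < len(order)
               and values[order[end + 1]] == values[order[start]]):
            end += 1
        mean_rank = (start + end) / 2.0 + 1.0
        for pos in range(start, end + 1):
            ranks[order[pos]] = mean_rank
        start = end + 1
    return ranks


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the tie-averaged ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


def mse(predicted, actual):
    """Mean squared error between predictions and reference values."""
    total = sum((p - a) ** 2 for p, a in zip(predicted, actual))
    return total / float(len(actual))
```

Satisfaction labels on a 5-point scale contain many ties, which is why the rank step averages tied positions instead of breaking ties arbitrarily.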

To reproduce the results of traditional metrics such as RBP, DCG, and BPM, we recommend using the cwl_eval repo. 🤗
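
For orientation, the textbook definitions of RBP and DCG can be sketched as below. This is not the cwl_eval implementation, and its parameterization may differ. The mapping from usefulness labels to gains and the default persistence of 0.8 are assumptions of this sketch, and BPM is omitted.

```python
import math


def rbp(gains, persistence=0.8):
    """Rank-biased precision: (1 - p) * sum_i gain_i * p**(i - 1), ranks from 1.

    `persistence` (p) is an arbitrary default here, not a tuned value.
    """
    # enumerate() starts at 0, which matches the exponent i - 1.
    return (1.0 - persistence) * sum(
        gain * persistence ** rank for rank, gain in enumerate(gains)
    )


def dcg(gains):
    """Discounted cumulative gain: sum_i gain_i / log2(i + 1), ranks from 1."""
    # With a 0-based index, the discount for each position is log2(index + 2).
    return sum(
        gain / math.log(rank + 2, 2) for rank, gain in enumerate(gains)
    )
```

Both functions take the gains of the ranked documents from top to bottom, for example a usefulness list truncated to the top documents.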

Quick Start

To train the Reformulation-Aware Metrics (RAMs), run the script as follows:

python run.py --click_model DBN \
	--data qref \
	--id 0 \
	--metric_type expected_utility \
	--max_usefulness 3 \
	--k_num 6 \
	--max_dnum 10 \
	--iter_num 10000 \
	--alpha 0.01 \
	--alpha_decay 0.99 \
	--lamda 0.85 \
	--patience 5 \
	--use_knowledge True

click_model: options: ['DBN', 'SDBN', 'UBM', 'PBM']
data: options: ['fsd', 'qref']
metric_type: options: ['expected_utility', 'effort']
id: the bootstrapped sample id.
k_num: the number of user intent shift types to consider; must be less than or equal to six.
max_dnum: the maximum number of top documents to be considered for a specific query.
use_knowledge: whether to use the transition probability from syntactic reformulation types to intent-level ones derived from the TianGong-Qref dataset.
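
To train on every bootstrapped sample, the command above can be assembled programmatically. The sketch below only builds the argument list. The helper `build_command` is hypothetical, and the hyperparameter values are copied from the Quick Start example as defaults; they may need adjusting, for example `max_usefulness` for the fsd data.

```python
CLICK_MODELS = ("DBN", "SDBN", "UBM", "PBM")
DATASETS = ("fsd", "qref")
METRIC_TYPES = ("expected_utility", "effort")

# Values copied from the Quick Start example; treat them as a starting point.
DEFAULT_HPARAMS = [
    ("max_usefulness", "3"),
    ("k_num", "6"),
    ("max_dnum", "10"),
    ("iter_num", "10000"),
    ("alpha", "0.01"),
    ("alpha_decay", "0.99"),
    ("lamda", "0.85"),
    ("patience", "5"),
    ("use_knowledge", "True"),
]


def build_command(click_model, data, sample_id,
                  metric_type="expected_utility", overrides=None):
    """Assemble the run.py invocation for one bootstrapped sample."""
    if click_model not in CLICK_MODELS:
        raise ValueError("unknown click model: %r" % click_model)
    if data not in DATASETS:
        raise ValueError("unknown dataset: %r" % data)
    if metric_type not in METRIC_TYPES:
        raise ValueError("unknown metric type: %r" % metric_type)
    overrides = overrides or {}
    cmd = ["python", "run.py",
           "--click_model", click_model,
           "--data", data,
           "--id", str(sample_id),
           "--metric_type", metric_type]
    for name, value in DEFAULT_HPARAMS:
        # Per-call overrides replace the Quick Start defaults.
        cmd += ["--" + name, str(overrides.get(name, value))]
    return cmd
```

Each returned list can be passed to `subprocess.call`, for instance inside a loop over sample ids and click models.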

Citation

If you find the resources in this repo useful, please consider starring it and citing our work:

@inproceedings{chen2021incorporating,
  title={Incorporating Query Reformulating Behavior into Web Search Evaluation},
  author={Chen, Jia and Liu, Yiqun and Mao, Jiaxin and Zhang, Fan and Sakai, Tetsuya and Ma, Weizhi and Zhang, Min and Ma, Shaoping},
  booktitle={Proceedings of the 30th ACM International Conference on Information and Knowledge Management},
  year={2021},
  organization={ACM}
}

Contact

If you have any questions, please feel free to contact me via [email protected] or open an issue.

Owner

xuanyuan14
