AI-OpenMic

Code and Dataset repository for the EMNLP 2021 paper:

“So You Think You’re Funny?”: Rating the Humour Quotient in Standup Comedy.

Paper: Link
Poster: Link
Slides: Link
Video: Link

Dataset

The dataset is available for download via the following link.

Code

The complete codebase shall be made available very soon. Thank you for being patient.

Citing

Please use the following citation while citing this work:

@inproceedings{mittal-etal-2021-think,
    title = "{``}So You Think You{'}re Funny?{''}: Rating the Humour Quotient in Standup Comedy",
    author = "Mittal, Anirudh  and
      P, Pranav Jeevan  and
      Gandhi, Prerak  and
      Kanojia, Diptesh  and
      Bhattacharyya, Pushpak",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2021",
    address = "Online and Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.emnlp-main.789",
    pages = "10073--10079",
    abstract = "Computational Humour (CH) has attracted the interest of Natural Language Processing and Computational Linguistics communities. Creating datasets for automatic measurement of humour quotient is difficult due to multiple possible interpretations of the content. In this work, we create a multi-modal humour-annotated dataset ({\textasciitilde}40 hours) using stand-up comedy clips. We devise a novel scoring mechanism to annotate the training data with a humour quotient score using the audience{'}s laughter. The normalized duration (laughter duration divided by the clip duration) of laughter in each clip is used to compute this humour coefficient score on a five-point scale (0-4). This method of scoring is validated by comparing with manually annotated scores, wherein a quadratic weighted kappa of 0.6 is obtained. We use this dataset to train a model that provides a {`}funniness{'} score, on a five-point scale, given the audio and its corresponding text. We compare various neural language models for the task of humour-rating and achieve an accuracy of 0.813 in terms of Quadratic Weighted Kappa (QWK). Our {`}Open Mic{'} dataset is released for further research along with the code.",
}

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.gitattributes		.gitattributes
ExtractAudioEmbeddings.py		ExtractAudioEmbeddings.py
ExtractTextEmbeddings.py		ExtractTextEmbeddings.py
LICENSE		LICENSE
Model.py		Model.py
README.md		README.md
cfilt-dark-vec.png		cfilt-dark-vec.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.ipynb_checkpoints

.ipynb_checkpoints

.gitattributes

.gitattributes

ExtractAudioEmbeddings.py

ExtractAudioEmbeddings.py

ExtractTextEmbeddings.py

ExtractTextEmbeddings.py

LICENSE

LICENSE

Model.py

Model.py

README.md

README.md

cfilt-dark-vec.png

cfilt-dark-vec.png

Repository files navigation

AI-OpenMic

“So You Think You’re Funny?”: Rating the Humour Quotient in Standup Comedy.

Dataset

Code

Citing

About

Releases

Packages

Contributors 3

Languages

License

cfiltnlp/AI-OpenMic

Folders and files

Latest commit

History

Repository files navigation

AI-OpenMic

“So You Think You’re Funny?”: Rating the Humour Quotient in Standup Comedy.

Dataset

Code

Citing

About

Resources

License

Stars

Watchers

Forks

Languages