Quick insights from Zoom meeting transcripts using Graph + NLP

Last update: Sep 17, 2022

Overview

Transcript Analysis - Graph + NLP

This program extracts insights from Zoom Meeting Transcripts (.vtt) using TigerGraph and NLTK.

To run this program, edit the auth.ini file with the credentials for your graph solution and your file paths. Then run main.py. A sample transcript is provided, but feel free to add your own to the \a_raw_transcripts directory!
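As a rough illustration of the configuration step, the sketch below parses an auth.ini-style file with Python's standard configparser module. The section names (tigergraph, paths) and every key in the sample are assumptions made for this example. The real auth.ini in the repository may be laid out differently, so treat this only as a guide to the kind of values the file holds.

```python
import configparser

# Hypothetical auth.ini contents. The section and key names are assumptions
# for illustration and may not match the file shipped with the project.
SAMPLE_INI = """
[tigergraph]
host = https://your-solution.i.tgcloud.io
graph_name = your_graph
username = tigergraph
password = your_password

[paths]
raw_dir = a_raw_transcripts
compact_dir = b_cmt_transcripts
"""

REQUIRED_SECTIONS = ("tigergraph", "paths")


def load_settings(ini_text):
    """Parse INI text and return the required sections as plain dicts."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    missing = [name for name in REQUIRED_SECTIONS if not parser.has_section(name)]
    if missing:
        raise ValueError(f"config is missing sections: {missing}")
    return {name: dict(parser[name]) for name in REQUIRED_SECTIONS}


settings = load_settings(SAMPLE_INI)
print(settings["paths"]["raw_dir"])
```

Failing early on a missing section gives a clearer error than a connection failure later in the run.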

As of now, this program performs the following tasks:

- Convert .vtt into a compact version (stored in \b_cmt_transcripts)
- NLP analysis of compact transcript (using NLTK)
  - Sentiment analysis
  - Trigrams (collocations)
  - Frequency of words (plotted)
  - Meaningful words (shown as wordcloud)
  - Number of speakers, names of speakers
  - Who spoke the longest, who spoke the least, and the average speaking time
- Graph analysis of compact transcript (using TigerGraph)
  - Analyze relationships between speakers
  - Asked the most/least questions
  - Pair with the most back-and-forth
  - (TODO): Linking topics in semantic graph
  - (TODO): Named-Entity Recognition
- Visual output of all determined insights
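To make the first task and the speaking-time statistics more concrete, here is a minimal standard-library sketch. It assumes each cue's text begins with a "Speaker: " label, as Zoom transcripts typically do. The compact format shown (consecutive cues from one speaker merged into a single turn) and the function names compact_vtt and talk_time are assumptions for this example, not the project's actual code or on-disk format.

```python
import re
from collections import defaultdict

# Matches a WebVTT cue timing line such as "00:00:01.000 --> 00:00:04.500".
CUE_TIME = re.compile(
    r"(\d+):(\d{2}):(\d{2})\.(\d{3}) --> (\d+):(\d{2}):(\d{2})\.(\d{3})"
)

# Tiny made-up transcript used only for this example.
SAMPLE_VTT = """WEBVTT

1
00:00:01.000 --> 00:00:04.500
Alice: Hello everyone.

2
00:00:04.500 --> 00:00:06.000
Alice: Can you hear me?

3
00:00:06.500 --> 00:00:08.000
Bob: Yes, we can.

4
00:00:08.000 --> 00:00:09.000
Alice: Great.
"""


def to_seconds(hours, minutes, seconds, millis):
    return int(hours) * 3600 + int(minutes) * 60 + int(seconds) + int(millis) / 1000


def compact_vtt(vtt_text):
    """Merge consecutive cues by the same speaker into one turn."""
    turns = []
    duration = None  # stays None until the first timing line is seen
    for raw in vtt_text.splitlines():
        line = raw.strip()
        match = CUE_TIME.match(line)
        if match:
            parts = match.groups()
            duration = to_seconds(*parts[4:]) - to_seconds(*parts[:4])
            continue
        # Skip the header, blank lines, and numeric cue identifiers.
        if duration is None or not line or line.isdigit():
            continue
        speaker, sep, text = line.partition(": ")
        if not sep:
            # No speaker label: treat the line as a continuation.
            if turns:
                turns[-1]["text"] += " " + line
            continue
        if turns and turns[-1]["speaker"] == speaker:
            turns[-1]["text"] += " " + text
            turns[-1]["seconds"] += duration
        else:
            turns.append({"speaker": speaker, "text": text, "seconds": duration})
        duration = 0.0  # count each cue's duration only once
    return turns


def talk_time(turns):
    """Total speaking time in seconds for each speaker."""
    totals = defaultdict(float)
    for turn in turns:
        totals[turn["speaker"]] += turn["seconds"]
    return dict(totals)


turns = compact_vtt(SAMPLE_VTT)
totals = talk_time(turns)
longest = max(totals, key=totals.get)
shortest = min(totals, key=totals.get)
average = sum(totals.values()) / len(totals)
print(len(totals), sorted(totals), longest, shortest, average)
```

Summing cue durations rather than subtracting the first start time from the last end time keeps pauses between cues out of the speaking-time totals.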

Usage

A TigerGraph Cloud Portal solution (https://tgcloud.io/) will be required to run this program.

Kindly find the GraphStudio link here: https://transcript-analysis.i.tgcloud.io/

The schema used in this graph is laid out below:

Vertex: speaker

(PRIMARY ID) name - STRING

Edge: asked_question

text - STRING

Edge: answered_question
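One way to picture how a transcript could map onto this schema is sketched below in plain Python, without any TigerGraph calls. The heuristic is an assumption made for this example and is not confirmed by the project: a turn ending in "?" yields an asked_question edge from the asker to the next speaker, and the reply yields an answered_question edge in the opposite direction. The function name build_edges and the edge directions are likewise hypothetical.

```python
from collections import Counter

# Made-up turns in spoken order: (speaker, text).
SAMPLE_TURNS = [
    ("Alice", "Can everyone see my screen?"),
    ("Bob", "Yes, looks good."),
    ("Alice", "Bob, did the deploy finish?"),
    ("Bob", "It did. Carol, any blockers?"),
    ("Carol", "None from me."),
]


def build_edges(turns):
    """Derive edge lists from consecutive turns.

    Assumed heuristic: a turn ending with "?" is a question directed at
    whoever speaks next, and that next turn counts as the answer.
    """
    asked = []     # (from_speaker, to_speaker, question_text)
    answered = []  # (from_speaker, to_speaker)
    for (speaker, text), (next_speaker, _) in zip(turns, turns[1:]):
        if text.rstrip().endswith("?") and next_speaker != speaker:
            asked.append((speaker, next_speaker, text))
            answered.append((next_speaker, speaker))
    return asked, answered


asked, answered = build_edges(SAMPLE_TURNS)

# Who asked the most questions.
asker_counts = Counter(source for source, _, _ in asked)
top_asker = asker_counts.most_common(1)[0]

# Which unordered pair of speakers exchanged the most questions.
pair_counts = Counter(frozenset((source, target)) for source, target, _ in asked)
top_pair = pair_counts.most_common(1)[0][0]

print(top_asker, sorted(top_pair))
```

In the actual program, counts like these would presumably come from queries against the graph rather than from in-memory lists. The sketch only shows the shape of the data that the speaker vertex and the two edge types describe.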

Here is an example of the graph populated with the sample transcript provided:

Analysis

Here is a screenshot of the command-line output produced:

Here is a frequency chart of meaningful words generated:

Here is a word cloud that visualizes common, key terms:

More features coming soon! In the meantime, feel free to keep creating and adding new insights 😁

Owner

Advit Deepak
