kaldi-asr/kaldi is the official location of the Kaldi project.

Last update: Jan 05, 2023

Overview

Kaldi Speech Recognition Toolkit

To build the toolkit: see ./INSTALL. These instructions are valid for UNIX systems including various flavors of Linux; Darwin; and Cygwin (has not been tested on more "exotic" varieties of UNIX). For Windows installation instructions (excluding Cygwin), see windows/INSTALL.

To run the example system builds, see egs/README.txt

If you encounter problems (and you probably will), please do not hesitate to contact the developers (see below). In addition to specific questions, please let us know if there are specific aspects of the project that you feel could be improved, that you find confusing, etc., and which missing features you most wish it had.

Kaldi information channels

For HOT news about Kaldi see the project site.

Documentation of Kaldi:

Info about the project, description of techniques, tutorial for C++ coding.
Doxygen reference of the C++ code.

Kaldi forums and mailing lists:

We have two different lists

User list kaldi-help
Developer list kaldi-developers:

To sign up to any of those mailing lists, go to http://kaldi-asr.org/forums.html:

Development pattern for contributors

Create a personal fork of the main Kaldi repository in GitHub.
Make your changes in a named branch different from master, e.g. you create a branch my-awesome-feature.
Generate a pull request through the Web interface of GitHub.
As a general rule, please follow Google C++ Style Guide. There are a few exceptions in Kaldi. You can use the Google's cpplint.py to verify that your code is free of basic mistakes.

Platform specific notes

PowerPC 64bits little-endian (ppc64le)

Kaldi is expected to work out of the box in RHEL >= 7 and Ubuntu >= 16.04 with OpenBLAS, ATLAS, or CUDA.
CUDA drivers for ppc64le can be found at https://developer.nvidia.com/cuda-downloads.
An IBM Redbook is available as a guide to install and configure CUDA.

Android

Kaldi supports cross compiling for Android using Android NDK, clang++ and OpenBLAS.
See this blog post for details.

kaldi-asr/kaldi is the official location of the Kaldi project.

Related tags

Overview

Kaldi Speech Recognition Toolkit

Kaldi information channels

Development pattern for contributors

Platform specific notes

PowerPC 64bits little-endian (ppc64le)

Android

Owner

Kaldi

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Detect the mathematical formula from the given picture and the same formula is extracted and converted into the latex code

Python-based tools for document analysis and OCR

scene-linear test images

Code for CVPR 2022 paper "SoftGroup for Instance Segmentation on 3D Point Clouds"

Hiiii this is the Spanish for Linux and win 10 and in the near future the english version of PortScan my new tool on which you can see what ports are Open only with the IP adress.

Autonomous Driving project for Euro Truck Simulator 2

An interactive document scanner built in Python using OpenCV

Text Detection from images using OpenCV

Using computer vision method to recognize and calcutate the features of the architecture.

Demo processor to illustrate OCR-D Python API

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

Some Boring Research About Products Recognition 、Duplicate Img Detection、Img Stitch、OCR

Distort a video using Seam Carving (video) and Vibrato effect (sound)

🖺 OCR using tensorflow with attention

Code for AAAI 2021 paper: Sequential End-to-end Network for Efficient Person Search

EQFace: An implementation of EQFace: A Simple Explicit Quality Network for Face Recognition

Scene text detection and recognition based on Extremal Region(ER)

Pixie - A full-featured 2D graphics library for Python