🎲 KenLM extension for spaCy 2.0.
-
Updated
Dec 6, 2017 - Python
🎲 KenLM extension for spaCy 2.0.
demo of domain corpus bootstrapping using language model perplexity
CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
Generate language models from OSCAR corpora
Basic setup to use kenlm library in cpp
A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries
Wave2vec 2.0 Recognize pipeline
Scripts to train a n-gram language models on Wikipedia articles
Romanian Automatic Speech Recognition from the ROBIN project
Optical Character Recognition + Instance Segmentation for russian and english languages
A complete instruction for training a Persian spell checker and a language model based on SymSpell and KenLM, respectively using Wikipedia dataset.
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Neural Grammatical Error Correction for Romanian using Transformer
Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech
Developed an AI tool to automatically generate captions and transcripts for YouTube videos in 67 languages and can generate summarized texts in 133 languages.
We create n-gram language models that quantify the likelihood of various sound sequences occurring in the English language.
End-to-End Automatic Speech Recognition on PyTorch with CTC Decoder and Ken LM
This repo shows how to finetune the wav2vec2.0 model along with its prerequisites.
Add a description, image, and links to the kenlm topic page so that developers can more easily learn about it.
To associate your repository with the kenlm topic, visit your repo's landing page and select "manage topics."