# kenlm

Here are 22 public repositories matching this topic...

tokestermw / spacy_kenlm

🎲 KenLM extension for spaCy 2.0.

nlp spacy language-model spacy-nlp spacy-extension kenlm

Updated Dec 6, 2017
Python

Targoman / TargomanSMT

Targoman SMT framework source code

Updated Feb 12, 2018
C++

SNUDerek / lm_perplexity_bootstrapping

demo of domain corpus bootstrapping using language model perplexity

text-classification language-modeling nltk bootstrapping kenlm language-model-perplexity perplexity

Updated Feb 14, 2018
Jupyter Notebook

Sundy1219 / ctc_beam_search_lm

CTC+Beam_Search+kenlm is a decoding system for acoustic models that use Chinese characters as the modeling unit

beam-search chinese-characters asr kenlm

Updated Jun 27, 2018
C++

kmario23 / KenLM-training

Training an n-gram-based language model with the KenLM toolkit for Deep Speech 2

python natural-language-processing deep-neural-networks language-modeling speech-recognition automatic-speech-recognition language-model probabilistic-models kenlm deep-speech kenlm-toolkit

Updated May 20, 2019

mozilla / scorertool

Generate language models from OSCAR corpora

machine-learning oscar scorer language-model deepspeech kenlm

Updated Mar 27, 2020
Python

gv22ga / kenlm-cpp-example

Basic setup for using the KenLM library in C++

cpp sample-code kenlm kenlm-cpp kenlm-library

Updated Sep 10, 2020
C++

levyfan / kenlm-jni

A Java JNI wrapper for KenLM: Faster and Smaller Language Model Queries

java nlp machine-learning natural-language-processing jni language-model kenlm

Updated Oct 25, 2020
Java

loretoparisi / wave2vec-recognize-docker

wav2vec 2.0 recognition pipeline

docker pytorch automatic-speech-recognition asr wav2letter kenlm wav2vec

Updated Dec 22, 2020
Python

mpoyraz / ngram-lm-wiki

Scripts to train n-gram language models on Wikipedia articles

speech-recognition kenlm n-gram-language-models

Updated Jan 17, 2022
Python

racai-ai / RobinASR

Romanian Automatic Speech Recognition from the ROBIN project

text-to-speech pytorch romanian automatic-speech-recognition asr deepspeech kenlm

Updated Feb 16, 2022
Python

Lednik7 / nto-ai-text-recognition

Optical Character Recognition + Instance Segmentation for Russian and English

ocr torch segmentation copypaste ocr-recognition instance-segmentation kenlm beam-search-decoder detectron2

Updated Mar 6, 2022
Jupyter Notebook

pooya-mohammadi / persian-spell-checker-kenlm

Complete instructions for training a Persian spell checker and a language model, based on SymSpell and KenLM respectively, using a Wikipedia dataset.

python nlp bash spellcheck persian language-model spellchecker kenlm symspell

Updated Jul 20, 2022
Python

DeutscheKI / tevr-asr-tool

State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.

Updated Aug 9, 2022
C

teodor-cotet / RoGEC

Neural Grammatical Error Correction for Romanian using Transformer

deep-learning tensorflow transformer grammatical gec kenlm

Updated Dec 8, 2022
Python

fquirin / kaldi-adapt-lm

Create and adapt n-gram and JSGF language models, e.g. for Kaldi-ASR nnet3 chain models from Zamia-Speech

speech-recognition language-model g2p kaldi-asr kenlm asr-model jsgf-grammars ngram-models zamia

Updated Jun 2, 2023
Python

Msparihar / Transcriber

An AI tool that automatically generates captions and transcripts for YouTube videos in 67 languages and summarized text in 133 languages.

nlp deep-neural-networks audio-processing kenlm wav2vec2

Updated Nov 10, 2023
Python

Leen-Alzebdeh / NLP-LMs

We create n-gram language models that quantify the likelihood of various sound sequences occurring in the English language.

nlp n-grams nltk language-model kenlm

Updated Dec 27, 2023
Python

LuluW8071 / Automatic-Speech-Recognition-with-PyTorch

End-to-End Automatic Speech Recognition in PyTorch with a CTC Decoder and KenLM

python deep-neural-networks pytorch cuda-support kenlm asr-model cnn-lstm-models pytorch-lightning ctc-decode

Updated Mar 27, 2024
Python

Sarasadeghii / Sharif-Wav2vec2

This repo shows how to fine-tune the wav2vec 2.0 model, along with its prerequisites.

nlp speech-recognition speech-to-text language-model wer kenlm farsi-datasets wav2vec2 xlsr

Updated May 12, 2024
Jupyter Notebook
