hyeonsangjeon / computing-Korean-STT-error-rates Star 47 Code Issues Pull requests STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지 aws amazon test speech-recognition korean speech-to-text evaluation-functions evaluation-metrics normalization cer transcribe evaluate speech-analysis wer word-error-rate text-evaluation text-digitisation character-error-rate computing-error-rates Updated Aug 25, 2023 Python
jo-valer / tesseract-ocr-enhanced Star 3 Code Issues Pull requests Preprocessing methods to enhance Tesseract-OCR in the case of printed text on difficult background, or handwritten text on lined/squared paper. ocr tesseract tesseract-ocr optical-character-recognition htr handwritten-text-recognition handwritten-character-recognition shadow-removal text-digitisation Updated Mar 30, 2024 Jupyter Notebook
polifonia-project / textual-corpus-population Star 2 Code Issues Pull requests Repository containing code for downloading and digitising textual documents used as a corpus for the Polifonia Project. music ocr tesseract text-digitisation Updated Oct 21, 2022 Jupyter Notebook