CHANGELOG.md

1.10.23

  • flutter: add lang, emotion, event to OfflineRecognizerResult (#1268)
  • Use a separate thread to initialize models for Lazarus examples. (#1270)
  • Object Pascal examples for recording and playing audio with PortAudio. (#1271)
  • Text to speech API for Object Pascal. (#1273)
  • Update Kotlin API to better release native objects and add user-friendly APIs. (#1275)
  • Update wave-reader.cc to support 8/16/32-bit waves (#1278)
  • Add WebAssembly for VAD (#1281)
  • WebAssembly example for VAD + Non-streaming ASR (#1284)

1.10.22

  • Add Pascal API for reading wave files (#1243)
  • Pascal API for streaming ASR (#1246)
  • Pascal API for non-streaming ASR (#1247)
  • Pascal API for VAD (#1249)
  • Add more C API examples (#1255)
  • Add emotion, event of SenseVoice. (#1257)
  • Support reading multi-channel wave files with 8/16/32-bit encoded samples (#1258)
  • Enable IPO only for Release build. (#1261)
  • Add Lazarus example for generating subtitles using Silero VAD with non-streaming ASR (#1251)
  • Fix looking up OOVs in lexicon.txt for MeloTTS models. (#1266)

1.10.21

  • Fix FFmpeg C API example (#1185)
  • Fix splitting sentences for MeloTTS (#1186)
  • Non-streaming WebSocket client for Java. (#1190)
  • Fix copying asset files for Flutter examples. (#1191)
  • Add Chinese+English TTS example for Flutter (#1192)
  • Add speaker identification and verification example for Dart API (#1194)
  • Fix reading non-standard wav files. (#1199)
  • Add ReazonSpeech Japanese pre-trained model (#1203)
  • Describe how to add new words for MeloTTS models (#1209)
  • Remove libonnxruntime_providers_cuda.so as a dependency. (#1210)
  • Fix setting SenseVoice language. (#1214)
  • Support passing TTS callback in Swift API (#1218)
  • Add MeloTTS example for iOS (#1223)
  • Add online punctuation and casing prediction model for English language (#1224)
  • Fix Python two-pass ASR examples (#1230)
  • Add blank penalty for various language bindings

1.10.20

  • Add Dart API for audio tagging
  • Add Dart API for adding punctuation to text

1.10.19

  • Prefix all C API functions with SherpaOnnx

1.10.18

  • Fix the case where recognition results contain the double-quote character ("), which caused issues when converting results to a JSON string.
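
The kind of failure described in this entry can be illustrated with a minimal sketch. It is not sherpa-onnx's actual code: the helper names `naive_to_json` and `safe_to_json` and the sample text are hypothetical, and Python's standard `json` module stands in for whatever serializer the library uses. It only shows why an unescaped double quote produces an invalid JSON string and how escaping avoids that.

```python
import json


def naive_to_json(text):
    # Hypothetical helper: builds JSON by plain concatenation, with no escaping.
    return '{"text": "' + text + '"}'


def safe_to_json(text):
    # Hypothetical helper: lets the serializer escape special characters.
    return json.dumps({"text": text})


# Assumed sample recognition result that contains double quotes.
result = 'He said "hello"'

try:
    json.loads(naive_to_json(result))
    naive_ok = True
except json.JSONDecodeError:
    # The embedded quotes end the string early, so parsing fails.
    naive_ok = False

# The escaped version parses and round-trips to the original text.
round_trip = json.loads(safe_to_json(result))["text"]

print(naive_ok)
print(round_trip == result)
```

In the escaped form each embedded quote is written as `\"`, so a consumer parsing the result string sees the original text unchanged.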

1.10.17

  • Support SenseVoice CTC models.
  • Add Dart API for keyword spotter.

1.10.16

  • Support zh-en TTS model from MeloTTS.

1.10.15

  • Downgrade onnxruntime from v1.18.1 to v1.17.1

1.10.14

  • Support Whisper large v3
  • Update onnxruntime from v1.18.0 to v1.18.1
  • Fix invalid UTF-8 sequence from Whisper for Dart API.

1.10.13

  • Update onnxruntime from v1.17.1 to v1.18.0
  • Add C# API for keyword spotting

1.10.12

1.10.11

  • Support the iOS platform for Flutter.

1.10.10

  • Build sherpa-onnx into a single shared library.

1.10.9

  • Fix released packages. piper-phonemize was not included in v1.10.8.

1.10.8

  • Fix released packages. There should be a lib directory.

1.10.7

  • Support Android for Flutter.

1.10.2

  • Fix passing C# string to C++

1.10.1

  • Allow stopping TTS generation

1.10.0

  • Add inverse text normalization

1.9.30

  • Add TTS

1.9.29

  • Publish with CI

0.0.3

  • Fix path separator on Windows.

0.0.2

  • Support specifying lib path.

0.0.1

  • Initial release.