Skip to content

Latest commit

 

History

History
125 lines (90 loc) · 6.17 KB

README.md

File metadata and controls

125 lines (90 loc) · 6.17 KB

CloudVocal - Professional Cloud AI Transcription & Translation Plugin

GitHub GitHub Workflow Status Total downloads GitHub release (latest by date) GitHub stars Discord
Download:

Introduction

CloudVocal brings professional-grade cloud transcription and translation to your OBS streams and recordings. Powered by industry-leading cloud providers, it delivers exceptional accuracy and real-time performance for your live streaming needs. ✅ Professional-grade accuracy, ✅ support for 100+ languages, ✅ enterprise-level reliability, and ✅ blazing-fast performance!

CloudVocal integrates seamlessly with leading cloud providers to deliver enterprise-grade speech recognition and translation services. Simply configure your API credentials and start streaming with professional-quality captions and translations.

Features

Current Features:

  • Professional-grade transcription with industry-leading accuracy
  • Providers: Google Cloud, Naver Clova, Rev AI, Deepgram, AWS Transcribe (upcoming)
  • Real-time translation using enterprise cloud translation services
  • Translation providers: Google Cloud, Naver Papago, DeepL, AWS Translate, Anthropic Claude, OpenAI
  • Streaming-optimized performance with minimal latency
  • Caption output in multiple formats (.txt, .srt)
  • Sync'ed captions with OBS recording timestamps
  • Direct streaming to platforms (YouTube, Twitch) with embedded captions
  • Partial transcriptions for a streaming-captions experience

Roadmap:

  • Additional cloud providers and services (e.g. Microsoft Azure)
  • Custom vocabulary and pronunciation support
  • Professional terminology handling for specific industries
  • Advanced text filtering and customization options
  • Speaker diarization for multi-speaker environments
  • Advanced profanity filtering options
  • Custom translation glossaries
  • Additional subtitle format support
  • Enhanced analytics and caption quality metrics

Usage

Tutorial videos and screenshots - coming soon!

Download and Installation

Check out the latest releases for downloads and install instructions.

Configuration

  1. Download and install the appropriate version for your operating system
  2. Add CloudVocal as a filter to your audio source
  3. Configure your cloud provider credentials in the plugin settings
  4. Select your desired transcription and translation options
  5. Select an output text source for the captions and translations, send the captions to the stream or a file

Building

The plugin can be built on Windows, macOS, and Linux platforms. The build process is straightforward as all processing happens in the cloud.

Both Mac OSX and Linux rely on Conan for dependencies. Install Conan, e.g. pip install conan, and install the dependencies:

$ conan profile detect --force
$ conan install . --output-folder=./build_conan --build=missing -g CMakeDeps

Mac OSX

Build the plugin:

$ ./.github/scripts/build-macos --config Release

You may want to change to RelWithDebInfo for a debug build.

If you're developing the plugin, I find this command to be useful for direct deploymet into OBS after building:

$ ./.github/scripts/build-macos --skip-deps && cp -R release/RelWithDebInfo/*.plugin ~/Library/Application\ Support/obs-studio/plugins/

Linux

Build the plugin:

$ ./.github/scripts/build-linux

Windows

Windows also needs Conan for OpenSSL. Run conan to get the dependency (make sure to run conan on the conanfile_win.txt):

> pip install conan
> conan profile detect --force
> conan install .\conanfile_win.txt --output-folder=./build_conan --build=missing -g CMakeDeps

Build the plugin:

> .\.github\scripts\Build-Windows.ps1 -Configuration Release

If you're developing the plugin, I find this command to be useful for direct deploymet into OBS after building:

> pwsh -ExecutionPolicy Bypass -File .\.github\scripts\Build-Windows.ps1 -Configuration RelWithDebInfo -SkipDeps && Copy-Item -Force -Recurse .\release\RelWithDebInfo\* "C:\Program Files\obs-studio\"

Contributing

We welcome contributions from the community! Please fork the repository and submit a pull request with your changes. We will review and merge your changes as soon as possible.

License

This project is licensed under the GPLv2 License - see the LICENSE file for details.