k2-fsa/sherpa-onnx
Fork: 214 Star: 1381 (updated 2024-06-10 19:29:06)
license: Apache-2.0
Language: C++
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
Latest release: v1.9.27 (2024-06-05 00:28:15)
Introduction
This repository supports running the following functions locally:
- Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
- Text-to-speech (i.e., TTS)
- Speaker identification
- Speaker verification
- Spoken language identification
- Audio tagging
- VAD (e.g., silero-vad)
- Keyword spotting
on the following platforms and operating systems:
- x86, x86_64, 32-bit ARM, 64-bit ARM (arm64, aarch64), RISC-V (riscv64)
- Linux, macOS, Windows, openKylin
- Android, WearOS
- iOS
- NodeJS
- WebAssembly
- Raspberry Pi
- RV1126
- LicheePi4A
- VisionFive 2
- 旭日X3派
- etc
with the following APIs
- C++, C, Python, Go, C#
- Java, Kotlin, JavaScript
- Swift
Links for pre-built Android APKs
Description | URL | For users in China |
---|---|---|
Streaming speech recognition | Address | Here |
Text-to-speech | Address | Here |
Voice activity detection (VAD) | Address | Here |
VAD + non-streaming speech recognition | Address | Here |
Two-pass speech recognition | Address | Here |
Audio tagging | Address | Here |
Audio tagging (WearOS) | Address | Here |
Speaker identification | Address | Here |
Spoken language identification | Address | Here |
Keyword spotting | Address | Here |
Links for pre-trained models
Description | URL |
---|---|
Speech recognition (speech to text, ASR) | Address |
Text-to-speech (TTS) | Address |
VAD | Address |
Keyword spotting | Address |
Audio tagging | Address |
Speaker identification (Speaker ID) | Address |
Spoken language identification (Language ID) | See multi-lingual Whisper ASR models from Speech recognition |
Punctuation | Address |
Useful links
- Documentation: https://k2-fsa.github.io/sherpa/onnx/
- Bilibili demo videos: https://search.bilibili.com/all?keyword=%E6%96%B0%E4%B8%80%E4%BB%A3Kaldi
How to reach us
Please see https://k2-fsa.github.io/sherpa/social-groups.html for the next-gen Kaldi (新一代 Kaldi) WeChat and QQ discussion groups.
Recent releases (data updated 2024-06-10 19:27:35):
2024-06-05 00:28:15 v1.9.27
2024-05-31 13:18:29 v1.9.26
2024-05-17 10:54:32 v1.9.25
2024-05-11 14:33:06 v1.9.24
2024-04-25 12:29:52 v1.9.23
2024-04-19 18:40:40 v1.9.22
2024-04-13 19:10:00 v1.9.19
2024-04-13 16:35:21 v1.9.18
2024-04-12 18:46:48 punctuation-models
2024-04-09 16:04:32 audio-tagging-models
Topics:
aarch64, android, arm32, asr, cpp, csharp, dotnet, ios, linux, macos, mfc, onnx, openkylin, raspberry-pi, risc-v, speech-to-text, text-to-speech, vits, windows