
k2-fsa/sherpa-onnx

Forks: 214 Stars: 1381 (updated 2024-06-10 19:29:06)

license: Apache-2.0

Language: C++

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, and Swift.

Latest release: v1.9.27 (2024-06-05 00:28:15)

Official website · GitHub repository

Introduction

This repository supports running the following functions locally:

  • Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
  • Text-to-speech (i.e., TTS)
  • Speaker identification
  • Speaker verification
  • Spoken language identification
  • Audio tagging
  • VAD (e.g., silero-vad)
  • Keyword spotting

on the following platforms and operating systems:

with the following APIs:

  • C++, C, Python, Go, C#
  • Java, Kotlin, JavaScript
  • Swift

Links for pre-built Android APKs

Description | URL | For users in China
Streaming speech recognition | Address | Click here
Text-to-speech | Address | Click here
Voice activity detection (VAD) | Address | Click here
VAD + non-streaming speech recognition | Address | Click here
Two-pass speech recognition | Address | Click here
Audio tagging | Address | Click here
Audio tagging (WearOS) | Address | Click here
Speaker identification | Address | Click here
Spoken language identification | Address | Click here
Keyword spotting | Address | Click here

Links for pre-trained models

Description | URL
Speech recognition (speech to text, ASR) | Address
Text-to-speech (TTS) | Address
VAD | Address
Keyword spotting | Address
Audio tagging | Address
Speaker identification (Speaker ID) | Address
Spoken language identification (Language ID) | See multi-lingual Whisper ASR models from Speech recognition
Punctuation | Address

Useful links

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for the next-gen Kaldi WeChat and QQ discussion groups.

Recent releases (data updated 2024-06-10 19:27:35):

2024-06-05 00:28:15 v1.9.27

2024-05-31 13:18:29 v1.9.26

2024-05-17 10:54:32 v1.9.25

2024-05-11 14:33:06 v1.9.24

2024-04-25 12:29:52 v1.9.23

2024-04-19 18:40:40 v1.9.22

2024-04-13 19:10:00 v1.9.19

2024-04-13 16:35:21 v1.9.18

2024-04-12 18:46:48 punctuation-models

2024-04-09 16:04:32 audio-tagging-models

Topics:

aarch64, android, arm32, asr, cpp, csharp, dotnet, ios, linux, macos, mfc, onnx, openkylin, raspberry-pi, risc-v, speech-to-text, text-to-speech, vits, windows
