Sen descrición

Abandon-ht 5503f8d036 Update wave-reader.cc (#356) hai 11 meses
.github f92325f041 Add Silero VAD version 4 (#353) hai 11 meses
android ca9c19facd Add Android WearOS demo (#319) hai 1 ano
c-api-examples 401de81194 Add hotwords support to Swift/Go/Python/C#/Kotlin APIs (#260) hai 1 ano
cmake f92325f041 Add Silero VAD version 4 (#353) hai 11 meses
dotnet-examples 77c34f3cff Add microphone example for .Net (#248) %!s(int64=2) %!d(string=hai) anos
ffmpeg-examples 0ba47330ce FFmpeg: Feed PCM to K2 direclty, without any buffer. (#183) %!s(int64=2) %!d(string=hai) anos
go-api-examples 343eab32ab Update sherpa-ncnn-go version (#245) %!s(int64=2) %!d(string=hai) anos
ios-swift b4260f7acf Refactor code (#163) %!s(int64=2) %!d(string=hai) anos
ios-swiftui f12aac47c3 Add SwiftUI demo project (#175) %!s(int64=2) %!d(string=hai) anos
mfc-examples 98ba3218db Fix MFC examples (#268) hai 1 ano
nodejs-examples 41ebcfb11b Add WebAssembly + NodeJS example. (#311) hai 1 ano
python-api-examples 87899c26fe Add Python ASR example with alsa (#324) hai 1 ano
scripts 2e89994fc6 Fix building. (#325) hai 1 ano
sherpa-ncnn 5503f8d036 Update wave-reader.cc (#356) hai 11 meses
swift-api-examples f92325f041 Add Silero VAD version 4 (#353) hai 11 meses
toolchains f9a850c725 Support RISC-V (#213) %!s(int64=2) %!d(string=hai) anos
wasm 41ebcfb11b Add WebAssembly + NodeJS example. (#311) hai 1 ano
.clang-format 78ce6f6233 Add CI test and refactor the code a bit. (#4) %!s(int64=2) %!d(string=hai) anos
.gitignore 41ebcfb11b Add WebAssembly + NodeJS example. (#311) hai 1 ano
CMakeLists.txt 87899c26fe Add Python ASR example with alsa (#324) hai 1 ano
CPPLINT.cfg 78ce6f6233 Add CI test and refactor the code a bit. (#4) %!s(int64=2) %!d(string=hai) anos
LICENSE f91cd77f83 Use standard apache 2.0 license (#123) %!s(int64=2) %!d(string=hai) anos
MANIFEST.in 2b5300e975 Add Python API (#47) %!s(int64=2) %!d(string=hai) anos
README.md f92325f041 Add Silero VAD version 4 (#353) hai 11 meses
build-aarch64-linux-gnu.sh 661b4e715c fix the version of alsa-lib (#348) hai 1 ano
build-android-arm64-v8a-with-vulkan.sh 6b7072b3d9 Pack all sources into a single tar for embedded systems (#96) %!s(int64=2) %!d(string=hai) anos
build-android-arm64-v8a.sh 6b7072b3d9 Pack all sources into a single tar for embedded systems (#96) %!s(int64=2) %!d(string=hai) anos
build-android-armv7-eabi.sh 6b7072b3d9 Pack all sources into a single tar for embedded systems (#96) %!s(int64=2) %!d(string=hai) anos
build-android-x86-64.sh 6b7072b3d9 Pack all sources into a single tar for embedded systems (#96) %!s(int64=2) %!d(string=hai) anos
build-android-x86.sh 2878e796a5 build script for android-x86 (#228) %!s(int64=2) %!d(string=hai) anos
build-apk.sh fc4d3bc2d9 Rlease pre-built APK for English and French (#256) hai 1 ano
build-arm-linux-gnueabihf.sh 661b4e715c fix the version of alsa-lib (#348) hai 1 ano
build-ios.sh 7a5f80edc4 fix building for iOS (#92) %!s(int64=2) %!d(string=hai) anos
build-m3axpi.sh b701bc8daa Fix building on m3axpi (#99) %!s(int64=2) %!d(string=hai) anos
build-riscv64-linux-gnu.sh 661b4e715c fix the version of alsa-lib (#348) hai 1 ano
build-swift-macos.sh f92325f041 Add Silero VAD version 4 (#353) hai 11 meses
build-wasm-simd-for-nodejs.sh 41ebcfb11b Add WebAssembly + NodeJS example. (#311) hai 1 ano
build-wasm-simd.sh 72ea103e9b Add WebAssembly (#300) hai 1 ano
install-vulkan-macos.md d5dd41a7e2 Support building android app (#37) %!s(int64=2) %!d(string=hai) anos
pack-for-embedded-systems.sh 884ce6d9eb Use tencent/ncnn master. (#320) hai 1 ano
release.sh 797f1e4243 Release pre-compiled libs for Android and iOS (#181) %!s(int64=2) %!d(string=hai) anos
setup.py df9ae1ff84 Publish to PyPI (#48) %!s(int64=2) %!d(string=hai) anos

README.md

Supported functions

Real-time Speech recognition Voice activity detection
✔️ ✔️

Supported platforms

Architecture Android iOS Windows macOS linux
x64 ✔️ ✔️ ✔️ ✔️
x86 ✔️ ✔️
arm64 ✔️ ✔️ ✔️ ✔️ ✔️
arm32 ✔️ ✔️
riscv64 ✔️

Supported programming languages

1. C++ 2. C 3. Python 4. JavaScript
✔️ ✔️ ✔️ ✔️
5. Go 6. C# 7. Kotlin 8. Swift
✔️ ✔️ ✔️ ✔️

It also supports WebAssembly.

Introduction

This repository supports running the following functions locally

  • Streaming speech-to-text (i.e., real-time speech recognition)
  • VAD (e.g., silero-vad)

on the following platforms and operating systems:

with the following APIs

  • C++, C, Python, Go, C#
  • Kotlin
  • JavaScript
  • Swift

We support all platforms that ncnn supports.

Everything can be compiled from source with static link. The generated executable depends only on system libraries.

HINT: It does not depend on PyTorch or any other inference frameworks other than ncnn.

Please see the documentation https://k2-fsa.github.io/sherpa/ncnn/index.html for installation and usages, e.g.,

  • How to build an Android app
  • How to download and use pre-trained models

We provide a few YouTube videos for demonstration about real-time speech recognition with sherpa-ncnn using a microphone:

Links for pre-built Android APKs

Description URL
Streaming speech recognition Address

Links for pre-trained models

https://github.com/k2-fsa/sherpa-ncnn/releases/tag/models

Useful links

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.

See also