v1.1.0
版本发布时间: 2023-01-15 20:00:56
ggerganov/whisper.cpp最新发布版本:v1.6.2(2024-05-27 15:36:55)
Overview
The major change in this pre-release is the improved decoding implementation in whisper.cpp
:
- Support for average logprob and entropy based criteria for fallback
- Support for temperature
T > 0
- Improved Greedy decoder via
best_of
parameter forT > 0
- Add beam search decoding (a.k.a
beam_size
)
More information about the decoding changes can be found in #291 Additionally, there are a few performance improvements for Apple Silicon, WASM and non-F16C platforms. Support for POWER9 architectures has been added.
The reason that this is a pre-release and not an official release is that the new implementation has not been sufficiently tested yet and the existing bindings for other languages have not been updated to support the API changes. The official release 1.1.x
will be created when there is enough feedback about the new decoding implementation and when the bindings have been updated. So make sure to send your feedback in the discussion created for this pre-release. For now, the 1.0.4
release should be considered more stable.
What's Changed
Core ggml
/ whisper
-
ggml
: POWER9 support by @fitzsim in #320, #349, #369 -
ggml
: simplify the SIMD code by @ggerganov in #324 -
ggml
: add SSE3 and fp16 conversion lookup table by @abitofevrything in #368 -
ggml
: utilise Accelerate's vDSP for some computations d51fc3ee0a0038cdf1522ca3d58b58299de41eb8 -
ggml
: speed-up softmax compute via Accelerate and loop unrolling d61d55cd4b9fe77511c8eea28d0220ce552f7008 -
ggml
: do not start extra threads when using BLAS d347a59a5f224f6a5ab0084ec95715451972d3b0 -
whisper
: do sample_to_timestamp calculation with 64 bit precision to avoid overflow by @boolemancer in #388 -
whisper
: various code clean-up and improvements by @asmaloney in #317 #318 #319 #322 etc -
whisper
: improve decoding by @ggerganov in #291 -
whisper
: account for speed_up flag for short audio #405
C-style API
- Add loader class to allow loading from buffer and others by @prsyahmi in https://github.com/ggerganov/whisper.cpp/pull/353
- Add
whisper_token_data::plog
- Add
whisper_init_from_file()
- Add
whisper_init_from_buffer()
- Change
whisper_init()
- Remove
whisper_sample_best()
- Remove
whisper_sample_timestamp()
- Add
whisper_n_audio_ctx()
- Add
whisper_get_logits()
- Remove
whisper_get_probs()
- Change
struct whisper_full_params
Bindings
- Golang bindings by @djthorpe in #287, #379, #384
Examples
-
whisper.android
: remove android ABI constraint by @Digipom in #301 -
whisper.swiftui
: SwiftUI example by @Digipom in #308 -
main
: add-ocsv
, aka--output-csv
for writing CSV file containing millisecond timestamps by @NielsMayer in #340 -
command
: refactor to split command list & general transcription modes by @asmaloney in #331 -
command
: always-prompt mode by @dnhkng in #383 -
stream
: fix data race on bool + avoid division-by-zero a466c3404dc62dc221061bb37fb8f78741d749b8 -
stream
: fix a bug that inserted a lot of empty audio at the start a6dbd9188b13378dc36e2c669b9a22e17b4201d1 -
bench.wasm
: print system info fafd78945d5a7ea11ffa31fa6c05dd6593b7d031
New Contributors
- @djthorpe made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/287
- @0xmohit made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/296
- @asmaloney made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/298
- @fitzsim made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/320
- @NielsMayer made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/340
- @aviks made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/345
- @eltociear made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/346
- @abitofevrything made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/368
- @Mike-Bell made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/381
- @dnhkng made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/383
- @prsyahmi made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/353
- @ianb made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/391
Full Changelog: https://github.com/ggerganov/whisper.cpp/compare/v1.0.4...v1.1.0
Highlights
- Sample SwiftUI application example/whisper.swiftui
1、 whisper-bin-Win32.zip 1.04MB
2、 whisper-bin-x64.zip 1.19MB
3、 whisper-blas-bin-Win32.zip 7.47MB
4、 whisper-blas-bin-x64.zip 12.66MB