v1.3.0
版本发布时间: 2023-04-15 22:41:02
ggerganov/whisper.cpp最新发布版本:v1.6.2(2024-05-27 15:36:55)
Overview
This release should be considered in Beta stage, since I haven't done a lot of testing and I am not sure if I didn't break something. But overall, I believe both the performance and the quality are improved.
- Added Core ML support #566
- Restored decoding fallbacks with default size of 2 instead of 5 (f19e23fbd108ec3ac458c7a19b31c930719e7a94)
- Pad the audio with zeros instead of the spectrogram (5108b30e6daf361c856abb6b86e5038500bdbeb1)
- Added talk-llama example
- Added
whisper_state
which allows parallel transcriptions with a single model in memory (#523)
The C-style API has been extended significantly to support the new whisper_state
, but in general should be backwards compatible.
The only breaking change is in the callbacks signatures.
Please provide feedback in the discussion if you observe any issues.
The next release v1.4.0
will follow up relatively soon and will provide 4-bit integer quantization support.
What's Changed
- update csv output format to match OpenAI's Whisper dataframe output by @hykelvinlee42 in https://github.com/ggerganov/whisper.cpp/pull/552
- Go binding: NewContext now returns a clean context by @polarmoon in https://github.com/ggerganov/whisper.cpp/pull/537
- Added whisper state + default state on the whisper_context by @sandrohanea in https://github.com/ggerganov/whisper.cpp/pull/523
- whisper.android: Enable fp16 instrinsics (FP16_VA) which is supported by ARMv8.2 or later. by @tinoue in https://github.com/ggerganov/whisper.cpp/pull/572
- Add quality comparison helper by @venkr in https://github.com/ggerganov/whisper.cpp/pull/569
- whisper.android: Support benchmark for Android example. by @tinoue in https://github.com/ggerganov/whisper.cpp/pull/542
- Fix MUSL Linux build by @ggerganov in https://github.com/ggerganov/whisper.cpp/pull/576
- Change default encoding to UTF-8 by @Kamilake in https://github.com/ggerganov/whisper.cpp/pull/605
- Provide option for creating JSON output by @tuxpoldo in https://github.com/ggerganov/whisper.cpp/pull/615
- readme : add react-native bindings by @jhen0409 in https://github.com/ggerganov/whisper.cpp/pull/619
- Fixed language auto-detection for state provided processing. by @sandrohanea in https://github.com/ggerganov/whisper.cpp/pull/627
- xcodeproj : add
-O3 -DNDEBUG
in release mode by @jhen0409 in https://github.com/ggerganov/whisper.cpp/pull/640 - Nodejs Addon blocking main thread. Implemented Napi::AsyncWorker by @LucasZNK in https://github.com/ggerganov/whisper.cpp/pull/642
- Include link to R wrapper in README by @jwijffels in https://github.com/ggerganov/whisper.cpp/pull/626
- Add a cmake flag to disable F16C by @a5huynh in https://github.com/ggerganov/whisper.cpp/pull/628
- Add talk-llama example by @ggerganov in https://github.com/ggerganov/whisper.cpp/pull/664
- Add Alpaca support to talk-llama example by @ejones in https://github.com/ggerganov/whisper.cpp/pull/668
- Update README.md by @razodactyl in https://github.com/ggerganov/whisper.cpp/pull/682
- issue #470 - working 32-bit ARM by @clach04 in https://github.com/ggerganov/whisper.cpp/pull/486
- whisper : add initial_prompt param by @jhen0409 in https://github.com/ggerganov/whisper.cpp/pull/645
- fix typo in JSON output by @egorFiNE in https://github.com/ggerganov/whisper.cpp/pull/648
- Fix shell script ./models/download-ggml-model.sh to handle spaces and special characters in paths by @be-next in https://github.com/ggerganov/whisper.cpp/pull/677
- Fixed test to new async implementation by @LucasZNK in https://github.com/ggerganov/whisper.cpp/pull/686
- Minor: fixing usage message for talk-llama by @InconsolableCellist in https://github.com/ggerganov/whisper.cpp/pull/687
- Small typo by @ZiggerZZ in https://github.com/ggerganov/whisper.cpp/pull/688
- feat: add progress callback by @pajowu in https://github.com/ggerganov/whisper.cpp/pull/600
- ggml : fix q4_1 dot product types by @novag in https://github.com/ggerganov/whisper.cpp/pull/759
- Exposed various parts to the Go Interface by @bmurray in https://github.com/ggerganov/whisper.cpp/pull/697
- Adds shell command example for --print-colors by @bocytko in https://github.com/ggerganov/whisper.cpp/pull/710
- Makefile: disable avx in case f16c is not available by @duthils in https://github.com/ggerganov/whisper.cpp/pull/706
- Making the quick start instructions clearer. by @Onlyartist9 in https://github.com/ggerganov/whisper.cpp/pull/716
- Add lrc output support by @WhichWho in https://github.com/ggerganov/whisper.cpp/pull/718
- Corrects default speak.sh path in talk-llama by @mab122 in https://github.com/ggerganov/whisper.cpp/pull/720
- Add msvc compiler args /utf-8 fix error C3688 by @WhichWho in https://github.com/ggerganov/whisper.cpp/pull/721
- Changed convert-pt-to-ggml.py to use .tiktoken tokenizer files by @ivan-gorin in https://github.com/ggerganov/whisper.cpp/pull/725
- talk/talk-llama: add basic example script for eleven-labs tts by @DGdev91 in https://github.com/ggerganov/whisper.cpp/pull/728
- readme : add Unity3d bindings by @Macoron in https://github.com/ggerganov/whisper.cpp/pull/733
- Update stream.cpp by @AliAlameh in https://github.com/ggerganov/whisper.cpp/pull/501
- Fix typos in whisper.h by @GitAritron in https://github.com/ggerganov/whisper.cpp/pull/737
- Update LICENSE by @masguit42 in https://github.com/ggerganov/whisper.cpp/pull/739
- fix potential memory leaks by @baderouaich in https://github.com/ggerganov/whisper.cpp/pull/740
- readme: Add alternate swift bindings by @exPHAT in https://github.com/ggerganov/whisper.cpp/pull/755
- Fix the bug related to word splitting errors in the "tokenize" function. by @AfryMask in https://github.com/ggerganov/whisper.cpp/pull/760
- Do not launch threads for
log_mel_spectrogram
when singlethreaded by @maxilevi in https://github.com/ggerganov/whisper.cpp/pull/763 - Core ML support by @ggerganov in https://github.com/ggerganov/whisper.cpp/pull/566
- ggml : fix build on whisper.android (ARM_NEON) by @jhen0409 in https://github.com/ggerganov/whisper.cpp/pull/764
New Contributors
- @hykelvinlee42 made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/552
- @tinoue made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/572
- @venkr made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/569
- @Kamilake made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/605
- @tuxpoldo made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/615
- @jhen0409 made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/619
- @LucasZNK made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/642
- @jwijffels made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/626
- @a5huynh made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/628
- @ejones made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/668
- @razodactyl made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/682
- @clach04 made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/486
- @egorFiNE made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/648
- @be-next made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/677
- @InconsolableCellist made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/687
- @ZiggerZZ made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/688
- @pajowu made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/600
- @novag made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/759
- @bmurray made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/697
- @bocytko made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/710
- @duthils made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/706
- @Onlyartist9 made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/716
- @WhichWho made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/718
- @mab122 made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/720
- @ivan-gorin made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/725
- @DGdev91 made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/728
- @Macoron made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/733
- @AliAlameh made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/501
- @GitAritron made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/737
- @masguit42 made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/739
- @baderouaich made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/740
- @exPHAT made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/755
- @AfryMask made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/760
- @maxilevi made their first contribution in https://github.com/ggerganov/whisper.cpp/pull/763
Full Changelog: https://github.com/ggerganov/whisper.cpp/compare/v1.2.1...v1.3.0
1、 whisper-bin-Win32.zip 1.25MB
2、 whisper-bin-x64.zip 1.42MB
3、 whisper-blas-bin-Win32.zip 7.65MB
4、 whisper-blas-bin-x64.zip 12.84MB