v3.1.0
Release date: 2022-11-29 19:24:40
Latest OpenNMT/CTranslate2 release: v4.4.0 (2024-09-09 17:21:54)
Changes
- The input prompt is no longer included in the result of `Whisper.generate`, as it is usually not useful in a transcription loop
- The default beam size in `Whisper.generate` is updated from 1 to 5 to match the default value in openai/whisper
- Generation options `min_length` and `no_repeat_ngram_size` now penalize the logits instead of the log probs, which may change some scores
- Raise a deprecation warning when reading the `TranslationResult` object as a list of dictionaries
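
To illustrate why penalizing the logits instead of the log probs "may change some scores", here is a minimal pure-Python sketch. It is not the library's implementation: the three-token vocabulary, the `EOS` index, and the `log_softmax` helper are assumptions made for the example. Masking a token after normalization leaves the other log probs untouched, while masking it before normalization redistributes its probability mass over the remaining tokens.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - lse for x in logits]


# Hypothetical 3-token vocabulary; index 2 plays the role of the end token
# that a min_length constraint would forbid at this step.
logits = [2.0, 1.0, 0.5]
EOS = 2

# Variant 1: normalize first, then mask the log prob of the end token.
# The scores of the other tokens are unchanged.
log_probs = log_softmax(logits)
masked_after = [lp if i != EOS else -math.inf for i, lp in enumerate(log_probs)]

# Variant 2: mask the logit first, then normalize.
# The remaining tokens are renormalized, so their scores increase.
masked_logits = [x if i != EOS else -math.inf for i, x in enumerate(logits)]
masked_before = log_softmax(masked_logits)

print("masked after normalization: ", masked_after)
print("masked before normalization:", masked_before)
```

In the second variant the remaining log probs sum to a probability of 1 again, so cumulative hypothesis scores can differ from those produced by earlier versions.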
New features
- Allow configuring the C++ logs from Python with the function `ctranslate2.set_log_level`
- Implement the timestamp decoding rules when the Whisper prompt does not include the token `<|notimestamps|>`
- Add option `return_no_speech_prob` to the method `Whisper.generate` for the result to include the probability of the no speech token
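
One possible use of the no speech probability is to drop audio chunks that are likely silence before post-processing their text. The sketch below is an assumption-laden illustration, not part of the library: the `keep_speech` helper and the 0.6 threshold are hypothetical, and the `SimpleNamespace` objects are stand-ins for generation results that expose a `no_speech_prob` attribute when `return_no_speech_prob=True` is requested.

```python
from types import SimpleNamespace

# Assumed cutoff for this example; tune it for the application.
NO_SPEECH_THRESHOLD = 0.6


def keep_speech(results, threshold=NO_SPEECH_THRESHOLD):
    """Return only the results whose no speech probability is below the threshold."""
    return [r for r in results if r.no_speech_prob < threshold]


# Stand-in objects so the sketch runs without a model or audio file.
results = [
    SimpleNamespace(sequences=[["Hello", "world"]], no_speech_prob=0.02),
    SimpleNamespace(sequences=[[""]], no_speech_prob=0.91),
]

kept = keep_speech(results)
print(len(kept), "of", len(results), "chunks kept")
```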
Fixes and improvements
- Improve performance of the Whisper model when generating with a context
- Fix timestamp tokens in the Whisper vocabulary to use the correct format (`<|X.XX|>`)
- Fix AVX and NEON log functions to return -inf on log(0) instead of NaN
- When info logs are enabled, log the system configuration only when the first model is loaded and not immediately when the library is loaded
- Define a `LogitsProcessor` abstract class to apply arbitrary updates to the logits during decoding
- Update oneDNN to 2.7.2
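
The idea behind a logits processor abstraction can be sketched in Python as follows. This is a conceptual analogue only: the actual `LogitsProcessor` is defined in the C++ library, and the `apply` signature, the `NoRepeatNgram` class, and the list-based logits used here are assumptions made for illustration. Each processor receives the tokens generated so far and updates the logits in place before the next token is selected.

```python
import math
from abc import ABC, abstractmethod


class LogitsProcessor(ABC):
    """Hypothetical Python analogue of a decoding-time logits hook."""

    @abstractmethod
    def apply(self, step, generated, logits):
        """Update `logits` in place given the token ids generated so far."""


class NoRepeatNgram(LogitsProcessor):
    """Forbid any token that would complete an n-gram already generated."""

    def __init__(self, n):
        self.n = n

    def apply(self, step, generated, logits):
        n = self.n
        # The last n-1 tokens form the prefix of the next candidate n-gram.
        prefix = tuple(generated[len(generated) - (n - 1):])
        # Scan earlier n-grams; ban the token that followed the same prefix.
        for i in range(len(generated) - n + 1):
            if tuple(generated[i:i + n - 1]) == prefix:
                logits[generated[i + n - 1]] = -math.inf


# Token ids 3, 1, 3 were generated; the bigram (3, 1) must not repeat.
logits = [0.0, 0.0, 0.0, 0.0]
NoRepeatNgram(2).apply(step=3, generated=[3, 1, 3], logits=logits)
print(logits)
```

Keeping each constraint in its own class lets the decoding loop run a list of processors in sequence without knowing what any of them does.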