v0.5.0

版本发布时间: 2023-04-25 23:04:48

SYSTRAN/faster-whisper最新发布版本:v1.0.3(2024-07-01 18:05:08)

Improved logging

Some information are now logged under INFO and DEBUG levels. The logging level can be configured like this:

import logging

logging.basicConfig()
logging.getLogger("faster_whisper").setLevel(logging.DEBUG)

New arguments were added to the WhisperModel constructor to better control how the models are downloaded:

download_root to specify where the model should be downloaded.
local_files_only to avoid downloading the model and directly return the path to the cached model, it it exists.

Improve the default VAD behavior to prevent some words from being assigned to the incorrect speech chunk in the original audio
Fix incorrect application of option condition_on_previous_text=False (note that the bug still exists in openai/whisper v20230314)
Fix segment timestamps that are sometimes inconsistent with the words timestamps after VAD
Extend the Segment structure with additional properties to match openai/whisper
Rename AudioInfo to TranscriptionInfo and add a new property options to summarize the transcription options that were used