Whisper——部署fast-whisper中文语音识别模型

扎眼的阳光 • 2024年2月19日下午7:29 • IT • 阅读 63

whisper：https://github.com/openai/whisper/tree/main
参考文章：Whisper OpenAI开源语音识别模型

目录

环境配置

pip install faster-whisper transformers

准备tiny模型

需要其他版本的可以自己下载：https://huggingface.co/openai

原始中文语音模型：

https://huggingface.co/openai/whisper-tiny

微调后的中文语音模型：

git clone https://huggingface.co/xmzhu/whisper-tiny-zh

补下一个：tokenizer.json

https://huggingface.co/openai/whisper-tiny/resolve/main/tokenizer.json?download=true

模型转换

float16：

ct2-transformers-converter --model whisper-tiny-zh/ --output_dir whisper-tiny-zh-ct2 --copy_files tokenizer.json preprocessor_config.json --quantization float16

int8：

ct2-transformers-converter --model whisper-tiny-zh/ --output_dir whisper-tiny-zh-ct2-int8 --copy_files tokenizer.json preprocessor_config.json --quantization int8

代码

from faster_whisper import WhisperModel

# model_size = "whisper-tiny-zh-ct2"
# model_size = "whisper-tiny-zh-ct2-int8"

# Run on GPU with FP16
# model = WhisperModel(model_size, device="cuda", compute_type="float16")
model = WhisperModel(model_size, device="cpu", compute_type="int8")

# or run on GPU with INT8
# model = WhisperModel(model_size, device="cuda", compute_type="int8_float16")
# or run on CPU with INT8
# model = WhisperModel(model_size, device="cpu", compute_type="int8")

segments, info = model.transcribe("output_file.wav", beam_size=5, language='zh')

print("Detected language '%s' with probability %f" % (info.language, info.language_probability))

for segment in segments:
    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))

版权声明：本文为博主作者：Irving.Gao原创文章，版权归属原作者，如果侵权，请联系我们删除！

原文链接：https://blog.csdn.net/qq_45779334/article/details/135564786

windows 语音识别

赞 (0)

扎眼的阳光普通用户

0

2023考PMP考试难度大吗？全盘解析

上一篇 2024年2月19日

【看表情包学Linux】系统下的文件操作 | 文件系统接口 | 系统调用与封装 | open,write,close 接口 | 系统传递标记位 O_RDWR,O_RDONLY,O_WRONLY…

下一篇 2024年2月19日