五千年(敝帚自珍)

近期网站停站换新具体说明
按以上说明时间，延期一周至网站时间26-27左右。具体实施前两天会在此提前通知具体实施时间

主题：OpenAI Whisper 转换声音到文字 -- nanimarcusboy

共:💬12 🌺51

科技天地

全看分页树展 · 主题跟帖

复 OpenAI Whisper 转换声音到文字

利用 Whisper 服务识别，速度是本机的 20 倍

参考

https://platform.openai.com/docs/guides/speech-to-text

----

import openai

audio_file= open("/path/to/file/audio.mp3", "rb")

transcript = openai.Audio.transcribe("whisper-1", audio_file)

----

后一句修改一下，这样拿到的是 srt 文件

transcript = openai.Audio.transcribe("whisper-1", audio_file, ,

response_format="srt")

----

文件大小限制是 25M，所以一般先转换成mp3：

ffmpeg -hide_banner -y -i input.mp4 -acodec libmp3lame -b:a 64k output.mp3

25M mp3 差不多 50 分钟。

再大的话可切成小段，处理后再合并。

----

典型 mp3 文件长度 600秒，识别用了25秒：

-- length : 600.792 seconds

-- runtime: 25.17 seconds

----

好吧，现在看看给了我多少quota.

----

全看分页树展 · 主题跟帖

相关回复上下关系8
- 🙂OpenAI Whisper 转换声音到文字 34 nanimarcusboy 字2899 2023-02-12 02:35:21
  - 🙂利用 Whisper 服务识别，速度是本机的 20 倍
  - 🙂没看懂, 既然用 API 对文件大小有限制, 那就买块显卡本 4 nanimarcusboy 字768 2023-03-12 10:49:42
  - 🙂Whisper 隐含是英文，模型用 small.en，语言不 2 nanimarcusboy 字574 2023-02-26 06:28:47
    🙂请教下，可以识别中文语音然后翻译成英文吗 hamigua 字9 2023-06-03 00:39:47
    🙂试了下是可以的，多谢楼主了 hamigua 字283 2023-06-03 07:45:19
  - 🙂OpenAI Whisper 使用的感受和想法 4 nanimarcusboy 字864 2023-02-22 06:01:33
    🙂你显卡是哪一代的 1 川普字78 2023-04-14 23:23:40

有趣有益，互惠互利；开阔视野，博采众长。
虚拟的网络，真实的人。天南地北客，相逢皆朋友

Copyright © cchere 西西河

版面群落趣味社区帮助常见问题网站简介基本河规隐私条款使用条款广告说明