No description
Find a file
Niklas Mueller 7325f650b6 INIT
2024-07-23 18:23:54 +02:00
.gitignore INIT 2024-07-23 18:23:54 +02:00
Dockerfile INIT 2024-07-23 18:23:54 +02:00
init.sh INIT 2024-07-23 18:23:54 +02:00
README.md INIT 2024-07-23 18:23:54 +02:00
requirements.txt INIT 2024-07-23 18:23:54 +02:00
runner.py INIT 2024-07-23 18:23:54 +02:00

STT-Function

With the Speech-to-Text (STT) Function you can transcribe a file ("convert" an audio/video file into text). Internally Whisper from OpenAI (https://github.com/openai/whisper) is used to transcribe the audiofile.

Structure

  • The container has two folders attached, the input folder with files that should be transcribed and the output path, where the transcript should be saved to.

Setup

Make sure Podman or Docker is installed.

Download the Model into the Folder where you will build the container image. Download Link or run wget https://openaipublic.azureedge.net/main/whisper/models/e5b1a55b89c1367dacf97e3e19bfd829a01529dbfdeefa8caeb59b3f1b81dadb/large-v3.pt

podman build -t stt-function .
podman run -e LANGUAGE_CODE='de' -e WHISPER_MODEL='tiny' -v '/path/to/audio_video/file/':/app/input_files/ -v /output_path/of/transcript/:/app/transcripts/ --name stt-function_container --rm -t stt-function