STT-Function

With the Speech-to-Text (STT) Function you can transcribe a file ("convert" an audio/video file into text). Internally Whisper from OpenAI (https://github.com/openai/whisper) is used to transcribe the audiofile.

Structure

The container has two folders attached, the input folder with files that should be transcribed and the output path, where the transcript should be saved to.

Setup

Make sure Podman or Docker is installed.

Download the Model into the Folder where you will build the container image. Download Link or run wget https://openaipublic.azureedge.net/main/whisper/models/e5b1a55b89c1367dacf97e3e19bfd829a01529dbfdeefa8caeb59b3f1b81dadb/large-v3.pt

podman build -t stt-function .
podman run -e LANGUAGE_CODE='de' -e WHISPER_MODEL='tiny' -v '/path/to/audio_video/file/':/app/input_files/ -v /output_path/of/transcript/:/app/transcripts/ --name stt-function_container --rm -t stt-function

1.1 KiB Raw Permalink Blame History

STT-Function

Structure

Setup

1.1 KiB

Raw Permalink Blame History