Overview
Convert speech to text using the Whisper model with word-level timestamps.
- Audio Input: Record your voice or upload an audio file.
- Automatic Speech Recognition: Processes the audio with the Whisper model to generate a transcription.
- String Output: Displays the transcribed text.
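The three stages above can be approximated outside NodeTool. The following is a minimal sketch, assuming the Hugging Face `transformers` speech recognition pipeline; the checkpoint name `openai/whisper-small` and the helper names `transcribe` and `format_words` are illustrative assumptions, not necessarily what the workflow uses internally. The `sample` dictionary is hand-written to mirror the shape the pipeline returns when word-level timestamps are requested; it is not real model output.

```python
from typing import Any, Optional


def transcribe(audio_path: str, model_name: str = "openai/whisper-small") -> dict[str, Any]:
    """Run Whisper on an audio file and request word-level timestamps.

    Assumption: uses the Hugging Face `transformers` ASR pipeline; the
    checkpoint name is illustrative, not necessarily what NodeTool loads.
    """
    # Imported lazily because transformers is a heavy, optional dependency.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_name)
    return asr(audio_path, return_timestamps="word")


def _fmt(seconds: Optional[float]) -> str:
    """Format a timestamp, tolerating a missing (None) boundary."""
    return f"{seconds:.2f}s" if seconds is not None else "?"


def format_words(result: dict[str, Any]) -> list[str]:
    """Turn the pipeline's word chunks into '[start - end] word' lines."""
    lines = []
    for chunk in result.get("chunks", []):
        start, end = chunk["timestamp"]
        lines.append(f"[{_fmt(start)} - {_fmt(end)}] {chunk['text'].strip()}")
    return lines


# Hand-written sample in the shape the pipeline returns (not real model output).
sample = {
    "text": " Hello world",
    "chunks": [
        {"text": " Hello", "timestamp": (0.0, 0.42)},
        {"text": " world", "timestamp": (0.42, 0.9)},
    ],
}

for line in format_words(sample):
    print(line)
```

To try it on a real recording, pass a path to a local audio file to `transcribe` and feed the result to `format_words`; the plain transcription is available under the `"text"` key of the same result.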
How to Use:
- Record your voice using the audio input
- Hit the run button
- Read the transcription in the output
Tags
start, audio, huggingface
Workflow Diagram
graph TD
audio_7["audio"]
transciption_8["transcription"]
automaticspeechrecognition_c51917["AutomaticSpeechRecognition"]
audio_7 --> automaticspeechrecognition_c51917
automaticspeechrecognition_c51917 --> transciption_8
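One way to read the diagram is as a small directed graph whose edges determine the order in which nodes can run. The sketch below copies the node identifiers from the diagram and uses Python's standard `graphlib` module to derive that order; it illustrates the data flow only and makes no claim about how NodeTool schedules nodes internally.

```python
from graphlib import TopologicalSorter

# Edges copied from the diagram: source node -> downstream node.
edges = [
    ("audio_7", "automaticspeechrecognition_c51917"),
    ("automaticspeechrecognition_c51917", "transciption_8"),
]

# TopologicalSorter expects a mapping of node -> set of predecessor nodes.
predecessors: dict[str, set[str]] = {}
for source, target in edges:
    predecessors.setdefault(target, set()).add(source)
    predecessors.setdefault(source, set())

# Each node appears only after every node it depends on.
execution_order = list(TopologicalSorter(predecessors).static_order())
print(execution_order)
```

Because the graph is a single chain, the order is the audio input first, then the speech recognition node, then the text output.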
How to Use
- Open NodeTool and create a new workflow
- Import this workflow from the examples gallery or build it manually following the diagram above
- Configure the audio input node by recording your voice or uploading an audio file
- Run the workflow to see results
Related Workflows
Browse other workflow examples to discover more capabilities.