Type: gemini.audio.Transcribe

Namespace: gemini.audio

Description

Transcribe audio to text using Google’s Gemini models. google, transcription, speech-to-text, audio, whisper, ai

This node converts audio input into text using Google's multimodal Gemini models.
Supports various audio formats and provides accurate speech-to-text transcription.

Use cases:
- Convert recorded audio to text
- Transcribe podcasts and interviews
- Generate subtitles from audio tracks
- Create meeting notes from audio recordings
- Analyze speech content in audio files

Properties

Property Type Description Default
audio audio The audio file to transcribe. {"type":"audio","uri":"","asset_id":null,"data"...
model enum The Gemini model to use for transcription gemini-2.5-flash
prompt str Instructions for the transcription. You can customize this to request specific formatting or focus. Transcribe the following audio accurately. Retu...

Outputs

Output Type Description
output str  

Browse other nodes in the gemini.audio namespace.