Transcribe

Type: gemini.audio.Transcribe

Namespace: gemini.audio

Description

Transcribe audio to text using Google’s Gemini models. google, transcription, speech-to-text, audio, whisper, ai

This node converts audio input into text using Google's multimodal Gemini models.
Supports various audio formats and provides accurate speech-to-text transcription.

Use cases:
- Convert recorded audio to text
- Transcribe podcasts and interviews
- Generate subtitles from audio tracks
- Create meeting notes from audio recordings
- Analyze speech content in audio files

Properties

Property	Type	Description	Default
audio	`audio`	The audio file to transcribe.	`{"type":"audio","uri":"","asset_id":null,"data"...`
model	`enum`	The Gemini model to use for transcription	`gemini-2.5-flash`
prompt	`str`	Instructions for the transcription. You can customize this to request specific formatting or focus.	`Transcribe the following audio accurately. Retu...`

Outputs

Output	Type	Description
output	`str`

Browse other nodes in the gemini.audio namespace.

Edit this page on GitHub

Transcribe

Description

Properties

Outputs

Related Nodes