Skip to main content
Developer’s Playbook: Integrating Lyria 3 Music AI into Your Apps
Gemini

A Developer’s Guide to Lyria 3 & the New Music API

Google has posted another blog explaining how developers can now integrate these world-class music models into their own applications via Google AI Studio and the Gemini API.

For developers, Lyria 3 isn't just a song generator; it's a programmable audio engine with granular controls for tempo, structure, and multimodal inputs.

1. Choosing Your Instrument: Pro vs. Clip

Google is offering two distinct model variants to meet different production needs:

  • Lyria 3 Pro (lyria-3-pro-preview): The heavyweight. Designed for full-length, studio-quality songs (up to 3 minutes). It features high structural awareness, ideal for apps focused on complete musical works.
  • Lyria 3 Clip (lyria-3-clip-preview): The sprinter. Optimized for speed and high-volume requests. It generates 30-second clips, perfect for social media assets, background loops, or rapid prototyping.

2. Granular Controls for Precision Engineering

Lyria 3 introduces a level of control that goes far beyond a simple text prompt:

  • Tempo Conditioning: You can now set a specific tempo (e.g., Fast, Slow, or specific BPM) with high accuracy to ensure the music matches your app’s visual rhythm.
  • Time-Aligned Lyrics: Developers can control exactly when lyrics start and end within a track, allowing for perfectly synchronized karaoke or lyric-video apps.
  • Multimodal Image-to-Music: You can now pass an image into the API to influence the mood and style of the audio, letting the vibe of a photo dictate the genre and atmosphere of the song.

3. The New Music Playground in Google AI Studio

Google has launched a dedicated music generation workspace in AI Studio for experimentation:

  • Text Mode: A simple natural language interface for describing a track.
  • Composer Mode: A section-by-section builder. You can define the timing, intensity, and description for every individual part of the song (Intro, Verse, Bridge, Chorus).
  • Direct Integration: Using a paid API key, you can immediately export your experiments into your codebase.

4. Real-World API Demos

To jumpstart development, Google shared a few Cookbook examples of what's possible:

  • AI Alarm Clock: An app that generates a fresh song every morning, with lyrics that actually mention your local weather and calendar events for the day.
  • Video-to-Soundtrack: A demo that uses Gemini 3 Flash to analyze a video and then calls Lyria to compose a synchronized background track that fits the scene's emotional arc.

5. Trust & Transparency: SynthID Built-In

Every track generated via the API includes SynthID digital watermarking. This imperceptible watermark remains even if the audio is compressed or modified, allowing developers to ensure transparency and build trust within their own user communities.

The API is the stage, and Lyria 3 is the orchestra. With granular tempo control and multimodal inputs now in public preview, the next hit song might just be written by your application.