Skip to content

Home

About Us

Advertisement

Contact Us

  • Facebook
  • X
  • Instagram
  • Pinterest
  • WhatsApp
  • RSS Feed
  • TikTok

TechVilla

Your Trusted Voice Across the World.

Search

Google Opens Access to Gemini 2.5 Native Audio Dialog and Controllable Speech Generation in Preview

Admin Avatar
Admin
June 5, 2025
Google Opens Access to Gemini 2.5 Native Audio Dialog and Controllable Speech Generation in Preview

Google introduced new audio generation capabilities with the Gemini 2.5 models at the Google I/O 2025. The Mountain View-based tech giant is now letting developers and individuals test these features on its platform. The two new capabilities include native audio dialog and controllable text-to-speech (TTS) with Gemini 2.5 Flash preview. While the former can natively generate human-like audio while responding to user prompts, the latter can convert any script into conversational speech. These features are currently not available to developers via application programming interfaces (APIs).

Google Showcases Gemini 2.5 Flash’s Audio Output Capabilities

In a blog post, the tech giant detailed the features of these two audio generation modes, highlighting how developers can use them to build new experiences for people. Currently, native audio dialog can be tried out in Google AI Studio’s stream tab, whereas the TTS feature can be tested in the generate media tab within AI Studio.

Native audio dialog with Gemini 2.5 Flash preview is designed for real-time conversations between a human user and the AI. The user can either type a prompt or speak it, and the AI responds verbally. This process directly generates audio, instead of first generating text and then converting it into speech.

There are several advantages to that as well. It supports affective dialog, which means when Gemini 2.5 Flash responds to the user’s tone of voice, it can recognise the emotion behind the said words. It can understand when the user sounds scared, angry, or surprised and respond accordingly.

Apart from this, the audio generation feature can express emotions when speaking, adopt different accents and linguistic styles, can access tools such as Google Search, and supports more than 24 languages.

Coming to the controllable TTS feature, it offers multi-speaker dialogue generation, can produce emotions and accents while narrating the script, control delivery speed and emphasise pronunciation, and supports the same 24 languages and language mixing.

Google says these capabilities were assessed for potential risks across the development process. The company used both internal mechanisms as well as red teaming to find and fix any vulnerabilities. The company also highlighted that all audio outputs from these models are embedded with SynthID, its watermarking technology.

Featured Articles

  • Samsung’s Upcoming Running Events Reportedly Hint at Galaxy Z Fold 7, Flip 7 and Watch 8 Series Launch Timeline

    Samsung’s Upcoming Running Events Reportedly Hint at Galaxy Z Fold 7, Flip 7 and Watch 8 Series Launch Timeline

    June 14, 2025
  • Poco F7 Design Spotted in Leaked Renders; Battery Specifications Revealed via Flipkart

    Poco F7 Design Spotted in Leaked Renders; Battery Specifications Revealed via Flipkart

    June 14, 2025
  • Neuralink Device Helps Monkey See Something That’s Not There

    Neuralink Device Helps Monkey See Something That’s Not There

    June 14, 2025
  • Power Meets Affordability: Flipkart’s Best Gaming Laptop Deals for June 2025

    Power Meets Affordability: Flipkart’s Best Gaming Laptop Deals for June 2025

    June 14, 2025
  • Samsung Galaxy M36, Galaxy F36 Spotted on Google Play Console; Galaxy M36 Launch Reportedly Teased via Amazon

    Samsung Galaxy M36, Galaxy F36 Spotted on Google Play Console; Galaxy M36 Launch Reportedly Teased via Amazon

    June 14, 2025

Search

Author Details

Jenifer Propets

Lorem ipsum dolor sit amet, adipiscing elit, sed do eiusmod tempor ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.

  • X
  • Instagram
  • TikTok
  • Facebook

Follow Us on

  • Facebook
  • X
  • Instagram
  • VK
  • Pinterest
  • Last.fm
  • TikTok
  • Telegram
  • WhatsApp
  • RSS Feed

Categories

  • Tech (2,001)

Archives

  • June 2025 (212)
  • May 2025 (471)
  • April 2025 (424)
  • March 2025 (442)
  • February 2025 (371)
  • January 2025 (81)

Tags

About Us

Jetnews Magazine

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat.

Latest Articles

  • Samsung’s Upcoming Running Events Reportedly Hint at Galaxy Z Fold 7, Flip 7 and Watch 8 Series Launch Timeline

    Samsung’s Upcoming Running Events Reportedly Hint at Galaxy Z Fold 7, Flip 7 and Watch 8 Series Launch Timeline

    June 14, 2025
  • Poco F7 Design Spotted in Leaked Renders; Battery Specifications Revealed via Flipkart

    Poco F7 Design Spotted in Leaked Renders; Battery Specifications Revealed via Flipkart

    June 14, 2025
  • Neuralink Device Helps Monkey See Something That’s Not There

    Neuralink Device Helps Monkey See Something That’s Not There

    June 14, 2025

Categories

  • Tech (2,001)
  • Instagram
  • Facebook
  • LinkedIn
  • X
  • VK
  • TikTok

Proudly Powered by WordPress | JetNews Magazine by CozyThemes.

Scroll to Top