Gemini 1.5 Pro is here! Get ready to experience the next level of audio recognition


Readers help support Windows Report. We may get a commission if you buy through our links.

Tooltip Icon

Read our disclosure page to find out how can you help Windows Report sustain the editorial team Read more

All major tech giants are working on AI models, and speaking of which, it seems that Google has released a new version of Gemini.

The Gemini 1.5 Pro has been released, and it offers some interesting features, so let’s dive into it and see what’s new.

Gemini 1.5 Pro is here, comes with an audio recognition feature

Google has recently updated its AI model, and Gemini 1.5 Pro is available in more than 180 countries through Google AI Studio’s public preview, as MSPowerUser writes.

Gemini now has a 1 million context window that allows developers to better analyze and understand information.

That’s not all, this version also has an audio recognition feature, so it can process spoken language. File upload is also supported, so you can upload an audio file and Gemini will analyze it.

Here’s what developers had to say about this feature:

You can upload a recording of a lecture, like 117,000+ token lecture from Jeff Dean, and Gemini 1.5 Pro can turn it into a quiz with an answer key.

The update also brings greater control and functionality to the developers, and there’s also support for system instructions so you can easily specify roles, formats, and goals.

Lastly, there’s JSON mode available that allows structured data extraction from both images and text. cURL is currently supported, and Python SDK support is coming soon, according to developers.

That’s not all from Google, there are also reports that Gemini will bring replay suggestions to Gmail for Android soon, so stay tuned.



More Reading

Post navigation

Leave a Comment

Leave a Reply

Your email address will not be published. Required fields are marked *