Google’s newest AI can watch and analyse an hour of footage for you

SAN JOSE: An hour of video footage, 700,000 words of text or 11 hours of audio material: Google’s latest AI update allows users to get a fast analysis of massive amounts of material, the tech giant has announced.

Google announced in a blog post that the latest update to its Gemini AI, the company’s answer to ChatGPT, was tested out with a search for “comedic moments” in a 400-page transcript of conversations from the Apollo 11 space mission to the moon.

In half a minute, Gemini version 1.5 delivered three instances of humour and could even give context on why a certain phrase was funny.

Proving its ability to understand things in their context, the software responded to an uploaded drawing of a boot by linking this to the moment when Neil Armstrong took the first step on the moon.

In addition to processing text, code and audio, the development could make it possible to look for certain visual elements in large amounts of footage without a person having to watch it.

“When given a 44-minute silent Buster Keaton movie, the model can accurately analyse various plot points and events, and even reason about small details in the movie that could easily be missed,” Google’s AI head Demis Hassabis wrote on Thursday.

Google says Gemini 1.5 Pro can capture and analyse up to one hour of video, up to 11 hours of audio recordings, texts of up to 700,000 words and up to 30,000 lines of software code.

The internet giant is competing with ChatGPT inventor OpenAI, which triggered global hype surrounding AI just over a year ago.

Earlier in February Google rebranded its AI apps and services under the name Gemini. The Gemini 1.5 model will initially be available to developers and corporate customers before it is rolled out to all users. – dpa

Tagged