Gemini incorporates an AI capable of analyzing any video, scene by scene, in real time

Deal Score0
Deal Score0

Google continues the extension of the capacities of His assistant Gemini. After the texts, images and documents, it is now possible to import a video into the interface and ask questions about it. This functionality, officially deployed worldwide, is already operational in France.

Advertising, your content continues below

Google Gemini can now analyze your videos

The principle is simple: once your video has been imported (up to 5 minutes in the free version), Gemini milks, extract the relevant elements, and then allows direct interaction. AI includes visual and sound content, identifies objects, people, scenes, and can respond to specific requests of the type: “What does this person say at 1 minute 40?”or “When does the stop sign appear?”

This new capacity is based on Gemini 1.5 flash and 1.5 pro modelsboth being compatible with video files. In free version, use is limited to a single short video, but the demonstration is convincing.

Users of Gemini Advanced (paid formula at € 19.99/month in France via the Google One AI Premium subscription) benefit from an extensive capacity: up to ten files, for a longer cumulative duration and increased analysis.

Currently, Gemini is the only general public assistant capable of directly analyzing an imported video, including both the image, the sound, and by answering specific questions about its content. OPENAI offers Whisper (Audio) and Clip (images), but no integrated tool that accepts complete video files in Chatgpt. This does not require any additional software, everything is done from the web interface or the mobile application.

How to analyze a video with Gemini:

  1. Open the Gemini app. Available on Android, iOS or via gemini.google.com (deployment web version).
  2. Start a new conversation.
  3. Import your video: click on the trombone or button Add a file. Select a short video (max. 5 minutes in free version, formats: MP4, MOV, Webm, etc.).
  4. Ask your questions.
  5. Gemini can indicate timestamps (temporal landmarks), precise descriptions, even key scenes.
Google Gemini

Google Gemini

Long -awaited, Gemini (Google Bard), the conversational agent fueled by the AI ​​developed by Google, is deployed in the form of a free online service and a mobile application for Android and iPhone devices

  • License:
    Free license
  • Author :
    Google
  • Operating systems:
    Online service, Android, iOS iPhone / iPad
  • Category :
    IA
Advertising, your content continues below

More Info

We will be happy to hear your thoughts

Leave a reply

Bonplans French
Logo