Introducing Project Astra. We created a demo where a tester interacts with prototype AI agents supported by our multi-modal core model, Gemini. There are two continuous shots: one of the prototype running on a Google Pixel phone and another of prototype glasses. The agent accepts a constant stream of audio and video input. It can reason about its environment in real time and interact with the tester in a conversation about what it sees. Learn more about Project Astra: #GoogleIO2024 Watch the entire Google I/O 2024 keynote: To watch this keynote with American Sign Language (ASL) translation, please click here: #GoogleIO Subscribe to our channel: Find us on X: Watch us on TikTok: Follow us on Instagram:…

By admin

37 thoughts on “Project Astra: Our vision for the future of AI assistants”
  1. This is great, but why does Gemini not understand my prompts correctly, so I have to repeat them several times. Also why does Gemini not have access to read my texts in Google messages. Maybe fixing these basics should make AI more appealing and usable?

  2. It's cool but I'm worried with AI assistance people will now become even more stupid and reliant on tech. Rather than figuring the code or schematic out ourselves we will just ask AI… that's not good for our brains

  3. Why I detest Google:

    None of my Google Home products work properly any more.

    Can't even make a Meet video call to my 80 year old father today because like so many other broken features on Nest Hub Max things just don't work properly..

    e.g. CAN NO LONGER RECEIVE BROADCAST MESSAGES, TEXT ONLY. CANNOT ACTIVATE NEST CAM ON NEST HUB MAX. The list goes on.

    Their support is joke, they just pass you around between operatives before closing your case.

    They massively cut back their workforce to try to catch up with OpenAI but as they said in their own internal memo, they have no moat and that's good.

    I want OpenAI to succeed and Google to fail. They have zero customer loyalty from me for their terrible product maintenance and preference for reskinning everything before making sure it actually works.

    This is why its a hard no for buying any Google products, business or personal ever again.

  4. C'est pas mal mais est-ce que tout Γ§a il fait des ondes et qui porte du mal a les yeux et est-ce que c'est des radiation est-ce que tout Γ§a c'est dangereux pour la santΓ© il faut savoir est-ce que c'est dangereux pour les est-ce que c'est dangereux pour les yeux. !

  5. This is awesome! Improve Gemini & Gemini Advanced of all futures that ChatGPT 4, 4.o, 5 have & will have.
    Put there projet Astra & make it that avaliable for mobile, PC, inteligent glasses with bone structure voice delivering.

    Make it at Gemini Advanced more books etc that Gemini can remeber it & can be more useful. More then 20…make it that remeber conversations like chat GPT to be more useful for user.

    Gemini Advanced still csn talk nonsense & make wrong answers etc, fix that please.

  6. I would really love to see GPS, a speedometer and mini map, end the time in the heads up display glasses. If the camera detects a steering wheel and gauge cluster have that be the only thing that pops up when driving and have no way to disable it. That way it can be safe for people driving without causing distractions like texting or anything. No pop-up messages like texts. Just an alert that you have one and that it has notified the person that you are driving. I would also love to see Google live translate signs to your language of choice. It would be really nice to walk up to a place in a foreign country or even Chinatown and read the signage. It would also be very handy if you could put a beacon in the sky with a line to the ground with distance to your friends. And you can click them and navigate to them so in crowded situations like fairgrounds or even cities you can find each other. Also for people who have trouble remembering names and faces, have a face recognition system that puts the person's name above them even integrate Google maps so you can ask it the nearest subway station or whatnot and get guided directions. As well as things like bringing up menus for restaurants and so on so you can look at a place down the street and say what do they have on the menu and it will bring up the menu. And even integrate a QR code scanner and barcode scanner application so you can look at items and skin their barcodes and do a search on the items nutritional values and so on, this could be so handy and way more handy then ever imagined if we could just integrate those few small things. But also would be nice to have a wired battery to attach and put in your pocket just so you don't have to worry about the glasses stopping halfway through a work day.

Leave a Reply

Your email address will not be published. Required fields are marked *