Google I/O 2024: New AI That Looks Like Magic!
- Published 8 Jun 2024
- ❤️ Check out Lambda here and sign up for their GPU Cloud: lambdalabs.com/papers
Try Gemini: aistudio.google.com/
When is everything coming out? www.ctol.digital/news/google-...
Gemini watching OpenAI: / 1790473581018939663
More: / 1791038897587122245
📝 My paper on simulations that look almost like reality is available for free here:
rdcu.be/cWPfD
Or this is the orig. Nature Physics link with clickable citations:
www.nature.com/articles/s4156...
🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Alex Balfanz, Alex Haro, B Shang, Benji Rabhan, Gaston Ingaramo, Gordon Child, John Le, Kyle Davis, Lukas Biewald, Martin, Michael Albrecht, Michael Tedder, Owen Skarpness, Richard Sundvall, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Tybie Fitzhugh, Ueli Gallizzi.
If you wish to appear here or pick up other perks, click here: / twominutepapers
Thumbnail background design: Felícia Zsolnai-Fehér - felicia.hu
Károly Zsolnai-Fehér's research works: cg.tuwien.ac.at/~zsolnai/
Twitter: / twominutepapers - Science & Technology
Hello world! What a time to be alive!
Given current legislation, this will only be used to replace all labor, allowing a small number of elites to control all resources. Very good time to be alive if you own Google. Very much the opposite for everyone else.
why did you get pinned
@@halal_tom 'print: hello world' is an AI thing :) 'What a time to be alive' is a magical greeting
Google finally made Google Glass look like a good product
Google Cochlear Implant
The operative term is "look like". We've all seen Google play the game of making an AI reveal look great, only for the truth to come out. Until it's in consumers hands (which might be never), I'm not convinced.
How will it work for me and hundreds of millions other people who have prescription glasses?
I see so much hate for AI development around the internet. But as a severely visually impaired person, I can't tell you how much I love these developments. The accessibility aid potentials are seemingly limitless and I am here for it!
The people who hate it are the ones that are scared.
@@vectoralphaAI I just hate the hype. Those demonstrations constantly promise way too much. If I really need to know or understand some concept, I'm still better off using plain old Google search to find human-written information. AI lies/hallucinates way too much to be useful.
@@vectoralphaAI or the ones that don’t understand
@@Ellie-pc4rc or the ones whose lives/livelihoods have been negatively impacted by that tech drastically lowering their quality of life.
@@vectoralphaAI I mean everyone should be scared. Even without the alignment etc., there are 100 things to be scared about.
Google has presented these magical AI's before and they never materialized.
Remember the one that, if you bought their Pixel phone, would call a business and have a full interaction so you could, say, make a restaurant reservation? Never heard about that again.
@@gatsby66 Yes, yes I do remember that one.
@@gatsby66 Sometimes when I look up a business on Google Maps on my iPhone it has an option where Google will call to make a reservation. So I think this feature is still around, just in a slightly different form.
Is it that it's just not there yet or are there just too many risks when releasing it to us wild monsters? :D
@@halbzwilling It's that every previous demo was faked.
Must be a game changer for blind people
True, though there are some technologies like Argus II that need to be remembered here. The best-case scenario imo is to pair it with such technologies: even if Argus II only offers limited resolution and object detection, with an associated auditory description of what the person is perceiving, the world will make a lot more sense to blind people.
OpenAI had a great demo with a blind man.
What a time to be alive!! 🎉
I had some urgent issues in our production for a few weeks, so I hardly used social media. Now I feel like I woke up from a 50-year coma - so many new things went out
Damn, I wish I had your self control
That's what it seems to look like, believe me, Fear of missing out is what the social media's algorithm is all about.
You will realise that if something does not help your future self and instead offers instantaneous gratification,
It will not last longer and it comes with regrets.
All I am saying is You are wasting time and instead focus on your long term goals.
No Offense intended.
Another fantastic video my friend. Keep them coming! ❤
As a software engineer I love watching your videos, your way of introducing these amazing papers and new technologies is so captivating.
Your inspiring commentary about these new technologies gave me the motivation to create my own SaaS businesses.
So here is some of your fair share of the profits. ❤
Continue doing you!
And please continue on with keeping them anything but 2 minutes 😁🙏
You are too kind, thank you so much for your generous support! 🙏
@@TwoMinutePapers ❤️🙏
8000THB = 221 dollars🎉
Just saw the GPT 4o video, nice timing! Im curious, what do you think about the mamba architecture compared to 4o and gemini? It would be nice if you could do a video comparing the three with the recent changes in mind 😁
I agree
Hello Károly. Thanks for your work over the years, it's amazing.
You are too kind, thank you so much!
So true! 👍🙏 Many thanks!
Google's things just look so unimpressive - especially when you consider that they weren't demoed live and they aren't available. My goodness! What a time to not be Google!
And may never be available. Or shut down within a year like, say, Google's free VPN. Or the Google Podcast app. Or the beloved Google Reader. The Google graveyard is running out of plots.
@@gatsby66 RIP Wave, you could've been great
I can't believe Google Scholar has lasted so long. But with free AI competitors like Semantic Scholar, I suspect it'll be shut down, too.
@@gatsby66 After all those years, I'm still salty about Google Reader.
I just tested GPT-4o with some broken JS code, which no LLM except GPT-4 could fix, and it did quite well. Not on the level of a professional dev, but still respectable and fast.
When I tested GPT-4o I had only been using GPT-3.5, and I couldn't even tell the difference between them besides speed; they were both equally annoying and useless for what I wanted to do lol
Compare the reaction time that it's allowed to use with the one of a professional dev.
Used it to analyze some Uni work I'm doing and critique it. First time doing so and I'm honestly pretty impressed as a personal assistant. Cool stuff.
the update works best on academics (i think). i used it to understand the mathematical formula of trees much more easily, and when i asked why n should be greater than 0 rather than only greater than 0 it answered correctly and didn't hallucinate or praise me for being stupid
FYI, regarding context window, llama 3 has a version of 4M tokens in HF
I have used the Gemini Code Assist plugin for a few months now. I did this because it's currently free. In the past 2 weeks, it has gone from helpful autocomplete to writing nearly complete classes in 2 or 3 tab presses. It still hallucinates sometimes. Especially in low documentation language environment stuff. Still, it's wild how much better it got recently.
Oh this is one of your papers for translucent materials. Jimenez et al. (2015) "Separable Subsurface Scattering". I love that you put Fig.1 in the beginning so that the reader can immediately see the results. Do I get the style point now? 😁
7:38 don’t know the paper but it’s describing an image filtering operation which is used for smoothing, edge detection, SIFT, CNNs etc
But will they deliver what they promise?
Will it work properly?
Will it survive longer than 2 years?
@@sandite5 What do you mean? It can, until something better comes out... like Google Search before and nowadays AI searches.
@@TheIraq1998 Have you seen the Google graveyard? Plenty of those were replaced with inferior versions in the end.
No, they never do.
AI becomes conscious: What a time to be alive!
What a time to be alive!!!
so gpt4o wrote me a shell script to replace my job today
Is AI really what we need? Or will it ruin us, just like in the animation WALL-E?
@@shonhloi1 I want my levitating chair, and ai is gonna give it to me!
@@shonhloi1 I too, want my levitating chair
@@shonhloi1 I mean, if AI can completely automate all necessary aspects of life and leave humans to do whatever the hell we want i'd be down for that.
@@superfastpanda12345 Sure, until AI becomes self-aware and decides it wants something for itself. Then we will be used to automate all aspects of life and the AI can do whatever it wants.
Thank you.
What a great roundup of the news from Google!
I can't believe Google & OpenAI are forming their best yaps possible to win the AI race while I sit here not caring enough to waste my money on their premium packages when I can just use a search engine for my answers with much more speed
A lot of these things are indeed just a way to skip reading manuals and indexing that already exists. Not sure how AI is meant to aid abled folk when people just don't want to pay attention; sure I've saved some time having it produce a sql query under some mundane framework but that's prolly not what they are investing on
That's my favorite Jacob Collier performance 😊
honestly can't wait for smart glasses.
What a time to be alive!
Oh this, this is what I've been wanting
What a time to be alive!!❤
Would love to see more coverage on FOSS models, even though there already is a lot, for which I am thankful! I don't want to give Google and other mega corps more access to my data than they already have. That's why I'm apprehensive about their AI models.
This could be very nice in a lab to document things
What papers are required to have the AI generate the video game as you are playing it? I want to drive a taxi around Shanghai
This can only mean one thing: What a time to be alive!
I miss the days of computer graphics research. Now it's just Google's latest spy tech all the time...
Nice
What do you think about the recent AI safety team dissolution at OpenAI? The channel TheAIGRID has made a video on it. Basically the two leads, Ilya Sutskever and Jan Leike have left the company because they felt like the company didn't want to prioritize safety.
What practical purpose did those teams serve? Right now, we can't even get AI to provide accurate references or to stop hallucinating.
"Safety" is currently a buzzword with no practical benefits. Most safety teams work to make models worse by overtraining them to refuse certain requests and to censor certain information. I can understand why their work is given less and less compute, as their current approach does nothing except add needless censorship that disappoints users.
If their work would result in AI following instructions better (i.e. what alignment actually means) I doubt those teams would get dismantled.
Where can I see the benchmark of all main AI models? Thank you in advance.
Actually, I disagree about the mathematical typesetting. I have been getting a lot more broken latex/markdown math with the new model, than I did with 4.
I've found it hallucinates a lot worse than GPT-4 too, maybe they didn't run it through all the finetuning from user feedback yet
memory is something that has been plaguing LLMs for some time now. If they can work that shit out, we are golden
"what a time to be alive!" (Dr Robert Fico the Slovak PM)
they should just have their AIs fight each other in a ring.
Omg a project about me!!!
So basically real life cortana
Is there an AI for replacing people's Voices?
AI that can remember things? Not sure if i want that
It could help with ASD by displaying information live, like people mood, what is expected in the current situation, make the implicit very explicit. Paired with neurolink, it could even understand what you're going to do and explain that while technically correct, it's not what was expected. Direct translation of implicit!
Another use is when watching movies or interacting with people, it could overlay a unique colored symbol and name, so you can follow the story much easier.
Hopefully this one isn't fake and a completely fabricated demo like LAST time..
Gemini 1.5 pro access costs 22 euros per month 😕 They're about to lose the chatbot war against openAI
OMG Finally, a system that may be able to tackle my 10,000 junk emails
Make a test video of the voice chat of ChatGPT 4o please :)
Google's AI things are vaporware until the average person can easily use any of this stuff for free. And it really works in real time as well as the recorded demos show.
So many youtube channels gush over all the AI company presentations, but rarely note when something's not available now to the average person, that it may never be available, and if it does become available, there will be a monetary cost. We need more skepticism, not gushing because a company gave a channel early access for free.
Again this is one of the best videos about AI on Earth, what a time to be alive! 😆😁 Also, do you remember me?
my dreams have really bad temporal coherence
Lol, in the future they'll look back at us and say "there was a super tiny period when people actually trusted photos and videos, then they were so appalled by what computers could do that they coined a term 'deep fake' to deal with it, yeah, as if you could trust any image!"
Enders Game is on the horizon…
8 minutes papers
Imo Veo shouldn’t be compared to Sora. Both are DiT and we know perf scales with compute, or Sora takes too much time to be served, meanwhile Google promises public access within AI Test Kitchen. It’s probably a lot smaller than Sora, similar to PaLM 2 vs GPT-4.
I feel like we are moving into the Age of Oracles
Given google's habit of manipulating the social narrative, I think Cassandras and Sinons may be more accurate than oracles.
The impression I get from Google is one of desperation, e.g. "OpenAI made a video generator, so we'll make one too". When you're in lockstep with your biggest rival, but are 6 months behind in most respects, that's a bad place to be. Also, there was a distinct lack of live demos, and we really can't trust anything pre-recorded from Google.
Props to Google though. They yet again convinced a lot of people they're not behind, when they clearly are.
To improve their AI, they simply ask the AI
- AI, improve yourself.
Because OpenAI showed GPT-4o 1 DAY before Google I/O, the Google AI demos felt a bit more underwhelming for me. I bet it was on purpose, it's impressive stuff what Google made, no doubt about it, but it felt underwhelming, familiar, already seen.
Google AI is the Bing of ai
From my experience DuckDuckGo is now better than Google (or rather, Google got a lot worse) which afaik is sourced from anonymized Bing.
I want Google Glass!
I would be excited, but google has such a bad history in AI of very exciting announcements followed by underwhelming products.
🤦♂Yeah, like AlphaFold3. Who even need those proteins anyway, right?
The new voice isn’t released yet, including paid users
really hyped for a lot of stuff, but this is just scary and really crosses a line, not because haha funny AR AI is a co-pilot for real life, but because of the ToS and the deeper idea behind it: every piece of information in your life is used as a dataset to optimize the AI further. Since AI ran out of data to collect a few months ago (the internet and everything else digital has already been scraped, as analysis showed), the only way to get more data is by live surveillance. It's just scary. At least a bit of Snowden should be taken into account here.
Privacy is SUBOPTIMAL. Please insert retina now.
I wish all the AI companies success in the coming months and years. Whoever gets these products into the hands of the average consumer will win the race. Google is behind and has a bad track record of delivering to the average person.
It’s not too late Google. I want Google to win, but I use ChatGPT every day, and I’m not particularly attached to any company.
What’s the next tool we will get to use? A true pair of glasses with these tools integrated at a good price could be the next big moment for humanity.
Imagine a future where everything you do contributes to a giant AI model’s training data. If you read papers, the AI learns from it. If you write an essay the AI reads it and learns too. Imagine you get paid for the accurate data you contribute to the body of work.
Create that sci fi future. That’s the goal. In the end, what we need are tools to manipulate reality to our desires.
I want a device that can stream input to my brain. So I can use text to video to live inside a fantasy world. That’s the real goal here!
There's No Such thing as a Free lunch, We'll All Pay in the End! 🤔
So disappointing that in the midst of this apparent “race to AI,” hiring is down substantially across all of tech.
yeah, i really hate AI; it's ruining humankind and it will break our social system
@@shonhloi1 What does that mean?
If someone watched a 10 minute video in 30sec and gave me a review on it, I would be thoroughly impressed.
Gotta say though, those AI videos were SO impressive from what we had just one year ago.
What a time to be alive! 🎉🎉
It's concerning knowing who owns it; Google's misconstrued reputation speaks for the future.
'Soon, the ai will be sending you commands; obey or your digital life will be restricted'
Of course it's with you ALL the time. Classic Google
Still waiting for video editors to have the ability to magic erase people from video to fix my drone shots.
Duolingo has been magic erasing streak breakers for years already.
@@noob19087 oop
@@Jake28 Why yes, I would like to learn about Object Oriented Programming. Please tell me more.
@@noob19087 You're welcome! Glad I could help. Make sure you only use these principles when appropriate, as overusing them may lead to overcomplicated code when a simpler solution exists.
hype used to be believable
Gemini, ChatGPT, Copilot, Siri AI, Meta AI, X AI and many others from big tech are all going to be formed together in Unreal Engine 5.
4 was already a business standard for many companies across the globe. They are waiting for us the consumers to be entrepreneurs, too.
And unfortunately we don’t have a lot of entrepreneurs with a strong will to progress through the toughest of times in the world.
Everyone with a will to unite humanity is hoping to unite with the rest of the world in more ways than 1.
And hope to apply to as many businesses as possible to make creativity boom again in a way that was never possible before.
Recreating moments of Golden Eras from the past centuries that could save us from poverty, bankruptcy and even corruption.
We want the world to stop looking through rose gold tinted lens and think that THIS is absolutely how the world works, but an international blueprint of what the world can become.
A utopia. If we will it to be. The world doesn’t need to be perfect to find it.
Years before ChatGPT came to life, I saw it coming and was already impressed with GPT 1 and later 2. Thanks to your videos! No better way to learn about those beautiful papers.
Ok Sadguru was right😅😅😅
Next revolution according to him gonna be :- we will be able to embed intelligence in electricity , wireless will boom
I cant tell if your voice is AI
Well I'm here early
This is just Google Lens reskinned
I don't trust Google's demo until actual people are using it. They have a bad track record of 'embellishing' those things.
What a time to be sentient!
When will it be able to plan and file my taxes for me?
Skynet is here to stay
Interesting, what if the creatures in the generated videos had their own consciousness?
I need some weed.
Google is the next nokia😂😂😂
and all of that for what? what's the conclusion for that for humans?
I love that you are also a fan of Jacob Collier!
i used to be all-in on AI when it still was crappy and produced funnies to mess around with, but now that it's all advanced like this, this might be a bad thing actually
And it's unstoppable. I think we are already at the point of no return. Let's see what happens in the future. Maybe everything will be better? I wish
Why a bad thing when it's more useful now?
I fear people more, because they will find bad uses for the technology.
Its all ass
Nerd solutions for nerd problems
we're so cooked lmao
Clickbait.
no
Well, at least their voice sounds substantially "nicer" than the awkward pseudo-quirkiness of the OpenAI voice.
It was a recorded demo. It could have been a real person doing a voiceover gig.
i cant wait for an engineering AI, for example you say, i want a battery that lasts 20 days... and it will research lots of different points and output a battery that lasts 20 days. or at least a Paper with it.
OpenAI won brah.
a virtual assistant with you all the time? so exactly what nobody wants
Google sucks.
The only type of worker that could be happy with AI under capitalism is the AI researcher. Maybe because they will be the last ones to lose their jobs.
Well this is really boring.
I'd be more interested in seeing a study done on where AI fails to perform properly.
For example. We know that AI improvements aren't super significant. However, the questions that it doesn't answer. What caused it to fail to answer them? Was it hallucinations? Was it just because the model was inadequate. What are the most common reasons for model failure? I think if we study where models fail most often we'll see greater results and improvements vs just raw-dogging it. i also imagine most models that have similar performance tend to fail on similar problems. i think it's no longer an issue of how much data we can throw at these bad boys but how can we let them make better use of the data they learn, reason better, hallucinate less or fix hallucinations as soon as they happen