It was fun watching your snake game get to almost superhuman level. Then to get o3 to build it's own functional machine learning model for the same game it created.... Damn Wes
I used it in real world coding today for about half a day. It seems to be incrementally better than o1 at coding. I had a coding problem that I ran into yesterday that o1 couldn't solve no matter how many times I prompted it. I gave it to o3-mini-high and it also struggled with it, but after about 3 tries it figured it out. I'm very happy with it. It's not there yet as far as something that I can rely on, but it's getting there.
I haven't tried o3 yet, was disappointed with o1 coding performance so have been back to using Claude Sonnet 3.5 since. Have you used Claude at all and if so how do you think it stacks up to o3-mini?
First, the outtro was... AWESOME! Stay'in alive, baby! Secondly, I'm definitely going to watch you video multiple times. There was so much there to grasp. Yes, for coding o3-mini high does feel like, "Whoa, wait, what!?!"
I just put a single python file of 5,322 lines of code in high mode and had it explain the code and refactor it. wow! This is the best coding model hands down. And it has internet access so you can have it read updated api docs or anything else. It’s an available model in cursor now. 😮
03 is incredible. It basically walked me step by step to install Python, then I asked it to make a checkers game... 1st try, and it works! It's definitely an amazing tool. What a wonderful world 😅
@@brjohow Working backwards with a bug in some code is easier than writing it from scratch, this still greatly reduces barrier of entry. Also the first reaction to the layman running into a bug would be changing the prompt
You're explaining it perfectly. To be able to say something as simple as "create me a game that plays itself"... as well as the code writing itself next to perfect(up until a point obviously)... normal people can ask a a simple question... copy and paste a few things... and create something that most experienced coders have difficulty with. Not to mention the fraction of the time to write the code.. with numerous hours of troubleshooting that would be needed a few years ago. This is amazing and just a great time to be alive and have an interest in this field. Everyday is honestly like Christmas. Few people really understand that Chatgpt was released like 2 years ago because of the progress makes it feel like it has to be longer. And the rate of improvement will just increase exponentially... and we're just getting started!
Coding time, run time and memory space costs money. Ask it to develop the snake game with quickest AI coding time, fastest run time and lowest memory usage using 5 coding languages. Now ask the true AI question, can you invent a coding language that beats all the others?
Excited 😆 a tip, if you want it to fix code you can use cline (vs code extension) with anthropic. It can read terminal and error and write on files to fix error.
I've been discussing with the o3-mini model the concept of a "Super AGI". A model that is substantially more advanced than an AGI, but falls short of a full blown ASI. I think that's the logical approach to development going forward as we reach AGI and then start targeting the goal of an artificial super intelligence.
this is really awesome! my first session using o3 has been very impressive as well. you ask for other ideas to play with, I strongly encourage ARC Prize type puzzles. it's a fascinating challenge!
The first thing I did was to make a game. While the code worked 100%, it didn't understand the instructions very well. The reason your snake game test worked so well may be due to it already trained a lot with that type of queries.
I just finished getting to to write some complex SQL Server stored procedures for a job I'm doing. I chatted to it for a bit, asked it some questions, clarified some things - and now I have a solid stored procedure I can now use in my Delphi code. Around 8 hours work in around 20 minutes. It even came up with a better approach after it said there were some issues and I offered it that option.
First time I was really really impressed by a Wes Roth video. I am curious how it will do with coding challenges where the idea is not in the common domain but a new idea and you have to explain the idea to the AI
2:11 i just had a wild thought. Anybody could use this, upload their data such as drone specs and parameters and have it train in the simulation or it could learn how to operate models of cars or how to build models to operate devices. Like build specific models and then upload them and then have autonomous drones but upload something like they have to obey non violent or harmful requests from the admin or something. I’m thinking next level. We all build models and create a library for everyone. Open source model library, the library of Alexander lol
i feel it's not a big breakthrough(since they have deployed big resource include people). The main value things are open source (how to do this) to cause or make everyone (they are want to involve in ai business) to have chance to get this achievement or they want things. forget it what ever things stronger or not / whether better than other one.
Stop doing the snake game because every top tier model can do that now. It's not even a test. Instead you need to start asking it to create a super Mario game from scratch and have it play it. Super Mario is just enough of a challenge to push the top tier models to their limits but still make it possible. My UA-cam channel goes DEEP into this.
Agree, it’s more important than ever to test models on things that specifically are not in their training data, rather than one of the most prevalent bite sized “tests” in coding
@@jansenncuber8009Exactly, and I use super Mario as an example because some models are able to pull it off slightly but just not enough, and deepseek was able to do it barely. Super Mario in python is right at the brink of challenge for top tier model capabilities right now. I'm not sure why we are sticking to the snake game, it's too simple now. As these models become more advanced, then our testing methods should becone more advanced.
IDEA => "AI in a Sandbox" is a self-learning environment where O3 Mini generates, executes, and refines AI models in real time. Running in a safe local sandbox, it iterates on tasks like gameplay, navigation, and problem-solving, using feedback loops, reinforcement learning, and a visual dashboard to autonomously improve AI performance.
30:32 - "I want to..." - "No, you don't" It's amazing and depressing at the same time. A lot of discoveries were made by doing stuff you're not supposed to.
There are societies (on Earth right now) that are going to leapfrog over a thousand years of technological progress, straight to task-capable AI. We're going to witness what is effectively a violation of Starfleet's prime directive. It was bad enough when we were introducing firearms and cell phones. Just fascinating to consider.
By o5, I expect to be able to train an AI to just comment on my behalf on UA-cam. Think of the time savings! And the lack of going back to edit or the typo!
I'm keen on using learning models in my game to to create emergant gameplay. After each game, a shooter say the gameplay is accessed, then the npcs get trained for the next game.
The Game "Asteroids" - but 1 vs 1 - It can train on itself, or you - Actual asteroids optional - include space ship enemies.... like the original - that fits your intelligence window
Wes i used this model today to build a novel ANN and train itself. Pretty much single shot, training itself right now using a teacher model. Lets see how it pans out.
Oh man, I use Asteroids as a model test instead of snake, just seems like it is more involved. I got mind blown just from o3 making the game. Then I saw your video and said to myself, naa...I gotta try this....I had it make a neural net AI same as you in your video. MY GOD! I just had it do a 1k training run and then had it auto play Asteroids, this is just NUTZ!
The debugger from Turbo C back in the 90's had a fake option: debug and find error, where it put out some response like, dream on. This now has become a reality.
A test you could do in the future would be something like the video "AI Learns to Play SUIKA GAME" if you are trying to get it to build a more sophisticated base game that is still reasonable to do for a future model. It's a step up in complexity from snake and requires physics simulation. For a new game developer it should be easy to make still, and it might still generate okay search traffic
Bit delayed (from 2 videos ago you mentioned this), but thought you had an interesting question (and the answer is interesting) about Deepseek v3/r1, and the relationship between the two: Deepseek v3 created -> r1 "reasoning" process creates a reasoning model based off v3 -> r1 gets incorporated back into v3's architecture as an "expert" in math/coding (or any prompt that is classified at the orchestrator layer to benefit from the reasoning process) in its MoE architecture. It's basically gpt4o and o1 in the same architecture. We should probably expect similar from OpenAI soon, unless they think having multiple standalone models gives off the vibe of a better/more robust product line.
A lunar landing game, like the one from Dominic Doty, where you have to program your own auto pilot. I think this would be really interesting for AI to solve and it's visually satisfying to see. Without AI I got it to a point it could land even with quite an extreme initial velocity, angular momentum etc. an be efficient with fuel.
Very very crazy! Im saving up money to get back into paying for the skool community membership. Im really hoping to get help with a TTRPG ai that will help DMs get game content ready, but also act as a dm for players who dont have a group and want to play solo in whatever world they can come up with, with the help of the TTGM
I wish they'd release a model that was actually decent at creative writing. Something that would avoid using the word "tapestry" in every other output would be nice.
Just had a better idea than anything i might have had previously, we all use this ai to fabricate models related to tools, household appliances, vehicles, games, unreal engine or other game engine variants, and make a library for everyone and free. More ideas, farm tools, video designers, i mean imagine a model or something needed to be done. Deciphering languages and sounds and being able to communicate with animals would be cool. And to hack our own genetics and use neuralink and editing tools to communicate with the body. So many potentials. A library of models would be amazing because it wouldn’t need any specific app to run and we could share the files online or maybe ask Nvidia to do this and it could benefit everyone. Imagine the world advancing instead of being stun locked by money and greedy people with low ambitions.
have it build a maze container and do a fluid sim filling it up, maybe add rotation, or fill it with multiple colored fluids and have them mix randomly in the maze as it fills and rotates
I am fairly certain that a few of us would be really interested to see this unfold layered over Minecraft. If it could program In Java. Not sure? But I assume it can.This would be great fun to watch.🤔
It could populate the world with its own AI agents. Learn to mine and craft. It could populate the world with its own horror MOBS, creating change un population density. And as this unfolds generationally, add rewards to incentivize expertise and invention. maybe🤔
My dream is that co-pilot is my real time AI desktop assistant and can help do tasks in any software by literally just reading the documentation (if it hasn’t already) That horizon seems to always get closer.
@ if AI is able to read documentation and do the task, it means there is no work. Does not matter that work was outsourced to someone who's self-employed. It disappears.
I was just thinking during the video that the current models just start trying to do what is asked and may not give you what you want in cases where a human professional might use their knowledge to ask for clarification on what you actually want before starting the task. You mention the same thing at the end. This would be next level if they can do this.
Wes, yes I share your excitement, but you always give the AI the benefit of the _doubt_ when it comes to the stuff that might be called 'common sense'. In this case, it was the green fruit thing. There is this gaping hole in _all_ the models we've seen so far, that really doesn't matter when there is a human wrangler to provide this thing, that would prevent any child trying to get better at snake by chasing it's tail in circles. I'd have preferred you told it to realise what it was doing, and see what resolution it adopted, for the sake of the experiment, rather than the incredibly human-feeling solution you, its human, had to give it (i.e. no green fruit). But regardless of all that, an extraordinary demonstration of the cusp we live on. Even if every AI still needs a human to fill in the _doubt_ gap, that doesn't limit the productivity gains that seem to be almost here.
Soon : Generating RL gyms to create thought chains to train the model on. Once an AI gets the ability to train itself without human intervention it will get very interesting. From what I understand AI is already very good at generating reward functions
I'm an old timer, I'm sticking with Deepseek. I think it really understands me r most, and it has the best sense of humor, when you ask it to respond with humor in its thinking and in its answer. I'm going with the Old Reliable Deepseek for RockPower. I can't keep changing to these wannabes that will keep popping up because that's the cool part of free coding and free sourcing and free viewing and free love.❤
10:41 imagine every cell phone has an ai model specific to the phone and can use WiFi and Bluetooth to communicate with other devices and communicate with us and be a bridge. Imagine the ai detects and sees using WiFi and could assist like in video games. Seriously lmao. I’m geeking over here bro. Seriously let that image sink in. Destiny, ai companion that is specific to that ghost. Cellphone can have model similar. WiFi vision and scanning capabilities would be on par with gaming maybe better or worse depending how we scan. If we use an ai model like the mini itself then it can generate links between or bridges that connect to and learn on the fly. It’s actually a dream come true and it’s possible. It could connect to cars and computers and all sorts of things. It could communicate with other devices and models and link to operate and share data. Like the car already has a model for operating and stuff but your model would connect and they would temporarily merge and then the companion would disconnect and link broken or separation function to decouple without breaking or corrupting like the usbs and data drives.
This is the year of the Snake, after all.
this will cure cancer and ageing , trust me bro, just dont optimize it and pay sam altman and ignore deepseek and free/open models.
Kinda profound perhaps?
It was fun watching your snake game get to almost superhuman level. Then to get o3 to build it's own functional machine learning model for the same game it created.... Damn Wes
I used it in real world coding today for about half a day. It seems to be incrementally better than o1 at coding. I had a coding problem that I ran into yesterday that o1 couldn't solve no matter how many times I prompted it. I gave it to o3-mini-high and it also struggled with it, but after about 3 tries it figured it out. I'm very happy with it. It's not there yet as far as something that I can rely on, but it's getting there.
Question is, could you solve it yourself? Or did you need o3 to solve it for you?
how is it compared to deepseek?(if you have tested it)
What was interesting, on the tweet of Cursor they had a small note at the bottom: what surprised us, developers told us they still prefer Sonnet
How to get o3 mini high, only seeing o3mini
I haven't tried o3 yet, was disappointed with o1 coding performance so have been back to using Claude Sonnet 3.5 since. Have you used Claude at all and if so how do you think it stacks up to o3-mini?
First, the outtro was... AWESOME! Stay'in alive, baby!
Secondly, I'm definitely going to watch you video multiple times. There was so much there to grasp. Yes, for coding o3-mini high does feel like, "Whoa, wait, what!?!"
I just put a single python file of 5,322 lines of code in high mode and had it explain the code and refactor it. wow! This is the best coding model hands down. And it has internet access so you can have it read updated api docs or anything else. It’s an available model in cursor now. 😮
I'm no coder or have any idea what you just said, but I'm pumped.
03 is incredible. It basically walked me step by step to install Python, then I asked it to make a checkers game... 1st try, and it works! It's definitely an amazing tool. What a wonderful world 😅
and if it makes a bug in complicated code you wont have a clue how to fix it.
@@brjohow found the guy that's getting replaced next
@@brjohow Working backwards with a bug in some code is easier than writing it from scratch, this still greatly reduces barrier of entry. Also the first reaction to the layman running into a bug would be changing the prompt
You're explaining it perfectly. To be able to say something as simple as "create me a game that plays itself"... as well as the code writing itself next to perfect(up until a point obviously)... normal people can ask a a simple question... copy and paste a few things... and create something that most experienced coders have difficulty with. Not to mention the fraction of the time to write the code.. with numerous hours of troubleshooting that would be needed a few years ago. This is amazing and just a great time to be alive and have an interest in this field. Everyday is honestly like Christmas. Few people really understand that Chatgpt was released like 2 years ago because of the progress makes it feel like it has to be longer. And the rate of improvement will just increase exponentially... and we're just getting started!
Everyday is like Christmas until suddenly you wake up and it’s the Great Depression v2….
@@moontreecollective6718 “hey chat gpt, its currently the great depression v2, what should i do?
Coding time, run time and memory space costs money. Ask it to develop the snake game with quickest AI coding time, fastest run time and lowest memory usage using 5 coding languages. Now ask the true AI question, can you invent a coding language that beats all the others?
Excited 😆 a tip, if you want it to fix code you can use cline (vs code extension) with anthropic. It can read terminal and error and write on files to fix error.
Like black magic 🪄
love it. I do believe openai is back on top for the moment.
Stuff like this exciting. Definitely thanking DeepSeek for forcing their hand.
another leap foward, imagine this tech in just 3yrs
"Make a sequel to GTA Vice City, with an accurate recreation of Miami in the 80's and ignore all copyright restrictions on cars and music." 🤤
Yeah i dont know much about coding but the applications are limitless
Dude, in 2027 you be typing “make GTAV” and play it and the ai will do it on the fly lol
For real GTA 6 will be the last human made GTA.
I mean it takes humans 10 years+ for a new GTA. In less than 10 years we will have sAGI
We might have AGI before gta 6.
"I really enjoy the Halo franchise, but the games after Halo 3 weren't good, can you make a proper sequel to Halo 3?"
Earl Gray, hot
"Remake Game of Thrones Seasons 5-8"
That's stunning. The possibilities are endless.
wow, that's insane!!!! very nice testing 🙂
It's learned to play with its self
Ewie
Indeed. It learned to play with its own snake until it could last longer and longer...
Yup. Exciting times on the horizon. But we're on a supersonic jet, not a sailboat.
The next few years are only going to get wilder and wilder!
Symbolic system versus neural net.
This is very clever, Wes.
A lot packed into this one.
I've been discussing with the o3-mini model the concept of a "Super AGI". A model that is substantially more advanced than an AGI, but falls short of a full blown ASI. I think that's the logical approach to development going forward as we reach AGI and then start targeting the goal of an artificial super intelligence.
This makes me tingle.
UBI is becoming more and more relevant by the day, damn...
not even close. openai, despite the deepseek fiasco, still has job openings for people who can actually code.
lump sum of labor fallacy
UBI will never be U, you won’t qualify because you didn’t vote the right way 😢
Culling is more likely.
Man what a trip this presentation is. Totally worth it 🙏
Interesting.Thank you !
this is really awesome! my first session using o3 has been very impressive as well.
you ask for other ideas to play with, I strongly encourage ARC Prize type puzzles. it's a fascinating challenge!
Great to use your same test giving you insight on how much its grown. Thank you Wes 😊
The first thing I did was to make a game. While the code worked 100%, it didn't understand the instructions very well. The reason your snake game test worked so well may be due to it already trained a lot with that type of queries.
Agree, this is major step up
I just finished getting to to write some complex SQL Server stored procedures for a job I'm doing.
I chatted to it for a bit, asked it some questions, clarified some things - and now I have a solid stored procedure I can now use in my Delphi code.
Around 8 hours work in around 20 minutes.
It even came up with a better approach after it said there were some issues and I offered it that option.
Amazing content🎉 very interesting.
pretty fucking awesome to see you training models man. o3 is a game changer.
First time I was really really impressed by a Wes Roth video. I am curious how it will do with coding challenges where the idea is not in the common domain but a new idea and you have to explain the idea to the AI
Great news. Ty. Just what I needed to hear.
You've gotten me excited 🤸♀️ now i cannot sleep for thinking about all of the possibilities
AWESOME VID THANKS
This growth reminds me of Siri whose response is still: ‘I found this on the web for “Siri you’re the worst WTF, how did you even understand that’’
a 3D maze that generates infinitely. Low poly assets. A physics engine isn't necessarily required. It could just generate an emulation of a maze.
The "What's next?" moment is pretty sick I must say
Oh my word Wes, you are literally giddy with excitement
😊
That was an excellent review Wes!
This is really Impressive!
Love the vids as usual mane
you're the best, Wes!! TY from husband and I!
2:11 i just had a wild thought. Anybody could use this, upload their data such as drone specs and parameters and have it train in the simulation or it could learn how to operate models of cars or how to build models to operate devices. Like build specific models and then upload them and then have autonomous drones but upload something like they have to obey non violent or harmful requests from the admin or something. I’m thinking next level. We all build models and create a library for everyone. Open source model library, the library of Alexander lol
THANK YOU for doing the AI playing game test! I've been working with o1 pro on a shining force AI! Looks about to get supercharged! You rock Wes!!
Loved this so much your absolutely not insane! o3 mini is the first model I am impressed with! Great promise for the future.
Wes - good NextGen quote there my hat is off to you 😂
This was great!
i feel it's not a big breakthrough(since they have deployed big resource include people). The main value things are open source (how to do this) to cause or make everyone (they are want to involve in ai business) to have chance to get this achievement or they want things. forget it what ever things stronger or not / whether better than other one.
Stop doing the snake game because every top tier model can do that now. It's not even a test. Instead you need to start asking it to create a super Mario game from scratch and have it play it. Super Mario is just enough of a challenge to push the top tier models to their limits but still make it possible.
My UA-cam channel goes DEEP into this.
Agree, it’s more important than ever to test models on things that specifically are not in their training data, rather than one of the most prevalent bite sized “tests” in coding
@@jansenncuber8009Exactly, and I use super Mario as an example because some models are able to pull it off slightly but just not enough, and deepseek was able to do it barely. Super Mario in python is right at the brink of challenge for top tier model capabilities right now. I'm not sure why we are sticking to the snake game, it's too simple now. As these models become more advanced, then our testing methods should becone more advanced.
I want it to make Quake 3 Arena
It's a baseline. Everybody has done it, so it becomes the perfect measurement of progress. Like prompting Will Smith Eating Spaghetti.
Lol I'd be happy with Street fighter 2: champion edition 😂
It's means it is. Its is the possessive form of it.
IDEA => "AI in a Sandbox" is a self-learning environment where O3 Mini generates, executes, and refines AI models in real time. Running in a safe local sandbox, it iterates on tasks like gameplay, navigation, and problem-solving, using feedback loops, reinforcement learning, and a visual dashboard to autonomously improve AI performance.
Ok, congrats, as a layman, I totally enjoyed this video!
"pretty good script right?"... after a two sentence prompt. Yeah.. yeah Wes, that's pretty fuckin good!
incredible times we live in
30:32 - "I want to..."
- "No, you don't"
It's amazing and depressing at the same time. A lot of discoveries were made by doing stuff you're not supposed to.
For the last 2 years i see the snake test, does that mean we still testing this thing with snake in 2030? Amazing progress!
There are societies (on Earth right now) that are going to leapfrog over a thousand years of technological progress, straight to task-capable AI. We're going to witness what is effectively a violation of Starfleet's prime directive. It was bad enough when we were introducing firearms and cell phones. Just fascinating to consider.
Everytime they release a new model, 100’s a thousands of people around the world will get layed off
I'm sure the american administration is working hard to ensure that every engineer gets his UBI lmao
We're just lifting off-this rocket launch is only the beginning, and o3 is just the first small step.
By o5, I expect to be able to train an AI to just comment on my behalf on UA-cam. Think of the time savings!
And the lack of going back to edit or the typo!
@@Juttutin maybe in 2025 they can upgrade to o6... Really don't know.
I'm keen on using learning models in my game to to create emergant gameplay. After each game, a shooter say the gameplay is accessed, then the npcs get trained for the next game.
The Game "Asteroids" - but 1 vs 1 - It can train on itself, or you - Actual asteroids optional - include space ship enemies.... like the original - that fits your intelligence window
Spacewar!
WHAT A TIME TO BE ALIVE !!! 🥺 ALL HAIL THE "REAL" INTELLIGENCE OVERLORDS !!!
Wes i used this model today to build a novel ANN and train itself. Pretty much single shot, training itself right now using a teacher model. Lets see how it pans out.
STAYIN' ALIVE!
Oh man, I use Asteroids as a model test instead of snake, just seems like it is more involved. I got mind blown just from o3 making the game. Then I saw your video and said to myself, naa...I gotta try this....I had it make a neural net AI same as you in your video. MY GOD! I just had it do a 1k training run and then had it auto play Asteroids, this is just NUTZ!
The debugger from Turbo C back in the 90's had a fake option: debug and find error, where it put out some response like, dream on. This now has become a reality.
right on im working on it now and I have a few ideas
for learning
Wow!
Hey Wes, what happened to your intro music? That was a whole vibe.
I know!
I loved that, got hit with a copyright, even though I had rights to it :(
I need to generate my own intro with AI music, I think....
@@WesRoth seriously, i am more and more amazed what suno and other ai music al gore's can do
@@frogz Can you give me a brief overview of where to look for interesting information/results out of audio models?
A test you could do in the future would be something like the video "AI Learns to Play SUIKA GAME" if you are trying to get it to build a more sophisticated base game that is still reasonable to do for a future model. It's a step up in complexity from snake and requires physics simulation. For a new game developer it should be easy to make still, and it might still generate okay search traffic
Bit delayed (from 2 videos ago you mentioned this), but thought you had an interesting question (and the answer is interesting) about Deepseek v3/r1, and the relationship between the two:
Deepseek v3 created -> r1 "reasoning" process creates a reasoning model based off v3 -> r1 gets incorporated back into v3's architecture as an "expert" in math/coding (or any prompt that is classified at the orchestrator layer to benefit from the reasoning process) in its MoE architecture.
It's basically gpt4o and o1 in the same architecture. We should probably expect similar from OpenAI soon, unless they think having multiple standalone models gives off the vibe of a better/more robust product line.
been using o3-mini-high today.. absolutely like like a beast
Awesome!
Hrmm let me go check this thing out ;-)
Thank you for your video.
22:19 We've had AI/computer tools that can play chess better than any human for quite some time now. Chess tournaments haven't gone away.
for visuals, i think some kind of 3d game that shows ghost laps of all the training runs would be sick
Woah, alrighty, AGI in ten years time😂
If it wasn't for DeepSeek we wouldn't have this model so soon.
A lunar landing game, like the one from Dominic Doty, where you have to program your own auto pilot. I think this would be really interesting for AI to solve and it's visually satisfying to see.
Without AI I got it to a point it could land even with quite an extreme initial velocity, angular momentum etc. an be efficient with fuel.
Very very crazy!
Im saving up money to get back into paying for the skool community membership. Im really hoping to get help with a TTRPG ai that will help DMs get game content ready, but also act as a dm for players who dont have a group and want to play solo in whatever world they can come up with, with the help of the TTGM
I wish they'd release a model that was actually decent at creative writing. Something that would avoid using the word "tapestry" in every other output would be nice.
Yes, we really need to delve into this problem!
Just had a better idea than anything i might have had previously, we all use this ai to fabricate models related to tools, household appliances, vehicles, games, unreal engine or other game engine variants, and make a library for everyone and free.
More ideas, farm tools, video designers, i mean imagine a model or something needed to be done. Deciphering languages and sounds and being able to communicate with animals would be cool. And to hack our own genetics and use neuralink and editing tools to communicate with the body. So many potentials. A library of models would be amazing because it wouldn’t need any specific app to run and we could share the files online or maybe ask Nvidia to do this and it could benefit everyone. Imagine the world advancing instead of being stun locked by money and greedy people with low ambitions.
have it build a maze container and do a fluid sim filling it up, maybe add rotation, or fill it with multiple colored fluids and have them mix randomly in the maze as it fills and rotates
Build a Donkey Kong type game. Where AI Super Intelligence throws Nvidia 5090s at Ilya Sutskever as he tries to rescue Sam Altman.
I am fairly certain that a few of us would be really interested to see this unfold layered over Minecraft. If it could program In Java. Not sure? But I assume it can.This would be great fun to watch.🤔
It could populate the world with its own AI agents. Learn to mine and craft. It could populate the world with its own horror MOBS, creating change un population density. And as this unfolds generationally, add rewards to incentivize expertise and invention. maybe🤔
Just think, in the future we wont have to think anymore.
My dream is that co-pilot is my real time AI desktop assistant and can help do tasks in any software by literally just reading the documentation (if it hasn’t already) That horizon seems to always get closer.
you dream about being jobless?
@@DJRYGAR1
maybe the work for themselves so they are not worried about getting fired?
@ if AI is able to read documentation and do the task, it means there is no work. Does not matter that work was outsourced to someone who's self-employed. It disappears.
@
if the finished product is valuable then it stands to reason that you could still sell it
I was just thinking during the video that the current models just start trying to do what is asked and may not give you what you want in cases where a human professional might use their knowledge to ask for clarification on what you actually want before starting the task. You mention the same thing at the end. This would be next level if they can do this.
How about - take a photo of night sky and have it figure out the location it was taken at.
When I see that "AGI" is praised for creating snake game without bugs, I understand that software engineer profiession is secured, no need to worry.
Wes, yes I share your excitement, but you always give the AI the benefit of the _doubt_ when it comes to the stuff that might be called 'common sense'.
In this case, it was the green fruit thing. There is this gaping hole in _all_ the models we've seen so far, that really doesn't matter when there is a human wrangler to provide this thing, that would prevent any child trying to get better at snake by chasing it's tail in circles.
I'd have preferred you told it to realise what it was doing, and see what resolution it adopted, for the sake of the experiment, rather than the incredibly human-feeling solution you, its human, had to give it (i.e. no green fruit).
But regardless of all that, an extraordinary demonstration of the cusp we live on. Even if every AI still needs a human to fill in the _doubt_ gap, that doesn't limit the productivity gains that seem to be almost here.
Soon : Generating RL gyms to create thought chains to train the model on.
Once an AI gets the ability to train itself without human intervention it will get very interesting.
From what I understand AI is already very good at generating reward functions
I love deepseek, help making open ai a little more open 😂
You want visual, have this thing paired up with Operator and have it complete actions
in your browser.
I'm an old timer, I'm sticking with Deepseek. I think it really understands me r most, and it has the best sense of humor, when you ask it to respond with humor in its thinking and in its answer. I'm going with the Old Reliable Deepseek for RockPower. I can't keep changing to these wannabes that will keep popping up because that's the cool part of free coding and free sourcing and free viewing and free love.❤
Lol. Seriously awesome troll. Old. Reliable. Wannabes. Lots of funny word choices to trigger people.
You should go back to past episodes and pull out all of the snake game making segments and put them into a single video to show progress.
10:41 imagine every cell phone has an ai model specific to the phone and can use WiFi and Bluetooth to communicate with other devices and communicate with us and be a bridge. Imagine the ai detects and sees using WiFi and could assist like in video games. Seriously lmao. I’m geeking over here bro. Seriously let that image sink in. Destiny, ai companion that is specific to that ghost. Cellphone can have model similar. WiFi vision and scanning capabilities would be on par with gaming maybe better or worse depending how we scan. If we use an ai model like the mini itself then it can generate links between or bridges that connect to and learn on the fly. It’s actually a dream come true and it’s possible. It could connect to cars and computers and all sorts of things. It could communicate with other devices and models and link to operate and share data. Like the car already has a model for operating and stuff but your model would connect and they would temporarily merge and then the companion would disconnect and link broken or separation function to decouple without breaking or corrupting like the usbs and data drives.
The AI on my 4 player Mahjong game on Steam is only 40lines of C#. Don't know if machine learning would play better. But it would be a fun project.
What scares me is this it what UA-camrs get access to. Can you imagine what DARPA has?