Training a Neural Network to operate drones using Genetic Algorithm

Pezzza's Work

7 500

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 10 лют 2025
After my first try with flappy I wanted to see how would a genetic algorithm handle more complex situations.
Github github.com/joh...
Music used
freepd.com/mus...
freepd.com/mus...
freepd.com/mus...

КОМЕНТАРІ • 389

@Alayric 4 роки тому ⁺³⁰⁴
Good idea, and I like your smoke!
@PezzzasWork 4 роки тому ⁺¹¹⁹
Thanks! I think smoke is where I spent the most time :D
@mendelovitch 3 роки тому ⁺⁴²
@@PezzzasWork Why do we get hung up on those small sidequests?
@I_SEE_RED 2 роки тому ⁺¹⁷
@@mendelovitch it’s an easy way to procrastinate the main problem
@anujanshu2917 Рік тому
How or where you stimulate this in unity or special software
@katzen3314 3 роки тому ⁺³⁸⁹
I love how they seem to move so organically even though it seems like a relatively simple model. I bet there's some really interesting optimisation problems and extra restrictions you could throw at this.
@katzen3314 3 роки тому ⁺²⁷
Also thanks for uploading the demo and source code, very fun to play around with!
@NetHacker100 3 роки тому ⁺⁵¹⁰
I think that the need to center themselves perfectly with the sphere is what makes them not become speed machines. Because when they reach the target they always gotta somehow "dock". And that requires their inertia to be 0 when they reach that point so they have to slow down. If somehow this was changed by making the drones to just need to touch the point at any part and maybe making the orb bigger I would certainly expect that there would be more speedy manoeuvres to just arrive at the target and pass through it. Perhaps even in an elliptical patrolling. Would be certainly interesting to see.
@00swinter21 3 роки тому ⁺⁸
im currently working on the same thing but with more inputs;
I will try ours too;
@eliaswenner7847 3 роки тому ⁺⁴
@@00swinter21 Don't forget to post the result on your UA-cam channel !
@feffy380 3 роки тому ⁺¹⁹
Exactly my thoughts. It looks like the target requires pixel perfect precision to count as a success. Careful approach is the only way when the targeting criteria are so unnecessarily strict.
@christopheroldfield1066 3 роки тому ⁺⁶
@@Wock__ I believe you are right. On one of their videos, there is an actual clock face that counts down on top of the target, like a circular loading bar.
@UnitSe7en 3 роки тому ⁺¹²
The goal is to dock, not to touch the target. Changing the goals to achieve a better outcome does not mean that your model improved. Making them just have to touch the target so they could go really fast does not mean that they are suddenly better. Your thinking is flawed.
@blmppes9876 4 роки тому ⁺¹⁶⁵
5:28, gen 900: Ok, you guys are too good and I'm tired now. Bye!!!
@NanoCubeOG 3 роки тому
true
@tuna3977 3 роки тому ⁺¹
"I have to go now, my planet needs me"
@Furebel 3 роки тому ⁺²¹³
I'd love to see a game where your enemies are all neural network trained AI, and the higher the difficulty, the more trained AI variant you will have to face
@ChunkyWaterisReal 3 роки тому ⁺¹⁴
Give it 10 years
@kirtil5177 3 роки тому ⁺⁴²
imagine if the AI is being trained while you play. The better you play the less hard the ai is, but if you slow down the difficulty increases
@marfitrblx 3 роки тому ⁺¹⁵
@@ChunkyWaterisReal it's already possible now lol
@ChunkyWaterisReal 3 роки тому ⁺¹
@@marfitrblx AI has been shit since the 64 hush yourself.
@keyboardegg931 3 роки тому ⁺³
Or even the player being an AI - I can totally see a 2D game with your cursor being the target point, and the more you play/the more enemies you defeat/etc. the smarter your character gets
@phantuananh2163 3 роки тому ⁺²⁶
This channel is a gem
@dan_obie 3 роки тому ⁺⁸⁶
Would be really interesting to add fuel consumption to the mix and watch them optimize their fuel economy
@dazcarrr 2 роки тому ⁺¹¹
and give them more fuel for every target they reach as more reward for doing that
@markoftheland3115 4 роки тому ⁺⁹⁷
Very cool stuff, well done!
Now make them go through an obstacle course 😁
@PezzzasWork 4 роки тому ⁺⁵³
I am working on it ;)
@marc_frank 3 роки тому ⁺⁶
a combination of the ants finding the optimal path and then the drones following that? :)
@Vofr 2 роки тому ⁺⁵
@@PezzzasWork where's the video 🗿
@nandorboda8049 Місяць тому
These vids are awsome and very inspiring! It's amazing how a neuron network can adapt for a specific task!!!
@osman4172 3 роки тому ⁺²
Great work. I think many people would appreciate seeing background of the work.
@raffimolero64 3 роки тому ⁺²
love this channel. what separates this guy from others is his consistent ability to make his sims look cool.
@the0neskater 2 роки тому ⁺¹
This is one of the coolest projects I've ever seen. Would be awesome to extend to add walls and an environment! Great work.
@youssefelshahawy8080 3 роки тому ⁺³
This is one of the coolest implementations i've seen. Nj!
@SongStudios 3 роки тому ⁺¹
Dude I love it when they get sooo roofless! So fun to watch!
@GG64du02 3 роки тому ⁺²⁷
I wrote my autopilot cargo drone for space engineers and still i am impressed by the work
@YellingSilently 2 роки тому
The end of play lineup was a cute touch. Nice work!
@Phiwipuss 3 роки тому
5:56 The drone in the left down corner synchronized with the beat in the music. Perfection.
@xDeltaF1x 3 роки тому ⁺⁴⁵
That end result with the live-tracking is so good! I wonder how viable it is to train simple neural networks like this for game enemy AI
@originalbillyspeed1 3 роки тому ⁺²
Depends on the game, but on games with a clear goal, it is fairly trivial and will quickly surpass humans.
@AB-bp9fi 3 роки тому ⁺⁴
@@originalbillyspeed1 i guess for different difficulty levels game designer can use agents (enemies) from different generations, for example "easy" = generation 400, medium = generation 500, hard=generation 1000.
@commenturthegreat2915 2 роки тому ⁺⁶
@@AB-bp9fi I don't think that would work for most applications. When you want to make enemy AI easier or harder, you always have to think of it in relation to the player - for instance, in a stealth game, harder AI could mean it detects you faster - which pushes the player to improve and be more careful. That won't happen if you just made the enemies drunk (which is basically what would happen if you pick bad neural networks) - it just adds randomness which can be annoying to deal with. Maybe it could work better in things like racing games though.
@williambarnes5023 2 роки тому
I'm now imagining a game cloud coordinating through the internet. The AI uses background CPU while the game is running to simulate and evolve against itself, spits its best results against the player to see how they fare, and takes those results as more data to go back to the cloud with to keep working. The bots will start laughably bad at first, but they'll learn how players act, and make players devise new tactics... You might even get good teammate and wingman AI out of it if you put those AIs on the player's side.
@MrStealthWarrior 2 роки тому
@@commenturthegreat2915 What about training AI to match the certain level of intelligence? Like if AI detects a player too fast, then it failed the test.
@reaperbs7105 2 роки тому ⁺⁴
Props to Gen 300 and 400 for beings underdogs and yet surviving for so long
@noiky6164 2 роки тому
OMG This is so cool, your video actually change my attitude toward neural network from hate to love.
@thorbenpultke1350 3 роки тому ⁺²
Impressive Stuff! Had my hands on GAs too for my Bachelor Thesis but with a 6 DOF 3D acting robotic arm. Kinda addicting when you dive deep down in ML :)!
@s.m8766 2 роки тому ⁺²
very nice! I'd love to see the same tests, but with added random disturbances like wind gusts from the side, to see how well they can adapt to that!
@Lengthy_Lemon 2 роки тому
You are amazing. Thank you for sharing your fascinating work.
@Reverend-dd2lq 2 роки тому ⁺¹
Getting some strong Factorio vibes at 4:57
@ThePizzaGoblin 2 роки тому
I like how it learned to turn off its thrusters to arrest upward motion and to speed up descent.
@motbus3 Рік тому ⁺²
It would be great to have a remake of this one
@PezzzasWork Рік тому
I am actually working on a follow up :)
@motbus3 Рік тому
@@PezzzasWork noice! I will certainly watch it
@dromeosaur1031 3 роки тому ⁺¹
Thanks for the video! It's really inspiring.
@alessandrodamato5059 2 роки тому
give a consolation prize to generation 300!
It deserves it all
Have you ever tried using a neural network on a hardware platform?
@skoll6007 3 роки тому
1:58 that faint Vader "noooooo" put me on the floor for some reason
@darkfrei2 3 роки тому ⁺¹
Very nice! Please make more such content, with neural network and drones! :)
@flight_risk 2 роки тому
somewhat smaller models and policy gradient following might have increased convergence speed. MLPs are differentiable, so you could just backpropagate through them, sampling distance to the target at every frame and accumulating rewards over the trajectory for an unbiased estimate of a policy’s optimality. you could even use a decay term to incentivize the robots to move faster by downweighting rewards acquired later in the trajectory: distance to the target is ideally the same in the end, but according to the gradient of this reward function, faster would be better.
the only thing left would be running the simulations in parallel or faster than real-time by simply not fully rendering the state of the environment at every training step
@argmentum22 3 роки тому
Adding a fuel allowance would probably add a more varied result, possibly get those burn hard drones quicker. Also maybe increase your destination bubble a fraction ? This increase the prize rate and hopefully the drones would tighten up the homecoming naturally like the ants do for food routes
@jeremybertoncini6935 Рік тому ⁺³
Hello,
very interesting work !
Did you think about testing scenarios with obstacles ?
It would be also interesting to compare the last trajectories and controls with optimal control algorithms solutions.
Cheers.
@KiemPlant 3 роки тому ⁺¹
Other than giving us almost 20 seconds to read 6 words at 4:39 this was very enjoyable to watch :p
@kovacsattila8993 3 роки тому ⁺¹
I tryed the mouse controlled vesion what you uploaded on github. And i saw that it's easy to confuse the A.I. in that way to lose controll and fall off the map. I think if you crate a small Trainer A.I. for the target control what best interest to confuse the drone and make it fall off the map, it can train the drone to not fall off no matter how the target moves.
@PezzzasWork 3 роки тому ⁺³
Yes I did a more robust version that I can upload as well
@dromedda6810 2 роки тому
gen 400 is like that one kid in your class that cant stand still when waiting in a queue
@clairedcaptions 5 місяців тому
I’d love to see an algorithm where you simply add the direction from the current target point to the next, and see if it, with only that information learns to steer ahead of time.
@veggiet2009 3 роки тому
oooh idea. Space Invaders: Drones Addition. Different levels use different generations of drones as enemies.
@DeepRafterGaming 3 роки тому ⁺⁶
I suggest to add more then just time to the fitness equation. Fe. Energy use, pressicion, stability of flight and adding external forces like wind. with these factors the movement would become smooth like silk. But nice project anyway
@PezzzasWork 3 роки тому ⁺⁴
The current fitness evaluation takes speed, precision and stability into account. I tried to add wind after the training was done and it worked quite well :)
@DeepRafterGaming 3 роки тому ⁺²
@@PezzzasWorkahh I see, but the angled engines while hovering still seem very inefficient to me :)
@PezzzasWork 3 роки тому ⁺⁴
@@DeepRafterGaming Yes you're right and I don't really know why they do this. My assumption is that it is a way to reduce power, as if they couldn't go very close to 0 power so it is easier to add angle. This could be avoided by taking energy into account in the fitness function. If I increase gravity, they don't angle the thrusters to gain more power. Here is a windows demo with a config file if you want to try it out github.com/johnBuffer/AutoDrone/releases/tag/v1
@DeepRafterGaming 3 роки тому
@@PezzzasWork Yeah it's hard to tell why. The fitness function is the most complicated part of any neural network.
I would allway advocate for implementing energy use in any neural network because, if you think about it, if the network doesn't have to bother with the used energy it will always come up with unnecessary movement patterns that look jenky. It's more important than speed I'd say ^^
@jetison333 3 роки тому ⁺³
@@PezzzasWork if you watch the way generation 5500 flys sideways, it ends up with one thruster almost horizontal and the other almost vertical. They might like tilting the thrusters because its kind of an inbetween state between flying right and left. So when it gets a new target, it can start flying towards the target sooner. That might be part of the reason anyway.
@manuelpena3988 3 роки тому ⁺⁶
xDDD the "ok..." almost kills me
@Zygorg 3 роки тому
The memes are fun on this vid
@xandon24 3 роки тому
7:25 the music moves to your left and right ear as the drone in the top right moves it's power to it's left and right thruster.
@quinn840 2 роки тому
Pls make more vids like this I love them
@ferociousfeind8538 3 роки тому ⁺³
You could turn the target tracking into a game, try to get the drone to lose control as quickly as possible, using your mouse as the target! Or, just play with it. It looks fun.
@DogsRNice 2 роки тому ⁺⁴
Give the target to another network that tries to learn how to get the drones to crash while the drones learn how not to crash
@angelo.strand 2 роки тому ⁺¹
@@DogsRNice oh no the ai wars
@memento9979 3 роки тому
I like these projects !
@jayshukla6724 3 роки тому ⁺¹³
7:24 Loved how the Gen-400's legs synced with the music...
Btw, How do we decide the size of the hidden layers? Is there some rule or formula for the best size approximation?
@JavierAlbinarrate 2 роки тому
Beginning of the video: LOL!! those squeaks as they fall are really funny
End of the video: let's run to buy some food cans before they come for me!!!
@908animates 2 роки тому
Imagine spending hours and hours trying to get to something and then when you finally get there you just have to go to another one
@keltskiy 3 роки тому
This would be a great premise for a game where the character tracks the mouse so instead of controlling the character you're directing it and it gets better as you play through AI learning
@Algok17 3 роки тому
Very nice result!
@raphulali8937 3 роки тому ⁺³
i have no idea about how you did it ..but it seems like something fun to learn
@PezzzasWork 3 роки тому ⁺²
Machine learning is extremely fun and addictive :)
@00swinter21 3 роки тому ⁺¹
@@PezzzasWork can confirm
@artherius535 3 роки тому
400 was such a trooper
@EsbenEugen 2 роки тому
The target tracking would be cool for a background
@aiksi5605 2 роки тому
This video felt like it's 30 minutes because I somehow kept falling asleep every ten seconds or so.
And it's not boring and no I am not high, idk I guess I just got tired or something
@mytechpractice8924 2 роки тому
Totally amazing!!!
@Fallout3131 2 роки тому
That drone that got yeeted at 5:30 had me dieing 😂
@darkfrei2 3 роки тому ⁺⁵
Which parameters give the drone positive or negative feedback?
Is flying time a positive or a negative parameter? An acceleration to the target?
@bobingstern4448 3 роки тому ⁺²
im more impressed by the smoke, great project though!
@chinmayghule8272 3 роки тому
That was really cool.
@neut_ro Рік тому ⁺¹
I dont understand the sin and cos part in the inputs can someone explain?
@00swinter21 Рік тому
basically you take the angle the drone is currently at and get the sinus and cosinus of that angle
@mawa5702 3 роки тому
Love that video
@ziggyzoggin 2 роки тому
I'm kind of upset that you didn't publish the thing at the end on itch. Its so satisfying to see the drone follow your mouse and I want to play around with it. Great video!
@PezzzasWork 2 роки тому ⁺¹
You can download the control demo here github.com/johnBuffer/AutoDrone/releases/tag/v1
@ziggyzoggin 2 роки тому ⁺¹
@@PezzzasWork thank you! :)
@frodobolson213 2 роки тому
Wonderful!
@sded7126 3 роки тому
Dude the physics look so polished. This is amazing!
@UnitSe7en 3 роки тому ⁺¹
Acceleration (gravity, mass an inertia) is probably the simplest physics properties to program. Literally just adding or subtracting numbers. He does not require your compliments on the physics.
@cobaltxii 3 роки тому
@@UnitSe7en ?
@cobaltxii 3 роки тому
@@UnitSe7en shut the fuck up, he’s giving him a compliment
@Vinz_1223 3 роки тому
Now create an additional network which positions the orange dot (target) to navigate around obstacles on its own.
@crristox 3 роки тому ⁺¹
What about creating new variables? Like saving fuel or energy consumption, or giving priorities like speed over energy/fuel consumption
@JuanPabloLorenzo. 3 роки тому ⁺²
Great video! How long have you been training them? Greetings from Uruguay!
@SomeAutomaton 3 роки тому ⁺³
Ok, now make these drones fight in groups of 5, they can kill other drones in 2 ways one is to ram into enemy drones (killing both of them instantaneously), or shooting them with miniguns (only killing the target if it is hit X amount of times). But every time when they die they respawn, smarter, faster, more accurate, etc.
@SoulZeroTwo 2 роки тому
After a few tweaks, I have a feeling this could have real-world use.
@estebansanchezkanaan2567 3 роки тому
Amazing
@bluecrystal_7843 3 роки тому
if you had an body orientation/angle input they would have been able to recover from a spin out or even fly upsidedown
@notfirstTHERMAL 4 місяці тому
Great video! I've been trying to make a similar recreation of this project in Python but while I get some decent results, I'm struggling with local minima trapping and have failed to get the kind of 'brutal' drones you got at the end of training. Tried having a look at the source code but I'm not too familiar with C++. Just wanna know, what did you use for your fitness function and how did you mutate your networks? A reply would be very much appreciated!
@thetafritz9868 2 роки тому
the target tracking drone would be a really cool and distracting extension, it follows your cursor around where ever you put it lol
@kolterdyx 3 роки тому ⁺²
I love this! I'm gonna implement it right now in Python. What genetic algorithm were you using? I'm planning on using Neat
@CE-ov7of 3 роки тому ⁺¹
how did you get this environment in Python? I want to test policy gradient RL algorithms
@j_owatson 2 роки тому ⁺¹
@@CE-ov7of not sure if you still need this question answering however i'll give it my shot. My guess is hes implementing the basic algorithm of the envirment in python using pygame and and numpy. Then for the AI my second guess is he'll be using NEAT Python library or custom AI/NN algorithm for the agent and training. That's my guess however if you want any question just reply and i'll do my best to help. Python isn't my strongest language however but i'll try my best.
@CE-ov7of 2 роки тому ⁺¹
Hey @@j_owatson , unfortunately this is not something I have time/interest for anymore.
But I really appreciate your willingness to help! This is what makes the software/tech community great!
@sammyboy1112 3 роки тому
Very cool
@kahwigulum 3 роки тому
Gen 5500 appears to display knowing how to fall rather than turning the thrusters to push itself down.
@sulaimantriarjo8097 2 роки тому
how do you tune the weight and bias using GA,? do you intercept the backward process with GA?
@J3R3MI6 3 роки тому
Amazing 😮😮😮
@lightandsmoothcoffee 3 роки тому
Wowwww I'm amazed
@jakobheiter355 2 роки тому
You should make a game out of this, it looks very funny!!
@eyalsegal6730 2 роки тому ⁺¹
Nice work!
What mutation/crossover did you use?
@petersmythe6462 3 роки тому
Would be interesting to have a drone sumo where they can collide and try to shove each other out of a ring.
@abeltoth1878 Рік тому
Really cool project!!!
I was wondering what fitness function you used?
@aesvarash3256 2 роки тому
Can u make a tutorial how to choose the best inputs depend on sample ?
@MarkusBurrer 2 роки тому ⁺¹
you should place the targets randomly and not in a specific order. And for more challenge, they only have a specific time to reach the target. After the time the target disappears. And finally, the targets are fuel. If they miss too often they run out of fuel.
Edit: maybe even add obstacles.
@PezzzasWork 2 роки тому
In the video the targets are in a specific order to be able to benchmark the different generations, for the training I used random sequences
@MarkusBurrer 2 роки тому ⁺¹
@@PezzzasWork Ok, that makes sense
@ethos8863 2 роки тому
You may have to select more aggressively for speed. They seem a bit slower than what the optimal handmade algorithm could do
@aycoded7840 2 роки тому
This is cool.
@spoo77jj78 3 роки тому
"Im a Hovercraft like my Father before me and his before him!"
@baconofburger8784 3 роки тому ⁺⁴
why not add a fuel limitation (which would refill once they get to a point) forcing them to switch between points as quickly as possible from the beginning
@chfr 2 роки тому
wouldn't be necessary, they're already rewarded for speed
@UMMONARQUISTA 3 роки тому
Guys, how do I make a game or whatever these robots are in the video? What application? OpenGL, Unity, Unreal Engine... If someone can tell me.
@241lolololol 3 роки тому ⁺¹
man this is so cool. a bit off topic but how are you rendering the thruster particles and smoke?
@PezzzasWork 3 роки тому
The smoke is just made out of static sprites and the thruster particles are baked into the flame's texture
@sky_hawk0811 2 роки тому
What is being passed at cos Angle and sin Angle? the angle to the target or just the function sin and cos?
@00swinter21 Рік тому
the angle of the drone to the world
@rishiniranjan1746 3 роки тому
its really beautiful.... can you please suggest how do I learn all this. What I learn in what seuqence ??
@guillearnautamarit9102 2 роки тому ⁺¹
Wow that's amazing and looks amazing, how did you cross the two neural networks?
@cathsaigh2197 2 роки тому
Gen 2600 was a big leap in speed and control.
@markvarden3802 3 роки тому
I would love for you to make an eco system like the bibites using those drones
@linsproul3548 3 роки тому
you should make a game where you control a small ship like asteroids and your goal is to juke out the drones and cause them to crash or see how long you can survive before they hit you or something
@angelodeus8423 3 роки тому
it's cool to see your using dropout, so it learns better
@jenvetcar5319 3 роки тому ⁺¹
great Awesome!!👌👌😀. where did you learn to do this?
@ravenatorful 2 роки тому
While it was nice for the visual of all the different generations together, I feel like it would have been better to randomize the dot locations so that they have to learn to adapt to a new path every time
@Success_Unlimited_ 2 роки тому ⁺¹
Nice work! Can you propose me material so that I can understand in practice how to build a neural network? Something with examples.
@PezzzasWork 2 роки тому
That's a good tutorial idea, I will think about it :)

Наступне

Автоматичне відтворення