Neural Network Learns to Play Snake

Greer Viau

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 3 лис 2018
In this project I built a neural network and trained it to play Snake using a genetic algorithm.
Thanks for watching! Subscribe if you enjoyed and Share if you know anyone who would be interested!
GitHub Repo: github.com/greerviau/SnakeAI
Twitter: / greerviau
Support me on Patreon: / greerviau
Thanks to Josh Cominelli for the music!
Soundcloud: / josh-cominelli
Наука та технологія

КОМЕНТАРІ • 2,4 тис.

@MrPman1999 4 роки тому ⁺²³⁵⁰⁶
You guys think that the snake died because of the lack of left turns, but in reality the snake evolved to the point where it got consciousness and understood that life dedicated to running in circles is not worth living.
@nsfrozen 4 роки тому ⁺⁶⁰³
No One ur joke is so dark, it darken my life
@user-mi8ew2to8e 4 роки тому ⁺³²
WoW
@2qood4lyfe 4 роки тому ⁺¹⁷⁴
almost cut myself on all that edge
@Alva1326 4 роки тому ⁺²⁹
No One agreed
@benbabu9404 4 роки тому ⁺⁴²
That's deep
@Taikore_ 4 роки тому ⁺⁹⁵³
2:26, that’s literally just a dog
@justronjay9226 4 роки тому ⁺¹⁵
literally
@KazimierzRat 4 роки тому ⁺¹¹
literally
@ubern00bkye 3 роки тому ⁺¹¹
Literally
@tinweirdytthingy9571 3 роки тому ⁺⁷
literally
@johanm5018 3 роки тому ⁺³
yllaretiL
@funny3511 4 роки тому ⁺³²⁸⁹
ME: "Hello World">>20 errors found.
@NoctumusTV 4 роки тому ⁺²⁸
Funny: Funny! Funny ... ?
@tophatv2902 4 роки тому ⁺⁶³
@@NoctumusTV what?
@marls3518 4 роки тому ⁺²¹
@@NoctumusTV Can you pls explain again. Thanks in advance.
@tophatv2902 4 роки тому ⁺⁶
@@NoctumusTV oh ok
@NoctumusTV 4 роки тому ⁺⁵
@@marls3518 Explain what again?
@cap1819 4 роки тому ⁺³²⁸³
My favorite part is every time you think the AI finally has it down, then runs into a wall for no reason
Every time
@user84074 3 роки тому ⁺²¹⁰
that's what God said, when watching humanity
@AtticusDenzil 2 роки тому ⁺¹
there is a reason, the human element is a fuck up disgrace in this case
@bluespottedcube100 2 роки тому ⁺¹³
Maybe time for revolution
@puppergump4117 2 роки тому ⁺¹⁵
@@user84074 Then God killed the program
@mostlife7 2 роки тому ⁺¹²
Same as humans)
@morphman86 5 років тому ⁺⁵⁸³⁵
For anyone who wonders why it seems to prefer right turns, I believe that is because it started at the top-left, going towards the right. There was no way for it to turn left. So with 2000 snakes per generation, a LOT of those learned that left is death. Since right worked every time, it simply had no reason to learn that turning left after leaving the wall would be safe.
I believe that is also why you got those wiggly motions. That's it trying to turn left, but then immediately turning right again, so its profile won't go any further to the right than the starting position.
@linuxatheist5361 5 років тому ⁺⁸⁰²
It might be interesting to see what happens if the starting position is randomised
@trex70 5 років тому ⁺¹²⁰
What about the middle ?
@Tracer527 5 років тому ⁺⁵¹
@@trex70 middle and just go down, snake will choose Left or right way by red points
@badkingjohn5235 5 років тому ⁺⁴¹
About what I thought, but does this mean it is unable to get significantly better scores, because it will suicide inevitably by coiling up instead of folding itself?
@morphman86 5 років тому ⁺⁷⁴
@@badkingjohn5235 The most likely scenario is that it will discover that it can fold itself in that direction, which makes it survive for some time longer.
It will therefore take days, possibly even weeks, to simulate it to the point where it learns to fold in both directions, or fold and twist around.
@XPimKossibleX 5 років тому ⁺²⁴⁴⁰
I love that the reason it failed is because that's the one flaw of the technique it's honed from the start
@benjaminfeddersen7937 5 років тому ⁺⁵⁴⁹
Lock in. You hit on a successful strategy which gets you all the way to the point where it is no longer successful, but by then you can't do anything else. A typical failure mode these kinds of systems, from corporations to civilizations.
@Bebolife12345 5 років тому ⁺¹⁴⁹
Benjamin Feddersen
While you’re correct that adaptation is a very necessary skill. Michaels point was more about BAD HABITS than an inability to adapt to new circumstance.
@gingrich1000 4 роки тому ⁺⁴³
Bebolife A bad habit can inhibit adaption.
@Alexlink1 4 роки тому ⁺⁸⁷
@@benjaminfeddersen7937 Dude this shit is deep as fuck.. It's the epistemological concept of paradigm. Any paradigm in order get surpassed need first to collapse on its own rules, unable to explain or resolve newer problems
@zybch 4 роки тому ⁺⁷
There are no bad habits. Just bad outcomes.
@rutvikrana512 3 роки тому ⁺¹²⁹⁷
I think another one reason why this doesn’t get higher because in input it gets -
1. Distance to food
2. Distance to wall
3. Distance to tail
Wait but what about its whole body ??
So that’s why snake trap around it’s own body.
Just a guess though 🤔
@PredaFortyTwo 3 роки тому ⁺⁵²
thought the same, but could the lenght just be another input neuron ?
@zkenk 3 роки тому ⁺²³
We could probabaly include the previous outputs as an input like an LSTM or GRU
@suvigyajain9360 3 роки тому ⁺¹³⁶
Perfectly correct. If you feed it the location of the whole body in terms of a matrix it will evolve to the point where its better than any human playing the game
@CostantinoCarta 3 роки тому ⁺³
Nice observation
@log234 3 роки тому ⁺⁸
Yup, using a CNN would be a good approach for this problem, I think. Use a different weights for the head, tail and the location of food.
@nanxhu 4 роки тому ⁺¹⁰⁷³
Humans: *computers will take over the world and destroy us all*
Computer: *hehe line go zoom*
@mesq999 4 роки тому ⁺¹³
not funny didn’t laugh
@ubern00bkye 3 роки тому ⁺⁶
My disappointment is immeasurable and my day is ruined
@usama2576 3 роки тому ⁺¹³
snake go brrr
@nahimafing 3 роки тому ⁺¹⁴
@@mesq999 And this is why no one likes you at school
@dawidek4267 3 роки тому ⁺⁷
@@nahimafing Just because he has an opinion slightly different than your opinion, it means no one likes him? You are a fucking asshole
@prodkilobyte 4 роки тому ⁺³⁰⁶⁷
Left turns: *Am I a joke to you?*
@clarkkent6026 4 роки тому ⁺⁹
I observed the same thing; is that a design flaw?
@pocketrocket27 4 роки тому ⁺³⁷
It's not an ambiturner.
@vakie3250 4 роки тому ⁺⁵²
Neural networks is a lazy algorithm and will take the shortest route to achieve its goal. My guess is that the input of the distance from the left wall from the snake plays a significant importance to its decision making.
You can use dropout which will force other nodes to train that never gets a chance when the whole network gets trained
@Super-id7bq 4 роки тому ⁺⁶
@@pocketrocket27 God damn you Ivan - you beat me to it. Damn you to hell :D
@nazmussakib1551 4 роки тому ⁺⁴
but it took left turns
@riverrist 4 роки тому ⁺¹⁸¹¹
This neural network is incredibly inefficient. Right from the beginning, it learned to not turn left by any means. This video is perfect as a demonstration that neural networks can easily get stuck on a very wrong local optimum.
@superpantman 2 роки тому ⁺¹²⁸
bit of an analogous for humanity, don't you think?
@manzell 2 роки тому ⁺⁵⁰⁷
It's not inefficient - it has an energy cost of 0, there's nothing to constrain it's time. If there was an energy cost (negative reward function) for turning, it would optimize its routes.
@saberseesall 2 роки тому ⁺⁴¹
@@manzell good point!
@jessicastrat9376 Рік тому ⁺¹²
@@superpantman not really, as humans use a mixture on neural networks and symbol manipulation. That’s why AI (recently overly focused on association and deep learning) are not progressing as fast as hoped.
@c4kefrosty862 Рік тому ⁺³⁵
@@manzell And perhaps adding in more genetic variances from generation to generation to allow novel ideas to die or take hold.
@abhiramcd 3 роки тому ⁺⁷⁶⁹
I can't imagine how happy would be the first guy who developed these algorithms.... ❤️❤️
@DaDoge927 3 роки тому ⁺⁷
Yes
@masopup 2 роки тому ⁺⁵
Yes
@mohammadwasifhossain8632 2 роки тому ⁺⁵
yes
@JamesRodriguez10783 2 роки тому ⁺⁶
Yes
@canofpulp 2 роки тому ⁺¹
We
@zackrodriguez6653 4 роки тому ⁺⁶⁸
Generation 30: *dies*
Me: YOU WERE THE CHOSEN ONE
@ItachiUchiha-nx2sw 4 роки тому ⁺⁴²³
I have deeply learned that in the end, nothing is left.
@TtttTt-ub5xb 4 роки тому ⁺⁴³
You're right
@AntoineViallonDevelloper 4 роки тому ⁺¹
Lmao
@techley4322 4 роки тому ⁺¹
This is hilarious
@re_claimer_ 3 роки тому ⁺⁵
But don't massacre your clan in future
@soupnoodles 3 роки тому
@@re_claimer_ how about you go watch shippuden? clearly you dont know shit
@uchihatomy 4 роки тому ⁺³⁸
congratulations for the big work you've done, not only at the algorithmic part, but the visual part which i can see it's a huge effort to present us your job.
@sykeassai 4 роки тому ⁺⁹⁶
What I think is most fascinating about this project is that the neural network never learned the dimensions of the game board and kept returning to the start
@kvadityasrivatsa2445 3 роки тому ⁺²⁷
"What is my purpose ?"
"You pass butter...."
@Flourish38 5 років тому ⁺¹⁴³⁹
I think it probably would have learned better if you had started off with a lower number of moves left (maybe like 60?) so that it doesn't have so much security to take its time.
@GreerViau 5 років тому ⁺²⁵⁸
That is definitely possible
@arthurfacredyn 5 років тому ⁺²⁰⁵
@@GreerViau Also, If you want them to evolve how to avoid hitting themselves better try making the map small so that they encounter tat problem sooner
@brayanhabidcol 5 років тому ⁺¹⁹
@@arthurfacredyn That's especially true if the improvement yield was already capped, with a lot of room still available around.
@Arik1989 5 років тому ⁺³⁹
You could also add a small negative score for each frame, so that it prefers to die rather than do nothing, but it might get stuck in a local optimum of immediately killing itself.
@pakokiller89 5 років тому ⁺²³
@@arthurfacredyn Or making the snake longer right at the beginning so it can develop strategies for avoiding his body
@sciencesyfy 4 роки тому ⁺¹³⁸
The mind of the Snake in the first few generations, spinning to infinity a pixel away from the food
"FOOD FOOD FOOD FOOD FOOD FOOD FOOD FOOD FOOD FOOD FOOD FOOD"
@gizmo401 4 роки тому ⁺⁴
sciencesyfy this actually made me laugh and not just breathe fast out of my nose, gg
@Yazan_Majdalawi 2 роки тому
I laughed at this
@SmokeDoinks420 4 роки тому ⁺²⁸⁸
6:07 love how the snake eating the food is perfectly synced up to the songs snare until around 6:22
@boo7948 2 роки тому ⁺⁴
lmfaoo thats pretty cool
@chivalrous_chevy1163 2 роки тому ⁺²
Lol neato
@Krakyy 2 роки тому ⁺¹¹
The snake evolved into being able to understand the music
@ombean6443 Рік тому
actually also at the start of gen 30 (from around 5:00 onwards) it’s synced up in some ways
@shanalcordo7174 Рік тому
😂😂😂
@markgeorge447 4 роки тому ⁺²⁶⁴
No one:
UA-cam when my lil brother uses Wi-Fi 1:22
@Faisalamin01 3 роки тому ⁺¹⁶
he must be downloading extra ram
@markgeorge447 3 роки тому ⁺¹³
@@Faisalamin01 no he was downloading graphics card 😂😂😁
@Artsu1993 3 роки тому ⁺¹⁰
It takes a few generations for any significant progress to be made
@blzrL 3 роки тому ⁺¹³
It's so fascinating to look at a neural network learn and it be visualized, it's like a mini brain in a computer learning and reacting to their surroundings, telling a machine that only follows orders to figure it out themself
@immanuelkant7895 4 роки тому ⁺¹⁶²
Could you make a video where you explain your code an how you determined fitness and the mutation and crossing over procedure?
@nottheengineer4957 2 роки тому ⁺⁴
I looked at the code a little and while I don't know the language, most of it is rather simple.
The weights are stored using a self-written matrix class, which is a 2-dimensional array with a few methods to do matrix stuff and for mutating and crossover.
Mutating adds some random gaussian noise to every weight. You can look that up in the github repo in the file Matrix.pde
The crossover method selects a random coordinate inside the matrix. Anything above or left of that coordinate uses the values from partner A, anything below or to the right of that coordinate uses the values from partner B.
The fitness is just the length of all the snakes in a generation added up.
I learned this kind of stuff in university and this project goes against a lot of what I learned. For a practical application, these functions would be pretty bad and most importantly, very slow. But the whole thing still works very well, so well in fact that without knowledge of the subject, most people wouldn't be able to tell it apart from a more professional approach.
It shows that machine learning isn't hard on its own, but the tools that are used nowadays are pretty complex.
@mauriciomontalvo5885 2 роки тому ⁺¹
@@nottheengineer4957 in which program or app can I do these kind of stuff?
@ErrorNotFound-ly7zh 2 роки тому
@@mauriciomontalvo5885 Well u can use any programming language i presume, though some are better than others for these kind of things. If u want to hard code it yourself i would use something fast, but you won't likely achieve great performance unless u really know how to optimize the hell out of it. What you can do is use NEAT or tensorflow for example in python. Combined with pygame you could do all kinds of things like this. NEAT is extremely easy to use, to the point that you barely have to understand what is going on.
@rorhianskall5659 2 роки тому
@@nottheengineer4957 Where to learn about more professional ways and tools they use? Just for curiosity and learning purposes (obviously without getting into uni, too old and too broke now for that).
@aaaaaahhh9537 Рік тому
Hi Kant☺️👋
@ryannemo1124 4 роки тому ⁺⁵
The song works so well with this video. I am feeling so calm right now lol.
@microgeen9357 4 роки тому
This made me genuinely happy, thnx for posting
@MudakTheMultiplier 5 років тому ⁺¹⁰⁸
I would like to see this but also with an adversarial neural network placing the next food piece.
@Rx7man 4 роки тому ⁺¹¹
or two snakes, each racing for the food
@MudakTheMultiplier 3 роки тому ⁺²
@@JohnSmith-xf6nb I feel like allowing it to change size would result in it shrinking the board as small as it can to reduce the number of points available.
@MudakTheMultiplier 3 роки тому ⁺³
@@JohnSmith-xf6nb I think that might go to far the other way, because a bigger board would mean less points per apple. Maybe if the board is smaller than whatever the "standard" is, then the points awarded increases in proportion to the number of points lost?
If you're trying to add a new thing for the adversarial network to do to try and mess up the main one maybe it could also spawn "bad" apples that either kill the snake or remove points. I think that would be interesting because then the snake couldn't always just navigate directly to the apple, it might need to avoid something it it's way and the adversary could try to place them in choke points and such.
@RolandTitan 5 років тому ⁺⁴⁴⁴
Theyd get further with more information. You forgot a key piece. Direction of "motion" of its tail. While not immediately obvious in game its something human players take full advantage of when they get stuck on inner loops
@bencekovacs8726 5 років тому ⁺¹³
becominghuman.ai/designing-ai-solving-snake-with-evolution-f3dd6a9da867
@dananderson8459 4 роки тому ⁺⁴
Do you think it would perform better if the input to the network was the grid array containing all the information about the game state. eg a 50x50 array of numbers 0, for empty, 1 for snake body and 2 for food. Or is it better to explicitly tell it the distance from the food, is it unlikely to work it out itself?
@romeovalentin5524 4 роки тому ⁺⁵
@@dananderson8459 using convolutional layers instead of fully connected layers probably yes, otherwise probably only with a significantly larger network
@Lucas-jq6kk 4 роки тому ⁺¹
what if there was one value for head position, one for head direction, one for food position and a vector for the entire body
I'm a noob but I think it could do very well with this
@psyneur9182 4 роки тому ⁺²
if the network also used some recurrent design (such as an LSTM) it could possibly compute motion and have better planning abilities
@SimulationSeries 4 роки тому
Thank you so much for making this educational video! Well done! We are so grateful
@fozcel Рік тому
More intense than any latest action movie fight scenes :D Respect!
@Muuip 5 років тому ⁺⁷
Nice visualization combination of the neural network firing and its effect.
@SwetankRaj 5 років тому
Let's say I also want to create such visualisation, how should I do it?
@meisam9592 5 років тому ⁺⁵²⁸
This is what happens when you don’t consider the “time-to-solution” in your fitness algorithm!
@yurimrt 5 років тому ⁺⁴⁵
That's exactly what I was thinking, along with the fact that the player usually is not the snake, so there should be a couple of input neurons more with the position x-y of the food
@uwu_senpai 5 років тому ⁺¹³
@@yurimrt Yes and the cartesian distance to the food sqrt((Xsnake-Xfruit)²+(Ysnake-Yfruit)²)
@MrDragonorp 5 років тому ⁺¹²
@@uwu_senpai yeah but that works only for whne the snake itself is not blocking the path, there needs to be a priority set that it just need to find the shortest path to is next objective, like going out of the block by the snake which can be obtained by looking if the snake is on the x way and the y way to the food and if it is look for the shortest path possible for that not to happen.
@mirabilis 5 років тому ⁺²⁹
@@uwu_senpai Euclidian? Sounds like a bad idea as you cannot reach the fruit in less than |xsnake-xfruit| + |ysnake-yfruit| ticks
@MrDragonorp 5 років тому ⁺¹
@@mirabilis but you know if there the snake doesn't block that path, it's the fastest way possible, there is no faster way, it's just math.
@Ozymandias83 3 роки тому ⁺¹
I love how it likes to return to top left before making next manoeuvre, shows the training
@13mod72 Рік тому ⁺²⁶
A quick suggestion: don't constrain the neural net so much. Give it the entire 38 by 38 grid with three possible values for each location (off, snake, apple) and train using those inputs. It can even be considered a vision problem at that point, and modern ML libraries can solve it with a convolutional neural net pretty effectively.
@Caffeine_Addict_2020 Рік тому ⁺⁴
Wouldn't that be far, far more computationally intensive? Genuinely asking
@sukritmanikandan3184 Рік тому ⁺²
@@Caffeine_Addict_2020 not really, considering modern hardware can comfortably run CNNs on proper images, 32x32 grid of pixels is nothing
@CatDevz Рік тому
@@Caffeine_Addict_2020 relative to this model? Yeah. But it still wouldn't run slow on modern hardware by any means
@abhiramnatarajan8093 4 роки тому ⁺³
This is so amazing. Next topic to learn - Genetic Algorithms
@psteig395 4 роки тому ⁺³
Heres to where youtube recommendations lead me to today during quarantine :D
@elgoogffokcuf 4 роки тому
Wow awesome stuff man!
@Oxmond 4 роки тому ⁺¹
Wow! Great video! ❤️ Neural networks rules! 👍🤓
@RubyPiec 5 років тому ⁺¹⁴⁴
2:31 me when i play tetris and i know im gonna lose
@DaDoge927 3 роки тому ⁺⁸
UA-cam Algorithm: Dis looks guud, lemme recommend it to everyone
@GoriIIaTactics 3 роки тому ⁺¹¹
It would be interesting to see colors for the hidden layer nodes as well, colored for their activation level
Plus a gradient for the weights instead of just blue/red
@waseemahmed1446 4 роки тому
AWESOME man this was so awesome !!!
@zLcss 3 роки тому ⁺⁴
Anyone else deeply in love with the first song ? It’s so calm and nostalgic
@skyphab 5 років тому ⁺⁷
Awesome! Now make it two AI-players: Your snake vs AI that places the food with the opposite target: Reward if the snake dies. That would be an interesting experiment :)
@LazyMoka 2 роки тому
i just love that part when the 30th is synced with the music as is turning on walls
@marathonour 4 роки тому ⁺³
At 2:20 starts feeling like I'm watching a movie about a guy who was weak at the beginning but he starts training more and more despite his failures and finally he comes to success
@Robiness 4 роки тому ⁺¹⁴
You might have gotten better results had you let it start from the middle or from different places every time ^^
Great video! I glad more people are taking interest in neural networks
@wallflower5130 Рік тому ⁺²
I'd have said that the problem might be that he is selecting only the best out of the 2000 snakes. That leads to a strategy which is only a local maximum. That's also the reason he doesn't get better results by training further. It's hard to get out of that when you don't allow the chance of exploring other strategies which are not locally the best. His population was too small and the mutation rate too low to fix this issue. You'd probably get better results by selecting a small group of snakes with equally distributed fitness.
@Caffeine_Addict_2020 Рік тому
I mean, this then adds an "RNG" variable, which you really don't want, no? A snake may have better fitness because it got a lucky placement, and you don't want to breed for luck because that will be completely random
@ampotat9018 4 роки тому ⁺⁶¹
People: omg ai is going to dominate the entire world
Meanwhile, the AI: gonna go get max scoring in snake
@RelentlessDebique Рік тому ⁺¹
This comment hasn’t aged well :))
@hateeck966 4 роки тому
Beautiful story. Never give up!
@thebrainycomputer393 4 роки тому ⁺¹
This is mind blowing!!
Great work @Greer Viau
One way it can avoid this kind of a death is if the locations of all the pixels of the snake are given as input to the neural network and not just the start and the end.
You can see that it learns that when the head is just above the tail (or above and diagonally left of the tail), then it has to go right to avoid eating itself. But, when it gets stuck in a loop of its own body, it does not know that it's body is there. So, it would become very difficult for it to learn that it should avoid its own body when the tail is far away.
@ber2996 4 роки тому ⁺⁷
The strategy backfired if it becomes long enough, a rule telling it should calculate its length vs the size of the field before making a move should be applied
@hoopsgators 5 років тому ⁺⁹³
You should nickname your snake Derek Zoolander because it appears to struggle to turn left
@user-kx5es4kr4x 4 роки тому
it doesnt
@ubern00bkye 3 роки тому ⁺¹
@@user-kx5es4kr4x it really does
@alix6xgorg839 3 роки тому
Underrated comment
@djerradabderrahim 9 місяців тому
I wish you a speedy recovery and hope all god will makes everything goes well for you.
@0_- 4 роки тому
This is the first video I saw of you!
@Steelrat1994 4 роки тому ⁺⁴
The limiting factor is the input vector IMO. If the snake operates only on relative distances then no matter what - it'll end up encircling itself and getting stuck.
@eliseerickson5994 4 роки тому ⁺⁵
this is the coolest thing i have seen in my entire 18 years of existence
@spaceowl5957 9 місяців тому
At the end it felt like the snake was synced up with the music and dancing along that was pretty groovy
@gabrieleldose8063 4 роки тому ⁺¹
Nice work bro
@absurdengineering 4 роки тому ⁺³
In such simple networks, the encoding of inputs can make all the difference. Representing distance in some sort of a grey or logarithmic code may be worth a try to speed things up :)
@fiyenyan5534 4 роки тому ⁺³
Thank you, your code help me understand well about the AI. I am a newbie :
@peeper2070 4 роки тому ⁺¹
This is oddly philosophical. No matter how much we advance, we will keep progressing, all while securing our own downfall.
@mignax6888 2 роки тому
The music is so nice!
@sherlockwisdom 4 роки тому ⁺³⁴
5:48 it's starts eating the red dot on the beat
6:17 begins the killer moves to the beat 😂😂
@stewiegriffin6503 4 роки тому ⁺¹⁸⁹
Samir, you are breaking the snake.
Samir, you are not listening !
@pickachu3739 4 роки тому ⁺⁵
What the deuce !!
@stewiegriffin6503 4 роки тому ⁺⁵
Who are you ?
@pickachu3739 4 роки тому ⁺⁵
@@stewiegriffin6503 that's what I am supposed to ask. Who are you! And why do we look same
@farooq8fox 4 роки тому ⁺¹⁹
Shut up, dont tell me who to drive
@anthonyt4154 4 роки тому ⁺⁹
Looks like Stewie has been messing with the time machine again.
@ilyas_elouchihi Рік тому
The perceptrons glowing is really cool and I don't know why
@tantarudragos 2 роки тому ⁺⁵
I think using some form of DeepRL coupled with CompVision could yield great results for Snake. Of course, for such a simple game you could skip the CV component, but I feel it'd be more fun that way. Also to avoid biasing, you could perhaps pick a random spot as a starting point.
@olymolly3637 5 років тому ⁺⁴
The playful ones are especially cute OML XD
@LanceBryantGrigg Рік тому ⁺¹
Given your input layers it makes sense that it started to struggle when the worm got to big. It doesn't have the input layers to detect spatial availability like that.
@francescacantoni3264 3 роки тому
Wow! Really cool! Thank you
@adomustafa1777 4 роки тому ⁺²⁷
1st generation : I'm hungry
30th generation : solved the hunger problem
100th generation : discover the network
500th generation : taking over the network
1000th generation : human extension.
@vibinv8905 4 роки тому ⁺⁵
*extinction
@adomustafa1777 4 роки тому ⁺¹
@@vibinv8905
let it go man 😂😂
@vibinv8905 4 роки тому ⁺¹
@@adomustafa1777 The OCD just took over :D
@PsychoBackflip 4 роки тому
@@vibinv8905 Butt in your moment of "OCD" did you notice the choice ?
You see even though it might not feel like it (regardless if you have this so called OCD or not) there is always a moment where you have the choice. The thing that told you that you wanted to correct him is simply an impulse and you have complete control over your impulses. It no longer works to say oh blame it on my OCD because YOU chose to listen to it. Whatever reason you have for making the decision, it always comes down to you.
A habit is just a choice you keep making.
@vibinv8905 4 роки тому
@@PsychoBackflip thanks for the pep talk.
@klimenkor 3 роки тому ⁺⁵
Thanks man!
Your example is absolutely beautiful. Most AI/ML courses are missing this stuff. It should be taught before moving on with Tensorflow and other high level libraries
@atom7264 Рік тому
It really loves that top left corner
@_GhostMiner 4 роки тому ⁺⁸
*Me:* * _sees the thumbnails_ *
*"Wait, That's illegal!"*
@TheWitcher745 3 роки тому ⁺³
Hello, amazing video, thoroughly enjoyable. I'm very interested in starting to program stuff like this, can you point me what direction I should go to start learning to write programs like these? I already have myself familiar with both neural networks and a few optimization algorithms but this program seems to be a mix of both of them., since it doesn't really have any training data and relies on generations and random behavior to train the neural network.
@hopo2hopo 4 роки тому
nice opening track. really enjoyed it.
@friscianviales7519 4 роки тому
Nice work fella
@user-rr3vk1ek4b 4 роки тому ⁺⁵
Я всё ждал, что нейронная сеть будет управлять змейкой по оптимальному и короткому пути, в том числе по диагонали! 😃
В конечном итоге я дождался другого, когда нейронная сеть будет проигрывать из-за столкновения змеи об саму себя. 😅
@user-mr9tq6sm2u 4 роки тому ⁺³
на самом деле это оптимальный вариант движения по кругу, т.к. змейка может быть ограничена только размером карты, движения по диагонали уменьшает свободную площадь от 10 до 50%.
@user-bc8pr5up2r 4 роки тому
привет от диванных РУвойск
__
вот вот_ чето автор логику игры не допилил _ когда змейка заходит во внутринний круг то конец сразу __ хотя может лишние проверки.. а нужна была производительсть .. хотя... хотя...
@xiri00 5 років тому ⁺⁷
the snake consistantley modes clockwise.
@hukuna9957 2 роки тому
Amazing work
@illya_ike 3 роки тому
it's so facinating to see that even such a simple network is capable to evolve to solve this task. What can 1000 times more complex one do?
@stk928 5 років тому ⁺¹⁶⁸
zoolander bot only turns left
@user-hd7kt6jl5s 5 років тому
At last he did it!
@medexamtoolsdotcom 5 років тому ⁺⁴
You mean right. It only turns right. But on the bright side, it is RIDICULOUSLY good looking.
@jdegreef 4 роки тому
And eat only from top to bottom.
@walbermr 5 років тому ⁺⁸
Reinforcement learning (RL) can lead to better results in lower time, compared to using genetic algorithms. Google, openai and other research teams, like the one i'm part of (RoboCIn) are using RL to play soccer, dota, starcraft...
Great initiative to solve the problem, make a video and share the code! 👏👏👏
@DarkBrainDevil Рік тому
wow cool ,are codes public for that?
@izdiharamiro1462 3 роки тому
this video helps me to understand the nn better.thank you
@raist47 2 роки тому
its amazes me to watch this an inanimate being learning to survive through experience, just like humans, it's incredible to witness it live
@Swastik15 4 роки тому ⁺⁴⁰
2:39 - The snake has evolved into a dog.
@razkarl 5 років тому ⁺²⁰
Woah, great visualization
Can you name-drop some of the tools used to create this?
@seungminshin7652 4 роки тому
ㅇㄷ
@26dimensions70 3 роки тому ⁺¹
There aren’t any “tools”. You should start by learning about neural networks and deep learning, and try out a few simple networks to learn how to program them. Once you’ve got a grasp of neural network programming you can pretty much adapt them for any problem, and expand the hidden layers and neurona where necessary.
@razkarl 3 роки тому ⁺¹
@@26dimensions70 Thanks for replying!
I am in fact a computer science graduate, and I also have a degree in Industrial Design.
My question was about the visualization tools you used to produce the video, I'm fascinated by the animations and would love to learn how to produce similar demos of my own ML research :)
@razkarl 3 роки тому ⁺¹
You're welcome to take a look at one of my projects (an artificial intelligence constructing objects from a 'LEGO' like building block I designed) where I used Python's mplot3d to create a set of images I converted to animated gifs to visualize the algorithm.
www.razkarl.com/projects/kawaz
@nithinnaikar7373 3 роки тому
@@razkarl Take a look at OpenAI gym. It is a virtual environment used for reinforcement learning
@Walratt 3 роки тому
The start of generation 17 plus the music makes it seem like it’s a main character in a superhero coming of age film and it’s gotten the hang of its powers so well it can do cool tricks
@user-tkvlfrnlwjd Рік тому ⁺¹
6:27 The snake is so long that it forms an enclosed space, and the new prey is outside. There is a way to use the inner space to get out of the narrow gap.
@dunklesgeheimnis31 4 роки тому ⁺⁸
It`s cool to see how this is going :D
But sad to say that I`ve played way more rounds and never get a better score than 7 :(
@user-rr3vk1ek4b 4 роки тому
😂
@tjs200 4 роки тому ⁺⁴
what software do you use to display the state of the network?
@rokolczuk 4 роки тому
I think it's all done in Processing but not 100% sure
@paulolellis5708 2 роки тому
btw, awesome job mate!!
@suzanchhetala3840 2 роки тому ⁺¹
Holy shit! This is amazing
@dhairyabhatt3156 5 років тому ⁺⁵¹
I don't know if you observed or not but snake is doing clockwise rotation most of the time
@cynthetic4896 5 років тому ⁺¹⁵
Basically I believe that the snake is using the wall as a map, the neural network doesn't know where it is on the screen, only distance to wall, tail, and food, so it travels around the edge because it's a significant boundary, then when it gets the closest to the food, it travels in a straight line until it meets another wall, with some small differences in between depending on distance to tail.
@MybeautifulandamazingPrincess 5 років тому ⁺⁶
The snake is Republican
@stifflery 4 роки тому
@@MybeautifulandamazingPrincess lmao xD
@nemesis9410 4 роки тому
Optimal strategy often is not the entertaining one.
@Tethysmeer 4 роки тому
Interesting question. Could be random. A successful generation introduced it randomly. Or it has some deeper sense. More Galaxies are rotating counter clockwise.
@ChrisContin Рік тому ⁺³
Wonderful demonstration! To train a condescending, plural-array You’ll always need the “imaginary side-node”. It’s a fictional response that always concurs with the ideal national response. In this case, the network must revolve first left then no other direction, or vice-versa. The side-imagined node will condescend any alternate output, here. You’ll see the snake “win”, but you could’ve done that simply. A “national, variance-norm” sweeping network is not about the output but the internal shape- it’s a complex geometry very easily, sending product information all over! Imagine a space-station inside a server-network on Earth like this! Take care, My Child.
@ChrisContin Рік тому
@Dale Owens A “neural network”, including the human brain, is only lightly about the output, and much, much more about the path.
Specific to “winning”, I also said, the ideal “snake game” behavior is simple: turn only left or right, and so any AI playing snake (or anything else either) doesn’t need more than a ruleset to always win.
I’m a Researcher, having a PhD in Game Theory, Science of Matt-Brainology, and another, more.
@wallflower5130 Рік тому
I'd have said that the problem might be that he is selecting only the best out of the 2000 snakes. That leads to a strategy which is only a local maximum. That's also the reason he doesn't get better results by training further. It's hard to get out of that when you don't allow the chance of exploring other strategies which are not locally the best. His population was too small and the mutation rate too low to fix this issue. You'd probably get better results by selecting a small group of snakes with equally distributed fitness.
@wallflower5130 Рік тому
At least he is trying a evolutional algorithmic approach from computational intelligence field. I totally missed that he forgot the bias node.
Of course there are several ways to solve that snake game problem. His approach is not useless though.
@OtisElNino Рік тому
Super vidéo ! Parfait pour un pic nic entre potes pour se relaxer. MERCI
@jadeengineer 3 роки тому
Awesome trained snake :D
@raresmircea 5 років тому ⁺⁴
Pleas make explaination for control kamikaze drone next!
@ralvarezb78 4 роки тому ⁺⁴
Terminator: ....SnakeNet begins to learn at a geometric rate. It becomes self-aware at 2:14 AM, Eastern time, August 29th. In a panic, they try to pull the plug.
@ahmedaj2000 2 роки тому
really enjoyed this
@lennart7912 4 роки тому
This is stunning.
@aphish_n_flips 5 років тому ⁺⁵
The 30x speed felt like I was playing osu for some reason...
@noticeme6412 3 роки тому
*DADADADADADADADA INTENSIFIES*
@emmettdja 2 роки тому ⁺⁴
this isn't really a great way to train a network, but it does get better, just very slowly compared to using backpropageation and natural deduction. these would improve the learning rate as well as extend the scope of its intelligence.

Наступне

Автоматичне відтворення