How AI Learned to Reason

Art of the Problem

Published 21 Nov 2024

COMMENTS • 169

@ArtOfTheProblem 10 days ago ⁺²⁰
Thanks for watching, would love your thoughts below.
NO MUSIC version: ua-cam.com/video/DFDOyMZw9Q4/v-deo.html
Watch Full AI Series: ua-cam.com/video/YulgDAaHBKw/v-deo.html
Sponsored by Brilliant | Use brilliant.org/artoftheproblem for 30-day free trial and 20% discount
@8enos 8 days ago ⁺¹
Thank you for the NO MUSIC version!
@heardistance 5 days ago ⁺¹
Love your videos! Just a little suggestion. Background music is good, but too loud. It should never cover your speaking, like now. I suggest 15-20% less music volume, and you are good!
@victormuchina4865 9 days ago ⁺³³
This guy just explained all the core concepts in AI in one shot. Congrats, man!
@ArtOfTheProblem 9 days ago ⁺³
:) thank you, I cut a LOT out of the video in my edit - going to post a shorter bigger summary soon
@user-hl2yj8kp2s 9 days ago ⁺⁸
I love this video. I remember watching your videos like 10 years ago on Khan Academy about compression, entropy, Claude Shannon, etc. All timeless. I have always loved this style of documentaries. We need to protect you at all costs.
@ArtOfTheProblem 8 days ago ⁺²
Thank you, I love hearing from OGs! Support future work: www.patreon.com/c/artoftheproblem
@HayashiManabu 9 days ago ⁺¹⁴
I love your video aesthetics, how you blend retro video clips with your explanations. I think you'd really enjoy retro-futuristic concepts and games like Bioshock and Fallout.
@ArtOfTheProblem 8 days ago
Love this, I definitely know the style you are talking about
@KainniaK 8 days ago ⁺⁸
Albert Einstein said: "If you can't explain something in a simple way so anybody can understand it you don't fully understand it yourself". Perhaps you are one of the few LLM experts we have!
@ArtOfTheProblem 7 days ago ⁺¹
THANK you this means a lot to me.
@KainniaK 7 days ago ⁺³
@@ArtOfTheProblem I did but Reddit really hates it, it got removed on 4 subs. The internet does not like to get educated anymore man.
@KieranGarland 4 days ago ⁺²
is this the last video in the series? regardless, can't tell you how valuable and enjoyable i've found them all. thank you for them.
@shawnbibby 8 days ago ⁺⁵
Another great video. Understanding the "world model" and the algorithm that makes the decisions in it was very expansive. Also adding the self training/emulation of dreams is a powerful analogy to the human.
Seeing how thinking longer, blended in with intuition to make better chains of thoughts, is also fantastic. Every time I reflect on machine learning, I learn more about myself. Which kind of makes you think it's more sentient if it reminds me of myself? Or the best emulator ever!
@ArtOfTheProblem 8 days ago
thank you! ai agree....also you are my "top commenter" according to YT. :)
@Sawaedo 9 days ago ⁺⁶
It is a great explanation of how current AI models reason. I liked the video a lot!
1. Simulation of future states.
2. LLMs that can give kind-of accurate answers with step-by-step reasoning.
3. An RL approach that makes LLMs give multiple answers, then evaluates them to select the best one. (Requires more time.)
It would be nice to see whether a model that wasn't trained on internet data could learn how to reason by interacting with an LLM and practicing on its dreams, but maybe we'll see that in the future.
For the awesome review, history explanation and science communication:
Thanks! 🎉
@ArtOfTheProblem 9 days ago
thanks for sharing summary
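The third point in the summary above (sample several answers, score them, keep the best) could be sketched roughly as follows. This is a toy under stated assumptions, not the method shown in the video: `generate_candidates` and `score` are hypothetical stand-ins for an LLM sampler and a learned verifier, and the "question" is just a list of numbers to add up.

```python
import random

def generate_candidates(question, n, rng):
    """Hypothetical stand-in for sampling n step-by-step answers from an LLM."""
    true_answer = sum(question)
    candidates = []
    for _ in range(n):
        # Simulate imperfect reasoning: the final answer is sometimes off by one.
        answer = true_answer + rng.choice([-1, 0, 0, 1])
        candidates.append((f"add up {question}", answer))
    return candidates

def score(question, candidate):
    """Hypothetical verifier: higher is better (a real system would learn this)."""
    _, answer = candidate
    return -abs(sum(question) - answer)

def best_of_n(question, n=8, seed=0):
    """Sample n candidates and return the one the verifier scores highest."""
    rng = random.Random(seed)
    candidates = generate_candidates(question, n, rng)
    return max(candidates, key=lambda c: score(question, c))

if __name__ == "__main__":
    steps, answer = best_of_n([2, 3, 4], n=16)
    print(steps, "->", answer)
```

The extra cost the comment mentions shows up directly: every additional candidate is another full generation plus a scoring call, so quality is bought with inference time.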
@devbites77 7 days ago ⁺²
Great vid. I love that it clearly explains the progression, like the pieces coming together. Can't wait to see the next steps!
@ArtOfTheProblem 7 days ago
thanks, next up i'm taking a detour into economics
@nowweknow. 5 days ago ⁺²
So good! Loved it
@dogukartal 7 days ago ⁺⁹
Answering the question of "Does it think actually?" is as hard as the question "Are other people conscious like me?".
@whitb62 6 days ago
The hard problem of consciousness.
@Farligefinn 1 day ago
@@whitb62 Not really the same thing.
@whitb62 1 day ago ⁺¹
@@Farligefinn You know what, I just wrote a paragraph disagreeing with you but I reread the initial question and deleted it. Rereading and reinterpreting "Does it think actually?", I actually see what you're saying. A clearer word would have been "reason." "Think" can have a few different interpretations and I was attributing it to consciousness. But whether AI "reasons" is a very different question entirely, and I believe that is what he and you mean. Does it go through a sequence of logical steps from premises to a conclusion? Does it use deduction? This is what was meant.
@Farligefinn 1 day ago ⁺¹
@@whitb62 thanks for the forthright and civil answer :) was about to expect some harsher language that seems to be the norm online these days.
@GodVanisher 22 hours ago
@@whitb62 Do you all forget that an AI is literally just a mathematical algorithm that does things as it is told? No amount of complexity will change that. On the other hand, consciousness has been proven to be non-computable.
@DisProveMeWrong 7 days ago ⁺²
"Charging down a path that often lead to the wrong conclusion." Yep, sounds human to me.
@ArtOfTheProblem 7 days ago
@@DisProveMeWrong so very human
@KainniaK 8 days ago ⁺²
Finally. I live for these videos. They are the most fascinating vids ever made. Thanks for continuing to educate us, you are a hero!
@ArtOfTheProblem 2 days ago
thank you I appreciate it
@subashbaskota9948 3 days ago ⁺²
Keep up the great work!
@ArtOfTheProblem 3 days ago
appreciate it
@antleredvixen 10 days ago ⁺⁸
This is an absolutely amazing video!!!!!
@ArtOfTheProblem 10 days ago ⁺¹
thank you! I was so in the weeds with it i hope it comes across as clear? I tried to strike a balance...
@DavidTaylor-cz9pz 9 days ago ⁺⁸
THANK YOU for publishing a no-music version of this video (see pinned comment by ArtOfTheProblem). It is such a clear and informative video that I hated to see it lose views due to the competing soundtrack. I'm going to watch it again right now to see if I missed anything the first time around.
Thanks again for being so responsive to your followers.
@ArtOfTheProblem 9 days ago
Thank you for saying that, I find the music keeps me interested as I take sooo long to edit
@MdKais-lf6wj 10 days ago ⁺⁹
Best Channel I've ever followed.
@ArtOfTheProblem 10 days ago ⁺²
Thank you! when did you join? Please help post to your networks
@notbfg9000 9 days ago ⁺¹
@@ArtOfTheProblem I for one was looking up some "how does AI work" stuff yesterday and some of your vids came up a couple of times, I watched multiple authors with their own unique takes (3Blue1Brown and Nottingham Uni's Computerphile also good channels). This video made me follow tho. I think you earned it :3
@ArtOfTheProblem 9 days ago ⁺¹
@@notbfg9000 great to hear, i've been working to try and fix my thumbnails to make them interesting to click on. always open to feedback
@notbfg9000 9 days ago ⁺¹
@@ArtOfTheProblem No particular criticisms there :)
I don't really pay great attention to thumbnails, but maybe that's not true for most people lmao
@mostlynotworking4112 10 days ago ⁺²
Thank you so much. I wish I had the time to give feedback, thanks for being willing to open it up
@ArtOfTheProblem 10 days ago
Appreciate the feedback! happy to share
@michaelpapadopoulos6054 10 days ago ⁺⁶
Having read a bit about the AI safety arguments, learning about these arguably incredible developments into artificial minds is now accompanied by a sense of dread as well as the sense of awe.
@ArtOfTheProblem 10 days ago ⁺²
I love to hear this...well said
@brainmuffins6052 10 days ago ⁺²⁰
I wish i could learn how to think 🤔
@andrewdunbar828 10 days ago ⁺³
Exactly. Reasoning is a skill.
@jessemiller1911 9 days ago ⁺³
Amazing explanations, visuals, and historical context!
IIRC MuZero trained the policy and value networks (used to roll out the MCTS tree) also on the output of the MCTS tree. This seems super useful because search can be used to improve the training of the networks (not just the results at inference time). I wonder if this also works for CoT/ToT in LLMs, where the pretraining could include ToT to boost training performance?
@ArtOfTheProblem 9 days ago ⁺¹
yes it did, and yes it seems to help. Look at inference time training, just a few days ago a group got a new record on the ARC test doing this kind of thing (i haven't had time to go deep). x.com/akyurekekin/status/1855680785715478546
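The idea discussed in this thread, using the search result itself as a training target for the policy network, might look roughly like this in miniature. It is a sketch under assumptions rather than MuZero's actual code: the visit counts are made up, and a bare vector of logits stands in for the policy head of a real network.

```python
import numpy as np

def softmax(logits):
    shifted = logits - np.max(logits)
    exp = np.exp(shifted)
    return exp / exp.sum()

def search_policy_target(visit_counts):
    """Turn root visit counts from a tree search into a probability target."""
    counts = np.asarray(visit_counts, dtype=float)
    return counts / counts.sum()

def cross_entropy(target, logits):
    return float(-np.sum(target * np.log(softmax(logits) + 1e-12)))

def train_policy_toward_search(visit_counts, steps=200, lr=0.5):
    """Nudge toy policy logits toward the search distribution by gradient descent."""
    target = search_policy_target(visit_counts)
    logits = np.zeros(len(target))  # stands in for an untrained policy head
    loss_before = cross_entropy(target, logits)
    for _ in range(steps):
        # Gradient of cross-entropy w.r.t. the logits is softmax(logits) - target.
        logits -= lr * (softmax(logits) - target)
    return softmax(logits), loss_before, cross_entropy(target, logits)

if __name__ == "__main__":
    # Hypothetical visit counts for three moves at the root of a search tree.
    policy, before, after = train_policy_toward_search([10, 70, 20])
    print(policy.round(3), before, after)
```

The point of the loop is that the network's fast "intuition" is pulled toward what the slower search concluded, so the next search starts from a better prior.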
@roylevy5897 10 days ago ⁺⁴
Great video as always, can't wait for the next ones! Top research quality.
I think world models deserve more focus rather than LLMs, which are probably a dead end to true understanding of the real world. Yann LeCun has very interesting ideas about these, in his JEPA and V-JEPA architectures and some of his lectures. I also think neuroscience can provide incredibly interesting and valuable insight into ML architectures: why not take ideas from a model that has undergone hundreds of millions of years of optimization for the very same abilities we are trying to model? Maybe memory is an interesting pathway (perhaps for a video), both working memory and long term (episodic, semantic)...
Anyways, just some of the ideas I've been thinking about recently.
@ArtOfTheProblem 10 days ago ⁺²
appreciate you sharing these thoughts, i've been following LeCun as well and hope to do another update once I see more results
@TrotterG 7 days ago ⁺²
One tweak that would help this video perform better is to decrease the relative volume of the background music, especially at the end right before the ad. But it may be too late for that on this one, idk how YouTube works.
@ArtOfTheProblem 7 days ago ⁺¹
yeah i wish I could, it's locked after upload...i do have a no music version (unlisted link above) thank you for feedback
@ivanyashchenko790 10 days ago ⁺⁶
Thanks for video ❤
@ArtOfTheProblem 10 days ago ⁺¹
appreciate the comment please share with anyone in your network who is interested!
@Flyingblackswan 9 days ago ⁺²
The information and animations are both excellent but the music overpowers your audio. Either lower the volume of the music or get rid of it completely, please.
@ArtOfTheProblem 9 days ago
Music free version in top comment and description
@khoakirokun217 10 days ago ⁺²
Ah Yoo, I see "Art of The Problem", I click. Easy like that.
@ArtOfTheProblem 10 days ago ⁺¹
:)
@scoffpickle9655 9 days ago ⁺³
PLEASE make a video on memory augmented AI (neural Turing machines/differentiable neural computers)
@ArtOfTheProblem 9 days ago ⁺¹
thanks for suggestion, noted! currently watching the field
@mattsains 9 days ago ⁺¹
I would love to see a video about the ethics of machine learning models and especially LLMs. There is a healthy body of literature out there to draw from about issues like intellectual property and copyright, enabling and obscuring bias, impact on marginalized communities, the resources used by model training and computation, etc.
@ArtOfTheProblem 9 days ago
thanks for sharing, noted!
@ArtOfTheProblem 7 days ago
If you can help share my new video around any of your networks today it might catch fire and would help me support the channel. I appreciate your help! ua-cam.com/video/PvDaPeQjxOE/v-deo.html
@JavierSalcedoC 10 days ago ⁺⁶
you'll never please 100% of any audience. 2nd law of conquest is a thing. keep doing your thing, your music is as iconic as vsauce's is to theirs
@ArtOfTheProblem 9 days ago ⁺¹
:) thanks
@EvanMildenberger 9 days ago ⁺¹
@artoftheproblem I agree! I love the music. But maybe if you just lower its volume compared to the narration, then you might appeal to more people without losing those of us who like the music (but not necessarily its intensity). I think ones who complain might just be easily distracted by the soundtrack’s loudness rather than hate the music choices.
@bbrother92 9 days ago ⁺²
I love your channel. Are you a programmer or more of a mechanical engineer?
@ArtOfTheProblem 9 days ago ⁺²
thank you! I studied both in school, and naturally land somewhere in the middle....bad at both! I enjoyed algorithm design, but what I love most is putting on a 'show' whether movie, play, product or haunted house :)
@bbrother92 9 days ago
@@ArtOfTheProblem Thanks for the reply. About AI: I think we should just call these systems statistical machines or dynamic pattern parsers. I am really skeptical about non-text machine learning. We still have not solved the fly brain: scientists have a fixed 3D map without understanding how it works. It's like mapping an Intel CPU while still knowing nothing about the ALU, registers, memory, or gates.
@ArtOfTheProblem 7 days ago ⁺¹
If you can help share my new video around any of your networks today it might catch fire and would help me support the channel. I appreciate your help! ua-cam.com/video/PvDaPeQjxOE/v-deo.html
@bbrother92 7 days ago ⁺¹
@@ArtOfTheProblem "yes the fire rises" Bane =)
@andrewdunbar828 10 days ago ⁺¹³
Here's a puzzle: Do all people reason or do many only memorize patterns? Even people who definitely do reason, do they always reason or do they also just memorize patterns most/much of the time?
@DavidTaylor-cz9pz 9 days ago ⁺⁸
That's a wonderful question Andrew. I'm a cognitive scientist who is watching the emergence of LLM-based AI with that very question in mind. The fact that LLMs can come so close to our own cognitive abilities is usually viewed as a sign that AGI is almost here. But it can also be viewed as a demonstration that human cognition itself is nothing more than the repetition of learned patterns with minor variations. In one case we'll be thrilled by how clever we are to have reinvented the awesome capabilities of human intelligence. In the other, we're more likely to be humiliated by the realization that we are, essentially, repetition/prediction engines. The reality almost certainly falls between the two, but as someone who has studied human intelligence his entire life (in and out of academia), my bet is that we are much closer to repetition/prediction machines than we'd like to admit.
I'd love to find a deep discussion of this issue. Maybe a future video in this series (hint, hint)?
@jackmeyergarvey 9 days ago ⁺⁶
I'd argue humans don't tend to rely on either very often. Instead, humans tend to think very heuristically. Deductive reasoning and memorization/recollection are really only required for very precise tasks. Instead, our brains learn a very general feeling of how to do things by strengthening neural pathways that are used repeatedly. Even humans who try to act very logically are generally heuristically feeling their way through tasks, occasionally thinking through algorithms that have been "memorized".
@sulemanmughal5397 8 days ago ⁺⁴
Reasoning takes effort and the brain doesn't like to do that often, so it switches to pattern recognition and intuition as much as possible
@andrewdunbar828 8 days ago ⁺¹
@@sulemanmughal5397 I would go further and say going from reasoning to this is one kind of learning and is also akin to 'muscle memory'.
@ArtOfTheProblem 7 days ago ⁺¹
I agree :) also, if you can help share my new video around any of your networks today it might catch fire and would help me support the channel. I appreciate your help!
@goekhanbag 6 days ago ⁺¹
Great video, as always:)
@thebiggorp1623 8 days ago ⁺¹
The perceptron is a universal approximation machine. AI cannot think, it can only approximate thought. AI = approximate intelligence.
@maryjanecruise1674 9 days ago ⁺¹
Excellent video! You are a born professor! 👍
@ArtOfTheProblem 9 days ago
thanks mom
@유현석-p3m 2 days ago ⁺¹
absolute cinema
@BrutusMyChild 7 days ago ⁺¹
4:19 Could you elaborate on which hand-coded formulas used by Shannon with TD-Gammon in the year 1989 you are referring to? Also, when and how did Shannon work with TD-Gammon? "And so, the first key breakthrough in machines mimicking intuition for position quality came when neural networks replaced the hand-coded formulas Shannon used in 1989 with TD-Gammon"
@ArtOfTheProblem 6 days ago
Yes! I made a whole video on this, you can check it out here: ua-cam.com/video/Dov68JsIC4g/v-deo.html - please let me know if you have questions after watching. Shannon didn't do TD-Gammon, Tesauro did. enjoy
@BrutusMyChild 6 days ago
@@ArtOfTheProblem Thank you. I'll watch it.
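For readers unfamiliar with the "TD" in TD-Gammon, the core temporal-difference update can be sketched on a toy problem. This is a simplified illustration, not Tesauro's program: TD-Gammon used TD(λ) with a neural network evaluating backgammon positions, whereas the sketch below is tabular TD(0) on a hypothetical five-state chain where only the final step pays a reward.

```python
def td0_chain(num_states=5, episodes=100, alpha=0.5, gamma=1.0):
    """Tabular TD(0) on a chain 0 -> 1 -> ... -> terminal, reward 1 on the last step."""
    values = [0.0] * num_states  # values[-1] is the terminal state and stays 0
    for _ in range(episodes):
        for state in range(num_states - 1):
            next_state = state + 1
            reward = 1.0 if next_state == num_states - 1 else 0.0
            # Move the estimate toward "reward now plus value of where I ended up".
            td_error = reward + gamma * values[next_state] - values[state]
            values[state] += alpha * td_error
    return values

if __name__ == "__main__":
    print([round(v, 3) for v in td0_chain()])
```

Each state's value is learned from the next state's estimate rather than from a hand-written formula, which is how the "intuition for position quality" gets trained from play.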
@Grateful.For.Everything 4 days ago ⁺¹
Thinking is for fools lol, now KNOWING….. knowing is Cool AF😎!
@timl2k11 10 days ago ⁺¹
It seems like some of these developments regarding world models should have huge implications for robots that can function in a human-centric world. I think we’ll see an explosion in development of robots that can help humans with everyday tasks, and a robot that can be a useful household assistant will be a reality in the next 10 years!
@ArtOfTheProblem 10 days ago
thanks for sharing, yes I'm watching this very closely
@nikbivation 1 day ago ⁺¹
wow, thank you for this!
@ArtOfTheProblem 21 hours ago
appreciate it! stay tuned
@CC1.unposted 9 days ago ⁺²
Context length is the problem;
that's the main reason models need to keep becoming bigger.
Or you could train a CNN-inspired architecture in which a model is shown a sliding window and produces a token, which is repeatedly fed back to it as input. Once the output is small enough to fit as input to a full-context model, it is passed to one such as GPT, Claude, etc.
Or you could also use RL to mutate or search for JS code capable of generating code. JS is so abstracted it's perfect.
I made a small programming language with hoisting, so that the order of statements doesn't matter, and with syntax simple enough that the local-minimum escape problem is solved, and I want to train a model.
If I get a model I will then continue training, else I'll do a dev log video.
Eventually I'll get the world's first infinite-context model.
@ArtOfTheProblem 9 days ago
thanks for sharing
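The sliding-window proposal in the comment above can be made concrete with a small sketch. Everything here is an assumption for illustration: `summarize` is a hypothetical placeholder for a learned encoder that maps a window of tokens to one token, and the windows are non-overlapping (stride equal to window size) to keep the example short.

```python
def compress_until_fits(tokens, window=4, max_len=8, summarize=None):
    """Repeatedly replace each window of tokens with one summary token
    until the sequence fits a downstream model's context of max_len."""
    if window < 2:
        raise ValueError("window must be at least 2 so each pass shortens the sequence")
    if max_len < 1:
        raise ValueError("max_len must be at least 1")
    if summarize is None:
        # Placeholder for a learned encoder; here just a cheap hash of the window.
        summarize = lambda chunk: sum(chunk) % 1000
    tokens = list(tokens)
    passes = 0
    while len(tokens) > max_len:
        tokens = [summarize(tokens[i:i + window])
                  for i in range(0, len(tokens), window)]
        passes += 1
    return tokens, passes

if __name__ == "__main__":
    compressed, passes = compress_until_fits(range(100))
    print(len(compressed), "tokens after", passes, "passes")
```

The sketch only shows the control flow. Whether a learned `summarize` can preserve enough information across passes is exactly the open question the comment glosses over.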
@shenrr6802 7 days ago ⁺¹
Commenting to help with the algo, and moving to the no-music one to do the same
@ArtOfTheProblem 7 days ago ⁺¹
@@shenrr6802 thank you! I have no music unlisted as to avoid splitting the momentum
@easlern 7 days ago ⁺¹
Thanks so much for these, I had no idea about some of these approaches. I’m wondering now if anyone’s tried applying MuZero to ARC, since the challenge of ARC is learning implicit rules from just a few examples
@ArtOfTheProblem 7 days ago ⁺¹
@@easlern yes this is happening right now with test time fine tuning !
@Timme-m7d 10 days ago ⁺¹
Once we understand how we reason, making LLMs reason like us is possible.
@ankrisstark7824 10 days ago ⁺²⁵
The video is good but there are sooo many random sounds that make it difficult to focus on what you are saying, specifically towards the end.
@ArtOfTheProblem 10 days ago ⁺⁶
Here you are! ua-cam.com/video/DFDOyMZw9Q4/v-deo.html
@ParsevalMusic 9 days ago ⁺²
Goooood
@ArtOfTheProblem 9 days ago
thank you! curious what questions you have after watching this?
@Phlosioneer 6 days ago
Constructive criticism:
1. The substance of the video was very good. Script was well written, delivery was ok. A bit monotone but not that bad.
2. Sound design was poor towards the end. The music drowned out your voice, and the lyrics were both distracting and discordant.
3. Your choice of clips, footage, and visuals was good. The video was informative when needed, and abstract/entertaining/interesting otherwise.
4. The narrative structure was okay. It was a mostly clear progression. At the end it became unclear which AI was doing what strategy.
5. Visuals were reused way too often. Visuals can be reused, but I think the brain wormhole clip was shown 6 times, way too many.
6. Beware over-using a metaphor image. The upwards shot at two trees was reused so many times as a visual for tree-like thinking that it just became annoying.
@palousination 10 days ago ⁺⁵
I like the music but it's too loud
@ArtOfTheProblem 10 days ago
thanks for note
@retrofitter 9 days ago ⁺¹
The audio mix is horrific, it's not simply a matter of adjusting the levels
@ArtOfTheProblem 9 days ago ⁺¹
@@retrofitter no music version: ua-cam.com/video/DFDOyMZw9Q4/v-deo.html
@neithanm 20 hours ago ⁺²
Please, invest in a decent microphone. It's brilliantly presented, but hard to hear well. The music track is not ducking either so your voice and the music compete for the same ears.
@ArtOfTheProblem 19 hours ago
thanks, I have a great mic, but I do need to mix the audio better which i'll do next time (btw, i have a no music version in top comment)
@mndtr0 8 days ago ⁺²
And that's, kids, why AI will replace programmers
@ArtOfTheProblem 8 days ago ⁺²
it's interesting that the programmers I know are as divided as the field is
@Zeitgeist9000 10 days ago ⁺¹
Thanks!
@ArtOfTheProblem 10 days ago
mucho appreciated!
@justindie7543 10 days ago ⁺³
Simply excellent video, your style reminds me of Every Frame a Painting
@ArtOfTheProblem 10 days ago
appreciate this feedback, I also enjoyed that channel
@Gekko-t4i 10 days ago ⁺³
does language think?
@lakastusmanatus 9 days ago ⁺²
To me, AI is just some linear algebra and a complex algorithm that follows orders. The thing is, humans only need a few examples to learn, while AI needs a massive database of objects and images to "understand the subject".
@ArtOfTheProblem 8 days ago
Lots of interesting research on learning with less, including recent advances such as “learning to walk in 5 min”. Did you see my RL video?
@lakastusmanatus 8 days ago
@ArtOfTheProblem edit: I'm pretty sure that in the future a lot of people will be fired and replaced by those "AI", and, well, literally by the people that use the AI. Also, I get what you mean.
@deanian3128 10 days ago ⁺¹
The reply works lol 👍
@BrianMosleyUK 9 days ago ⁺²
This is a very hopeful video. There are billions of dollars being poured into bringing the resources to hand, to find an effective approach to AGI... Once AGI really kicks in, the acceleration of progress bounded only by our imagination will be something to behold. Absolutely awesome. I hope it leads to a world of abundance where we have no need for psychopathic power seekers. 🤞
@ArtOfTheProblem 9 days ago
thank you for sharing, would love to know what you'd like to see next
@BrianMosleyUK 9 days ago ⁺¹
@ArtOfTheProblem maybe something in response to the 5+ hours of Anthropic interviews on Lex Fridman... I'm sure that might inspire some topics? Sam Altman rarely gives any insights to what OpenAI are doing, Mark Zuckerberg is equally vague. I think that interview gives more of an insight to the direction of travel.
@ArtOfTheProblem 9 days ago ⁺¹
@@BrianMosleyUK yes I have been catching up on those
@Nate-bl9hy 10 days ago ⁺⁷
Although I know I’m in the minority, I really enjoy the music. The ambiance created adds to the experience for me
@ArtOfTheProblem 10 days ago ⁺⁴
thanks for sharing, I feel same way. the music is part of the original idea for the channel...a feeling. but because people can get distracted I think i'll post music free as optional from now on.
@io9021 9 days ago ⁺¹
I generally like the music. But in the second half of this video the music is very loud and distracting.
@io9021 9 days ago
Maybe it's not only the loudness, but also the choice of music that is distracting to some. E.g. at 2:00 I don't feel distracted, but at 15:00 very much so. Anyways, thanks for making these great videos
@ArtOfTheProblem 9 days ago
@@io9021 made a no music version too! ua-cam.com/video/DFDOyMZw9Q4/v-deo.html curious what questions you have after watching this
@diegoesteban5194 5 days ago
Hey, what's the name of the song at 16:05? Thanks!
@ArtOfTheProblem 2 days ago ⁺¹
these are all original tracks
@372leonard 10 days ago ⁺¹
16:19 is there a third side that is a bit nicer to the AIs and believes in them, that they are good enough as they are at reasoning? 😂😂😂
@ArtOfTheProblem 10 days ago
:)
@FB7ACCFFF8C 4 days ago
I love your videos but the background music is just too loud
@ArtOfTheProblem 2 days ago
see no music version in top comment
@吳錫亮-g1z 10 days ago ⁺¹
I think it is too difficult for people to conjecture about computers’ thinking.
@ArtOfTheProblem 10 days ago
Thank you so much, love this perspective
@orang1921 10 days ago ⁺⁴
inb4 the dozens of "stochastic parrot" arguments
@ArtOfTheProblem 10 days ago ⁺²
take cover!
@andrewdunbar828 10 days ago ⁺¹
stochastic parrots are just autocomplete on steroids
@mancroft 10 days ago ⁺¹
The background noise is incredibly irritating.
@ArtOfTheProblem 10 days ago ⁺¹
sorry I should post silent versions, i'll do it and link here!
@mancroft 10 days ago
@ArtOfTheProblem good idea
@ArtOfTheProblem 10 days ago
@@mancroft Here you are! can you put this in your comment and then i'll pin it? ua-cam.com/video/DFDOyMZw9Q4/v-deo.html
@Blackhole.Studios 7 days ago ⁺¹
is this 3blue1brown? 6:04
@Blackhole.Studios 7 days ago
The network, not the graph
@ArtOfTheProblem 7 days ago
yes! i credited him, as well. if you watch his video it points back to mine
@Blackhole.Studios 7 days ago ⁺¹
@ArtOfTheProblem really? I was just pointing it out, I found it interesting how you used a youtuber's media who I enjoy watching.
@ArtOfTheProblem 7 days ago
@@Blackhole.Studios yes I always loved that animation and credit to him for taking the time
@Blackhole.Studios 7 days ago
@ArtOfTheProblem exactly
@MichaelScharf 5 days ago
Great video, but the annoying music makes it hard to follow
@ArtOfTheProblem 5 days ago
see music free version in top comment
@pastuh 10 days ago ⁺³
Quantum computers + AI + satellites
@alish5417 4 days ago
its recording you simple as fart
@rs20894 10 days ago ⁺¹
The methods used to train MuZero seem to be conflated with the use of chain-of-thought methods for LLMs here, which tells me that this channel has gotten sloppy. Like, really sloppy.
With self-play and world-models, the weights of the model are changed by some external trainer after each round. With chain-of-thought in LLMs, there is *literally no learning happening.* No weights are changing. No reasoning from one problem will be kept for future problems. Maybe Mister Gippity can reason through one problem if you explain it, but you *will* have to explain it *again* in a new session.
I expect "chain-of-thought" works because transformer-model LLMs have no internal feedback mechanisms in the way that older RNNs like LSTM models do. My understanding is that that fact is what has made transformer models so effective - they're easier to train at-scale when you don't need to take into account all previous states, essentially just giving it all previous inputs instead of training based on "what it might have been thinking at the time." But the result is that it is literally incapable of self-reflection, and the only way of recovering that feature is to give its own output back to it as input, which is what CoT does. CoT isn't some spectacular emergent behavior, it's just a workaround for some features that were removed to make training more efficient.
But why should that feedback mechanism take the form of human-readable text? That sounds horribly inefficient to convert between "thoughts" (latent spaces) and English-text and back again especially when the "reasoning" that results cannot be applied to other problems. Because again... the weights are *not* updated after solving a problem. That's the "P" in "GPT."
Sure, these "ai" companies will save your chat logs and use them to train and update their weights, but that's just training it on text that gets it wrong and has the corrections explained to it... which will lead to it continuing to get things wrong, expecting corrections to be explained to it. The "ai emperor" has no clothes, as far as I can tell.
@ArtOfTheProblem 10 days ago
Thanks for sharing. I'm not conflating MuZero's training with CoT. Rather, it's drawing an analogy between search strategies - both use systematic exploration of possible paths before committing to an answer. Also have a look at test time training, this does include weight updates! And transformers do have dynamic weight updates through attention. I'd argue using natural language for reasoning isn't inefficient - it's actually leveraging the model's core strength not to mention explainability....what do you think?
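The mechanical point both sides of this exchange rely on, that chain-of-thought feeds the model's own output back in as input while the trained parameters stay fixed, can be illustrated with a toy loop. This is a sketch under assumptions: `next_token` is a made-up arithmetic function standing in for a frozen LLM, and it does not model test-time training, which by contrast does update weights.

```python
import copy

def next_token(params, context):
    """Toy frozen 'model': a fixed function of its parameters and the context so far."""
    return (params["a"] * sum(context) + params["b"]) % 10

def generate(params, prompt, steps):
    """Autoregressive loop: each output token is appended to the context
    and becomes part of the input for the next step."""
    context = list(prompt)
    for _ in range(steps):
        context.append(next_token(params, context))
    return context

if __name__ == "__main__":
    params = {"a": 1, "b": 1}           # hypothetical "weights"
    before = copy.deepcopy(params)
    print(generate(params, [1, 2], steps=4))
    print("weights unchanged:", params == before)
    # A fresh session with the same prompt repeats the same work from scratch.
    print(generate(params, [1, 2], steps=4))
```

All of the intermediate "reasoning" lives in `context`, which is discarded when the session ends, so nothing carries over unless a separate training step changes `params`.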
@jeffsad8391 9 days ago ⁺¹
Dislike, man, because I asked this "AI" for help solving a problem, but it crashed and left the chat 🤬🔪🪓
