OpenAI’s new “deep-thinking” o1 model crushes coding benchmarks

Поділитися
Вставка
  • Опубліковано 5 лис 2024
  • Let's take a first look at the new ChatGPT o1 model - a state-of-the-art reasoning AI model from OpenAI that shows unmatched abilities in math, science, and coding.
    #programming #ai #thecodereport
    💬 Chat with Me on Discord
    / discord
    🔗 Resources
    o1 model openai.com/o1/
    V0 UI development • Front-end web developm...
    Google's Doom Engine • The future of game dev...
    📚 Chapters
    🔥 Get More Content - Upgrade to PRO
    Upgrade at fireship.io/pro
    Use code YT25 for 25% off PRO access
    🎨 My Editor Settings
    Atom One Dark
    vscode-icons
    Fira Code Font
    🔖 Topics Covered
    What is o1?
    Update on Devin AI programmer
    How does OpenAI o1 work?
    GPT-4o vs o1 benchmarks
    o1 vs Claude
    What is the best LLM in 2024?
    Trends in Artificial Intelligence

КОМЕНТАРІ • 2,9 тис.

  • @divineaghulor3887
    @divineaghulor3887 Місяць тому +10125

    "Or maybe I'm just a horse influencer saying, a car won't take your job, but a horse driving a car will"...deep stuff man

  • @SpontaneouslyDeliberate
    @SpontaneouslyDeliberate Місяць тому +7412

    If my job was coding solutions to problems with rigorously-defined requirements, this would be concerning.

    • @nixielee
      @nixielee Місяць тому +1720

      If my job ever had a single rigorously-defined requirement, I would be happy

    • @abhishek-soni
      @abhishek-soni Місяць тому +44

      🤣🤣

    • @GSBarlev
      @GSBarlev Місяць тому +32

      People around me have been pushing "natural language code gen" for a while now in the data analysis space, to which I say-anyone who can execute a clear and unambiguous data ask using _natural language_ more efficiently than they can construct the ideal SQL query or DataFrame op is a savant, of one form or another.

    • @josk8936
      @josk8936 Місяць тому +681

      I want to see how future pro ai managers, that fired all the developers, do when the client tells them the app just stopped working without other details and they have to find the error in the codebase with 20k lines of code that pass hundreds of states up and down the component tree like a seesaw

    • @Rugg-qk4pl
      @Rugg-qk4pl Місяць тому +131

      That sounds like aerospace software development. I assure you they do not want AI code in their planes 😄

  • @arunkennedy9267
    @arunkennedy9267 Місяць тому +4462

    I like how Turing test now is how many r's are there in Strawberry.

    • @saeidtafazzol3892
      @saeidtafazzol3892 Місяць тому +32

      lol

    • @esarmiento7
      @esarmiento7 Місяць тому +12

      hahahaha

    • @Gawroon7
      @Gawroon7 Місяць тому +123

      I have a friend who manages to say "strawberry" without using any of the "r" in it.
      This example shows that is also a philosophical issue.

    • @justaname999
      @justaname999 Місяць тому +54

      @@Gawroon7 I asked whether by "there are two Rs" chat GPT meant that there's only two phonemes of R. The reply was very off. something like "Yes, I mean actual graphemes. Even though the second R might be hard to perceive, there are still 2 Rs in the word "strawberry" in correctly spelled English"
      It's very funny.

    • @genghiskhan6688
      @genghiskhan6688 Місяць тому +12

      why is this task so hard anyway?

  • @last_fanboy_of_golb
    @last_fanboy_of_golb Місяць тому +898

    PHD student here, the key to beat any LLM is to use a stick

    • @avg_user-dd2yb
      @avg_user-dd2yb Місяць тому

      I'll beat you with that , you are useless now.

    • @roosterru
      @roosterru Місяць тому +24

      Or a strawberry

    • @wesley6442
      @wesley6442 Місяць тому +11

      Also, unplugging it from the wall socket xD

    • @avg_user-dd2yb
      @avg_user-dd2yb Місяць тому +1

      @@last_fanboy_of_golb where to find this "stick" Is that some software?

    • @kindlin
      @kindlin Місяць тому +7

      @@roosterru A strawberry on a stick.
      EDIT: Sorry, Strawbery.

  • @Beknown107
    @Beknown107 Місяць тому +462

    O1 is a hilarious name for a program which has an exponential energy bill

    • @charfractal9441
      @charfractal9441 Місяць тому +8

      LOL

    • @kindlin
      @kindlin Місяць тому +2

      This comment section is next level.

    • @peterson6824
      @peterson6824 Місяць тому +20

      so many were freaking out about crypto energy costs, but since AI, everyone is like "well, we gotta advance"

    • @Manwith6secondmemory
      @Manwith6secondmemory Місяць тому +9

      You guys realize that they will get cheaper right. It has not even been 2 years since chatgpt 3.5 was released. It’s been about 7 years since transformers have been invented.
      So 7 years AT most, about 1.5 years of large scale efforts, and 5.5 years of niche work before that.
      Keep coping, how old will you be in 2035?

    • @Daniel-zh4ln
      @Daniel-zh4ln Місяць тому

      @@Manwith6secondmemory32

  • @florduka
    @florduka Місяць тому +4186

    My HTML job is really gone now

    • @Flavont_77
      @Flavont_77 Місяць тому +36

      cry more😂

    •  Місяць тому +400

      Don't worry: no one knows how to do good HTML, neither the AI

    • @vasiovasio
      @vasiovasio Місяць тому +45

      Front Page Express, Windows 98! 😊😊😊

    • @SamBrockmann
      @SamBrockmann Місяць тому +35

      You're still coding in html? Oh, sh*t. 😂😂

    • @nicholasmaniccia1005
      @nicholasmaniccia1005 Місяць тому +46

      I've never been more unsure of a joke. Are you are saying it's easy to write proper HTML it's just no one does it. Or you think it is hard to write proper HTMl because everyone has their own opinion or something.
      Because it is really easy to write proper HTML just nobody does it because they don't see learning it or taking the time worth the effort for their genius brains.

  • @Trait74
    @Trait74 Місяць тому +1844

    Thanks to fireship for almost giving me a heart attack at the beginning and then relieving me at the end lol

    • @bigboysdotcom745
      @bigboysdotcom745 Місяць тому +80

      That's literally his formula

    • @maxave7448
      @maxave7448 Місяць тому +167

      So, apparently this new million dollar idea from openai is just a self-proompter? Ironic how prompt "engineers" got replaced way before programmers ever could be

    • @w花b
      @w花b Місяць тому

      ​@@maxave7448 good.

    • @jhordanrojas9184
      @jhordanrojas9184 Місяць тому +2

      He's master that

    • @ethanfreeman1106
      @ethanfreeman1106 Місяць тому +16

      @@maxave7448 >prompt "engineers" got replaced
      hilarious how you pointed that out lol

  • @AwesomeDwarves
    @AwesomeDwarves Місяць тому +322

    Most of my job as a software engineer is meetings, design, documentation, and watching Fireship. Sitting down to code probably only accounts for 20%. I'm either totally safe or I'm doing it wrong and I'm in imminent danger.

    • @AmandaVieiraMamaesouCult
      @AmandaVieiraMamaesouCult Місяць тому +23

      I'm a data engineer. I spend more time talking to humans to figure out the requirements, quelling indecisive humans to create the requirements, translating the requirements into foundational/architectural decisions, clicking some stuff in whatever cloud tool I'm using and then, for a brief period of time, I code and maintain some intermediate level SQL in an 800-line query.

    • @callmeshen9754
      @callmeshen9754 Місяць тому +8

      It's exactly how it should be, People just doesn't know how many projects companies (Mostly the big ones talking from experience) having so many projects on hold/delays. At very least for the next 5 years I guaranteed there is no need to panic, It will push more interns/juniors to certain projects they would've need been able to join beforehand.
      The question should be in that regards, What would happens in the far future if there won't be enough projects (Or the need for more)? It's less likely in the upcoming years but I'm sure it's very likely situation.. And there is a raise of CS degrees already so ye, There is a case here but at very least not in the near future.

    • @Buzmanm
      @Buzmanm Місяць тому +12

      Your job isn't in danger, at least for now, it's juniors the ones that should be concerned, especially the ones graduating in 3 or 4 years. The barrier of entry has grown and will keep growing exponentially.

    • @RubenKelevra
      @RubenKelevra Місяць тому

      I'm pretty sure ChatGPT 4o is great at meetings. ;)

    • @purpose6113
      @purpose6113 Місяць тому +3

      This will change with AI agents

  • @naeemulhoque1777
    @naeemulhoque1777 Місяць тому +1419

    5:40 *"Ai won't take your job, but another man using Ai will.."*

    • @Monkeymario.
      @Monkeymario. Місяць тому +5

      3-x

    • @rumfordc
      @rumfordc Місяць тому +75

      another man with a decade of engineering experience, and a CS degree, using AI will* which is not too different to what was happening before AI. there's always been guys that are drastically faster than the average. the issue is that they're always rare and as tools and tasks become more complicated they become rarer.

    • @shipso6116
      @shipso6116 Місяць тому +24

      @@rumfordc yep, exactly. It's an eternal regularity and "using AI" is a coincidence here. They will win not because of "using AI", but because of being "at the top of their game", which *coincidentally* may now involve using AI, or may not. Different times different tools. May even find your own. Looking at the broad picture it's "staying ahead" what matters, not "using AI" per se. Those are not equal yet and hardly ever will be, at least for some parts of IT industry.

    • @moonwine7398
      @moonwine7398 Місяць тому +3

      ​@@rumfordcthere will be day when AI will not need human for anything and it is coming within 5-6 years, so your quote HUMAN USING AI WILL REPLACE HUMAN WITHOUT AI which is a parrot quote repeated by many AI supporter is a blind and misleading quote.
      They are working to make AI more intelligent then human they don't need human intervention in AI

    • @rumfordc
      @rumfordc Місяць тому +11

      @@moonwine7398 😆🤦‍♂ come back when you know what a quote is

  •  Місяць тому +57

    I think it’s pretty amazing they managed to build the equivalent of an all knowing but also friendly and helpful person on stackoverflow considering the lack of real training data.

    • @baileymickb324
      @baileymickb324 Місяць тому +5

      This is outrageously funny. They probably had to mash together Pinterest or a recipe blog with stack overflow answers just to make it palatable.

  • @pandoraeeris7860
    @pandoraeeris7860 Місяць тому +952

    The cutting edge of Code Reports.

    • @perthecther__203
      @perthecther__203 Місяць тому +10

      EDGE

    • @vertas.y
      @vertas.y Місяць тому

      @@perthecther__203 EDGE OR Chrome 😭😭😭😭😭😭😭😭😭😭😭😭😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😂😭😭😭😭😭😭😭😭😭😭😒😒😒😒😒😒😒😒🥰🥰🥰🥰🥰🥰🥰🥰☺☺☺☺☺☺☺☺😐😐😐😐😣😣😣😣😐😐🥳🥳🥳🥳🥳🥳🥳🥳

    • @CyanRooper
      @CyanRooper Місяць тому +1

      OF

    • @Tozu25
      @Tozu25 Місяць тому +1

      The fact that everyone is forgetting for some reason is that AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society.
      Why should anyone be excited? We are witnessing the start of something really bad.

    • @Gregorythe5_5551st
      @Gregorythe5_5551st Місяць тому

      ​@@Tozu25 To be fair, if billions of people have nothing to lose i can't imagine companies can keep such a status quo going for long. I hope anyway.

  • @richbaird9407
    @richbaird9407 Місяць тому +631

    If only a PhD were about skills like programming and solving equations. Literally every PhD student uses solvers for anything more complex than basic calculus anyways. The challenge of a PhD is learning how to think about things in unique ways and pushing boundaries and exploring new possibilities.

    • @some_one
      @some_one Місяць тому +160

      No no no you got it all wrong, you get a PhD to solve standardized questions on a test!

    • @o1-preview
      @o1-preview Місяць тому +10

      it has learning tokens now, wait another 2 models and get back to me

    • @VisionaryPathway
      @VisionaryPathway Місяць тому +1

      @@o1-preview facts

    • @bartekb4191
      @bartekb4191 Місяць тому +36

      There are too many PhDs with closed minds out there for it to be true...

    • @pavlinggeorgiev
      @pavlinggeorgiev Місяць тому

      @@o1-preview just another 2 models bro ... trust me

  • @marc-io
    @marc-io Місяць тому +1331

    Impressive it can beat PhD students. But remember a PhD in breakdancing is not the same as being a breakdancer.
    This one could be called GPT-Raygun.

    • @SkegAudio
      @SkegAudio Місяць тому +25

      😂 good one

    • @gabrielbarrantes6946
      @gabrielbarrantes6946 Місяць тому +57

      what exactly means it can "beat phd students"? I suspect is faster pretty well known problems that are well documented over the interned lol, so totally worthless.

    • @randomlettersqzkebkw
      @randomlettersqzkebkw Місяць тому +36

      @@gabrielbarrantes6946 well, it can either mean beating them in a fist fight, or getting more correct answers than they can. Im not sure which one though🤔

    • @icaromendes1250
      @icaromendes1250 Місяць тому +7

      If AI had feelings it would definitely being hurt by this insult

    • @tainicon4639
      @tainicon4639 Місяць тому +13

      PhD students are also still learning. How does it compare to the pissed off post doc who’s been stuck in academia for 15 years after he graduated…

  • @andrewcampbell7011
    @andrewcampbell7011 Місяць тому +27

    “It’s basically just like GPT4 with the ability to recursively prompt itself”. Exactly. We are in the parlor tricks phase of this hype cycle.

  • @waltersumofan
    @waltersumofan Місяць тому +14

    all this energy to just not pay employees properly, it's crazy

    • @radektheplayer
      @radektheplayer Місяць тому

      True

    • @florian5670
      @florian5670 Місяць тому

      Unemployment will go up in the future and people will wonder why. Very few will get a lot richer, the masses will be poor. We're just really bad at thinking about the future and the consequences of what we do. Just look at how long we've already been knowing about climate change.

  • @romangeneral23
    @romangeneral23 Місяць тому +903

    It still can't count how many r's in strawberry.
    I think we good for a while...

    • @vasiovasio
      @vasiovasio Місяць тому +17

      I too hope the Sarcasm hold us above the water... at least for a week or too! 😂😂😂

    • @itsdakideli755
      @itsdakideli755 Місяць тому +43

      It can...

    • @deep.space.12
      @deep.space.12 Місяць тому +22

      more likely a limitation from how the tokenizer breaks the word down (i.e. it's not aware of individual characters), than something fundamentally wrong with the model itself.

    • @hypno5690
      @hypno5690 Місяць тому +30

      there are two r's in strawberry though. There are also three r's and one r.

    • @jimmydesouza4375
      @jimmydesouza4375 Місяць тому +2

      How many r's are there in strawberrry though?

  • @midicine2114
    @midicine2114 Місяць тому +854

    Fuck it, I’m becoming a plumber.
    I’m also tired of these “snake game” examples. It’s just a glorified google at that point. Tons of snake examples on the web.

    • @dsfs17987
      @dsfs17987 Місяць тому

      and they mostly suck, which is what this "ai" is using to teach itself, garbage in - garbage out

    • @nuvotion-live
      @nuvotion-live Місяць тому +50

      I laughed out loud at these coding demos

    • @univera1111
      @univera1111 Місяць тому +10

      Iv already given up on programming. And just on how to use already created softwares.😢😢😢

    • @SMGA14
      @SMGA14 Місяць тому +43

      Buddy, the robots will be the plumbers, no job is safe plus you're not guaranteed to be a plumber since the workforce will be saturated from all the people that lost their jobs turning into plumbers

    • @GeneralKenobi69420
      @GeneralKenobi69420 Місяць тому +49

      @@SMGA14 Nah, robots are California tech bro copium. Trade jobs are mostly safe for the next 20 years

  • @bengrzybowski2487
    @bengrzybowski2487 Місяць тому +2137

    I've been seeing people freaking out about this new model, "it's better than PHD humans at X,Y,Z!" where X,Y,Z basically amounts to data processing... like oh my god??? A computer can process data faster than a person???? WHAT???? lmao

    • @The-Singularity-X01
      @The-Singularity-X01 Місяць тому

      Literally any modern computer can process data 'faster' than a human brain. Because a human brain is doing a whole bunch of shit at once in ADDITION to that data processing, while a computer does far less at any one time simply maintaining its 'active' state and therefor has more processing power to allocate for useful computation.

    • @deividfost
      @deividfost Місяць тому +287

      Not surprising, since most people hyping AI have no idea what a PhD actually is.

    • @rosco3
      @rosco3 Місяць тому +222

      "It can beat programmers in olympics" Yeah if given unlimited amount of submissions, those same issues that are either ENTIRELY on the web or every single concept is on the web already, most of those olympics are for undergrad students

    • @tambal40
      @tambal40 Місяць тому +47

      ​@@deividfostit doesn't matter it's evolving fast in 10 years it will be better than humans at everything EVERYTHING

    • @eagleeagle7360
      @eagleeagle7360 Місяць тому +32

      Exactly, it's as if one were trying to compete with the calculator hahahahahahaha

  • @ThisIsNotAUsername-v3o
    @ThisIsNotAUsername-v3o Місяць тому +67

    0:19 - it is now 100% proven that English is the hardest subject.

    • @ThisIsNotAUsername-v3o
      @ThisIsNotAUsername-v3o Місяць тому +7

      Also this is O(#); that is, the number of prompts until an AI that can't count letters properly thinks its answer is correct.

  • @johnsmith1953x
    @johnsmith1953x Місяць тому +46

    *How many 'r' characters are in the word "strawberry" ?*
    GPT-4 : TWO!!
    GPT-o1: "I have the answer for realsies, but it'll cost you $2,000"

    • @kindlin
      @kindlin Місяць тому +2

      Strawbery obviously has 2 R's, idk what all the hubbub is about....

    • @sirflimflam
      @sirflimflam Місяць тому

      @@kindlin just trolls

  • @ThreefieldsMedia
    @ThreefieldsMedia Місяць тому +225

    Hearing a slight raspiness in Fireship's voice is a subtle reminder that it is not AI-generated yet.

    • @diegogarcia.57
      @diegogarcia.57 Місяць тому +1

      Didn't someone else close his voice and he said that he didn't minded?

    • @unholycrusader69
      @unholycrusader69 Місяць тому

      *Yet.*

    • @w花b
      @w花b Місяць тому +4

      Or maybe that's a sign this video was... For the first time

    • @Ainigma
      @Ainigma Місяць тому +17

      prompt: add raspiness, increase by 15.000%

    • @o1-preview
      @o1-preview Місяць тому +2

      fireship cloned his own voice waaaay back when he had very few subs and used it for a couple of vids

  • @TonyCecala
    @TonyCecala Місяць тому +894

    They may replace PhDs. But never will they approach your PhD in sarcasm.

    • @nerlind
      @nerlind Місяць тому +58

      If I have learned anything...everything is a few models away

    • @mananshah3248
      @mananshah3248 Місяць тому +6

      Try prompting it to write the office starting scene.

    • @soulsmith4787
      @soulsmith4787 Місяць тому +7

      Have you seen Neuro on Twitch? That little AI is the master of sarcasm. It's so strong that you can even tell despite the monotone tts.

    • @alevyts3523
      @alevyts3523 Місяць тому +3

      They can replace PhDs. In the sense that they can answer standard questions that a PhD can answer in theory.

    • @CyanRooper
      @CyanRooper Місяць тому

      ​@@soulsmith4787 you mean that AI loli Vtuber that sings songs like Bury the Light and Never Gonna Give You Up?

  • @TheGrandChelem
    @TheGrandChelem Місяць тому +459

    Is it just me who feels so sad that words are disappearing from the internet ? In this video, the word drug is censored just to please an algorithm. The other day I even saw someone who censored the word hate in «she hates being called wifey» smh

    • @tacitozetticci9308
      @tacitozetticci9308 Місяць тому +128

      You're lucky the word "wifey" survived. Gotta cherish what we have.

    • @hypno5690
      @hypno5690 Місяць тому +202

      even scarier, we are now using words like "unalive" in real life which stems directly from online advertising censorship. Corpo speak

    • @turolretar
      @turolretar Місяць тому +44

      *t’s n*t j*st y*o b*d 😢

    • @livinghuman2298
      @livinghuman2298 Місяць тому +61

      The other day i replied to a comment with 100% innocent sentence, no reason to censor it, yet it was deleted, soon we won't be able to say anything.

    • @khhnator
      @khhnator Місяць тому +5

      that's just how language works. internet is not being special here

  • @4RILDIGITAL
    @4RILDIGITAL Місяць тому +88

    The potential of AI is indeed vast yet it falls short at times. In the end, it's a tool, at least for now.

  • @DETahaX
    @DETahaX Місяць тому +16

    "officer hardass" kills me every time with that picture 😭😭

  • @cryptaveli
    @cryptaveli Місяць тому +212

    They took our jerbs!

    • @Douchebagus
      @Douchebagus Місяць тому +43

      They Turk are Durrr

    • @robcz3926
      @robcz3926 Місяць тому +14

      took yer durr!!!

    • @JonathanHelvey
      @JonathanHelvey Місяць тому

      Tuk yer jerbs !!!!!

    • @aarushsaboo1194
      @aarushsaboo1194 Місяць тому +10

      Yarrrrr haarrrr

    • @zoeherriot
      @zoeherriot Місяць тому +1

      Make no mistake, they need that to happen to pay for the billions they’ve sunk into training these models. (It won’t work though).

  • @wayne8797
    @wayne8797 Місяць тому +135

    Very true. All these ai models look amazing but once you have used it for anything besides asking it rudimentary stuff then it falls apart very quickly.

    • @michaelnurse9089
      @michaelnurse9089 Місяць тому +8

      But each version pushes further up against the rudimentary limit. The first cars randomly exploded and had to have horses travelling behind to carry extra fuel.

    • @Pfennigfuchs-z7v
      @Pfennigfuchs-z7v Місяць тому +15

      @@michaelnurse9089You can’t equate past advances in some field with advances in a completely other one. Quite a few parameters are different. You can however try to formulate rules for technological advancements in general. Processes like these tend to follow a logistical curve and the question is at what point of the curve are we right now. I would argue we’re about to hit the plateau.

    • @Simonstoster
      @Simonstoster Місяць тому +7

      ​@@Pfennigfuchs-z7vAlso its just a confirmation bias. For every technological innovation there is a problem unsolved since decades

    • @Tozu25
      @Tozu25 Місяць тому +1

      @@michaelnurse9089 Many people are forgetting for some reason that its not only affecting developers. AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society.
      Why should anyone be excited and be joking? Now this is what’s should be concerning, nothing else. We are witnessing the start of something really bad.

    • @YaamFel
      @YaamFel Місяць тому +5

      ​@@Tozu25You have a fundamental misunderstanding of how LLMs work if you think they could ever replace engineers and doctors.

  • @evanseka4054
    @evanseka4054 Місяць тому +166

    "A car won't take your job, but another horse driving a car will." That hit way harder than it needed to.

    • @jamaludeenameen5361
      @jamaludeenameen5361 Місяць тому +5

      I dont understand it, please explain

    • @arxzhh
      @arxzhh Місяць тому +30

      @@jamaludeenameen5361this new technology won’t take your job, but someone who knows how to use that technology will, not the machines itself.

    • @VitorCosta-n2m
      @VitorCosta-n2m Місяць тому +5

      @HessW No, wronger, it's even deeper. The car with his horsepower would bestow the horse, revealing a zero sum. Which after would divide the AI capability of coding.

    • @RedactedBrainwaves2
      @RedactedBrainwaves2 Місяць тому +6

      No worries guys. Afghanistan still has a big market for horses.

    • @andrelgpinheiro
      @andrelgpinheiro Місяць тому +2

      @@jamaludeenameen5361
      The phrase "A car won’t take your job, another horse driving a car will" can be interpreted to mean that technology (like AI or cars) on its own doesn't inherently replace humans or living creatures in a direct way. Horses can't drive cars, just like AI can't independently replace the complex, nuanced roles humans perform. Instead, it's humans who use AI or other technologies effectively that change the job landscape.
      In the context of AI, this means that AI alone isn’t going to take jobs. It doesn’t have the inherent ability to think, adapt, or make decisions like humans can. Instead, humans who adapt and incorporate AI into their work will have the advantage. They’ll be the ones who change industries, outperform their peers, and potentially replace those who don’t evolve with the times.
      The point is that AI, like a car, is just a tool. It requires a driver-someone capable of steering it effectively. The future of jobs won’t be one where AI takes over, but one where people who master AI technology will reshape industries, and those who don’t learn to "drive" will be left behind.
      In essence: AI won’t replace humans because it isn’t natural for it to perform human roles. But humans who learn how to harness AI will redefine how those roles are performed, much like a person who learned to drive a car left behind those relying on horses for transportation.

  • @christopherchilton-smith6482
    @christopherchilton-smith6482 Місяць тому +22

    I've been having a blast with it. I used gpt4 to setup the bare bones of a mud-like text game, I've got a compass in every room showing the direction of exits, inventory, can equip and unequip items, drop items from inventory, pick them up, place monsters, really simple combat (saving the in depth stuff for later) but what I couldn't do with gpt4 or gpt4o was make a top down map that shows all the rooms and their connections in relation to each other just using unicode characters. No matter how I tried to break the problem down and describe it I just couldn't get useful code.
    o1 produced the code and put in a legend. I'm talking with it about branching dialogue solutions and think it may be able to help me import TWINE exports as json as a solution for doing branching dialogue.
    I litteraly could never have done any of this without these tools, I'm in love.

    • @Demoralized88
      @Demoralized88 Місяць тому

      you by chance a former or current dragonrealms player?

    • @christopherchilton-smith6482
      @christopherchilton-smith6482 Місяць тому

      @@Demoralized88 I played Gemstone IV briefly years ago, I don't think I ever gave dregonrealms a try, may have to rectify that. I mostly played around in the infinite supply of mediocre MUDs searching mud connector and similar listing sites.

  • @Ashash9877
    @Ashash9877 Місяць тому +3349

    Call me when it can become a professional poker player or blackjack counter so I can make millions at Stake, or how about a pro stock trader or something? Why has no one used openAI for this yet? In the future OpenAI might run entire countries GDP systems💀 Welcome our overlords.

    • @HockeyMan666
      @HockeyMan666 Місяць тому +80

      LOL that probably exist already but you cant rly share that with the public can u?? use ur brain

    • @peyopeev8909
      @peyopeev8909 Місяць тому +61

      1.4k likes and nobody has mentioned that AI has been and it's used for both atm, you are for a wild ride pretty soon 😵‍💫

    • @bozydargroch9779
      @bozydargroch9779 Місяць тому +13

      @@peyopeev8909 yep. Botted likes?

    • @amaiaa8815
      @amaiaa8815 Місяць тому +3

      Been there done that

    • @TheBcoolGuy
      @TheBcoolGuy Місяць тому +37

      "GDP systems"

  • @veenmikki27
    @veenmikki27 Місяць тому +339

    I used to be hopeful that AI could help me out a little through school but if this stuff’s already doing phd level physics I might not have school to finish

    • @Tmssef
      @Tmssef Місяць тому +15

      Atm there is no point in studying.

    • @ryzikx
      @ryzikx Місяць тому +173

      calculators can do arithmetic better than any humans why learn math ?

    • @paegr
      @paegr Місяць тому +1

      @@ryzikx Now the calculator can automatically do every job on Earth at 100 times the speed you can for 1/1000th of the cost, so you have no reason to be alive according to Capitalism

    • @MintBunHunter
      @MintBunHunter Місяць тому +7

      @@ryzikx its cool

    • @oioio-yb9dw
      @oioio-yb9dw Місяць тому +19

      ​@ryzikx because then the AI realises you are stupid and it will tell you that 2 + 2 = 5 and so on, you will end up becoming it's dog.

  • @GSBarlev
    @GSBarlev Місяць тому +69

    This is a huge leap forward in Sam Altman's ability to separate AI bros from their trust funds and crypto hodlings.

    • @Tozu25
      @Tozu25 Місяць тому +3

      Many people are forgetting for some reason that its not only affecting developers. AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society.
      Why should anyone be excited and be joking? Now this is what’s should be concerning, nothing else. We are witnessing the start of something really bad.

    • @spaghettiking653
      @spaghettiking653 Місяць тому +2

      ​@@Tozu25I'm not sure whether this will really replace doctors and stuff like that. Being a surgeon or dentist requires very fine motor control, extremely reliable expertise and knowledge, accountability, personality, etc., so as to not make a single mistake and to always navigate the patient's ill state perfectly. AIs and robots, which at this stage are far from known for their rigid foundations in any of these things, definitely have no ability to take any of these jobs. Moreover, if we really do eventually "solve" jobs, so that no one ever needs to work again, then we can rejoice at the fact that no one will be required to toil again. Things like UBI will become possible. The real doomsday scenario is if AI only succeeds in taking creative and artistic jobs, leaving humanity to do all the dead, manual labour. That is what I fear, not that doctors or actual trained professionals will be replaced.

    • @Tozu25
      @Tozu25 Місяць тому

      @@spaghettiking653 I was diagnosed by an AI chatbot when I got my paid sick leave. I told the AI my symptoms, and got questions and then a real doctor signed the digital document and left. So it's already happening. Similar to anything, the AI does the task and then someone checks the result. But it's good that you are critical about AI, and looking both ways. You are the first one out of anyone, and I've spoken to like 15 people. That tells about intelligence, in you.

    • @danielrodrigues4903
      @danielrodrigues4903 Місяць тому

      ​@@Tozu25 No, mass unemployment = new economic system and a break from the relentless capitalism dystopia we're experiencing. In big cities like London, regular new graduates can't even afford to buy houses on good salaries. The system is bullshit and needs to be torn down.

    • @Dorian-y3v
      @Dorian-y3v Місяць тому +1

      Cope.

  • @codeaperture
    @codeaperture Місяць тому +200

    Ah! 0 days since AI again?

    • @douwemusic
      @douwemusic Місяць тому +6

      Spoiler alert-this will happen every time Fireship uploads about AI

    • @Tozu25
      @Tozu25 Місяць тому +4

      Many people are forgetting for some reason that its not only affecting developers. AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society.
      Why should anyone be excited and be joking? Now this is what’s should be concerning, nothing else. We are witnessing the start of something really bad.

    • @mr.nixtheboarddrawer1175
      @mr.nixtheboarddrawer1175 Місяць тому

      ​@@Tozu25 people don't want to work thats why

    • @Tozu25
      @Tozu25 Місяць тому

      @@mr.nixtheboarddrawer1175 Well, the possible future products made by AI are not gonna be handed for free to you, unless society becomes socialist, and I don’t think that’s any more good.

    • @JimmyKrochmalska-f7p
      @JimmyKrochmalska-f7p Місяць тому

      @@Tozu25 None of that is going to happen. I wouldn't trust AI to be doing heart surgery even in 1,000 years, AI is AI, it's all guesswork, I would be more scared of *computers and simulations, as they actually involve math and physics, while AI just involves numbers multiplied by numbers multiplied by more numbers that eventually have an error that's small enough that works "good enough"* imagine that as your doctor, a doctor that MAYBE quite POSSIBLY will do the job right, also, you really think everyone's gonna lose their jobs in one night? Have you considered *us humans wanting the same thing as you, a livable society and preventing any of this happening, finding a solution, doing anything to make it all work out?*
      tl;dr AI is guesswork and we should worry more about nukes and simulations as simulations actually get math right (AI cannot make complex simulations because AI will get this line wrong or get this number slightly off)

  • @MacCrunch
    @MacCrunch Місяць тому +85

    The improvements are impressive, but there's still a lot to uncover about the true impact and capabilities of these models.

    • @Tozu25
      @Tozu25 Місяць тому +4

      Many people are forgetting for some reason is that AI will also take doctors, engineers, architechts, creators, actors, editors, pretty much everyones jobs. It will be mass unemployment = no livable society.
      Why should anyone be excited? We are witnessing the start of something really bad.

    • @RokeJulianLockhart.s13ouq
      @RokeJulianLockhart.s13ouq Місяць тому +4

      ​@@Tozu25 I disagree.

    • @Tozu25
      @Tozu25 Місяць тому

      @@RokeJulianLockhart.s13ouq Well, if an AI someday gets created which is equally as smart and conscious as a human, if not more, of course they can replace those jobs I mentioned as well.
      Edit: Before you mention, I know there is no such thing yet as a conscious AI and hopefully never will be. The speed of change in society would be so quick that it would mean hard times worldwide.

    • @RokeJulianLockhart.s13ouq
      @RokeJulianLockhart.s13ouq Місяць тому +1

      @@Tozu25 LLMs are search engines, like Google is. They're nothing more than correlators. They're not a form of intelligence, as their confident incorrectness when they get stuck in recursive loops demonstrates.

    • @mrX666-s9p
      @mrX666-s9p Місяць тому

      @@Tozu25 It is used as a tool stop being dumb you need human interaction even in programming it's not like I would give full access to an AI model to my business.

  • @kumarapillay3122
    @kumarapillay3122 Місяць тому +14

    before gpt used to be bad at doing even basic force questions. But to o1, i gave my fluid mechanics problem and it was able to do it and i didn't even upload the diagram pictures. Its gotten really good now

  • @joshroberts8944
    @joshroberts8944 Місяць тому +921

    This is concerning, it took the AI over 10,000 attempts with access to every relevant example on the internet during a contest to get gold lmao

    • @maxave7448
      @maxave7448 Місяць тому +284

      It basically tried everything until somwhing worked lol

    • @StickzDev
      @StickzDev Місяць тому +156

      Like dr strange searching through every possibility to win against Thanos

    • @J-Kimble
      @J-Kimble Місяць тому

      @@maxave7448 We're getting better at making software that throws sh*t on the wall and sees what sticks. Also known in the human world as a sh*tty programmer.

    • @genpotrait2274
      @genpotrait2274 Місяць тому +46

      Its not about those 10000 attempts, but how long it takes.

    • @fernandoacostaylara2586
      @fernandoacostaylara2586 Місяць тому +87

      @@genpotrait2274 Not really, its not viable to run 10000 attempts. In reality it won't know which scenario is the correct one

  • @hglbrg
    @hglbrg Місяць тому +107

    OpenAI needs money, releases some reskinned GPT3.5 that asks "are you sure" secretly and send the response after that to the user to maintain hype, investor money and altmans job. Same bubble. Same hot (AI)r.

    • @justanotherchannelname1273
      @justanotherchannelname1273 Місяць тому +23

      Yeah, this was plain dissapointing. I was expecting some major architectural change with all the hype around 'Q*' but this is just another chatbot except it's trained to ask itself 'are you sure about that?' a couple of times and provide long COTs with a fancy UI to hide the complexity from users who don't know how to prompt worth a dang.

    • @DavidJames-lz8js
      @DavidJames-lz8js Місяць тому +4

      (AI)r = Air. I see what you did there 😏

    • @gramioerie_xi133
      @gramioerie_xi133 Місяць тому

      @@justanotherchannelname1273How in the hell is consistently beating human experts in several abstract fields not impressive to you

    • @indigitalcreativity4500
      @indigitalcreativity4500 Місяць тому +1

      ​@justano so what you expect from new AI, ?

    • @LttlKnwnCompBehindWriteAssist
      @LttlKnwnCompBehindWriteAssist Місяць тому

      Altman write a for loop on chatgpt UI

  • @cbn1362
    @cbn1362 Місяць тому +33

    It amazes me every time how I think about this channel was all about angular and firebase back in the days and where it is now.

    • @crackwitz
      @crackwitz Місяць тому +1

      That's like a startup pivoting when they discover what the customers really need

    • @complexity5545
      @complexity5545 Місяць тому

      Both, angular and firebase, are currently being re-obsoleted (by react, htmx, and svelte (or some combination)). Firebase has been dead about 8 years after it was born. Most wise programmers never used Firebase.

    • @LttlKnwnCompBehindWriteAssist
      @LttlKnwnCompBehindWriteAssist Місяць тому

      That's what i stumbled across. A channel supposed to be firebase documentation is doing all crazy stuff in th name of firebase. How could that be. Thank you now I get it.

    • @gnarpow
      @gnarpow Місяць тому

      No kidding! lol

  • @Jackson_Zheng
    @Jackson_Zheng Місяць тому +17

    0:25 Man, that clip was perfect lol

  • @sandeepnautiyal3070
    @sandeepnautiyal3070 Місяць тому +4

    "And O stands for ohh sh*t we are gonna d*e" is so apt and hilarious lmao

  • @MustafaETKER
    @MustafaETKER Місяць тому +11

    How can be someone so funny and so informative at the same time in just 5 minutes

    • @turolretar
      @turolretar Місяць тому

      Something, not someone

    • @diegogarcia.57
      @diegogarcia.57 Місяць тому

      Humans are the original AI

    • @tzardelasuerte
      @tzardelasuerte Місяць тому

      And so biased. No our jobs are never going away!!!! 😡😡😡😭😭😭

    • @MustafaETKER
      @MustafaETKER Місяць тому

      @@turolretar wdym

    • @Dorian-y3v
      @Dorian-y3v Місяць тому

      He's from 4chan, that's why.

  • @DEUTSCHWULF
    @DEUTSCHWULF Місяць тому +84

    By the time I finish writing this comment, this model will already be outdated.

  • @yo-no9879
    @yo-no9879 Місяць тому +38

    1:19 good to see o1 is struggling big time with chemistry, gonna make a lot of chemists happy.

    • @lanceb9065
      @lanceb9065 Місяць тому

      I’ll be the 25th Chemist to give that a thumbs up 👍

  • @albercode9562
    @albercode9562 Місяць тому +6

    My only concern is that AI goes full apocalypse mode after spending 2 days with my manager

  • @MINIMAN10000
    @MINIMAN10000 Місяць тому +6

    To me the worst part is that it fails to strawberry test. For something that is a recursive self prompter, it sucks at prompting because constructing a proper prompt is literally the easiest way to pass the test.

  • @mirrorsreflectyou
    @mirrorsreflectyou Місяць тому +128

    But can this center a div?

    • @theterribleanimator1793
      @theterribleanimator1793 Місяць тому +62

      not yet. It can plagiarize the code for a snake game though.

    • @friendlyfox2189
      @friendlyfox2189 Місяць тому +2

      😂

    • @livinghuman2298
      @livinghuman2298 Місяць тому +1

      Cursor can, i think?

    • @CyanRooper
      @CyanRooper Місяць тому +1

      But can it do this?
      *bends chair backwards*

    • @gramioerie_xi133
      @gramioerie_xi133 Місяць тому +3

      @@theterribleanimator1793 Why do you people always accuse it of ‘plagiarism’ like that even makes any sense

  • @tamizharasanbe
    @tamizharasanbe Місяць тому +19

    "A car won't take your job, but a horse driving a car will" .... damn!!!! deeeeeeeeeeeppp

  • @imsleepy620
    @imsleepy620 Місяць тому +6

    Fireship's definitely my favorite horse influencer

    • @Adambd99
      @Adambd99 Місяць тому

      most based comment ever

  • @ArifBillahOnGoogle
    @ArifBillahOnGoogle Місяць тому +1

    Hi Jeff, I'm writing this comment to delightfully let you know that I absolutely like the way you do the "last kick" at the end of your videos sometimes. Beautifully crafted kick! Thanks. ❤

  • @nejiabdurrahmen
    @nejiabdurrahmen Місяць тому +2

    3:23 you can really feel the frustration, amazing

  • @tabiserebour5912
    @tabiserebour5912 Місяць тому +15

    Whenever i see your video notifications, i start laughing even before watching the video😂

  • @sentinelav
    @sentinelav Місяць тому +11

    I expected something crazy, but when I saw the benchmarks, they're really not that groundbreaking.
    o1's reasoning token paradigm serves as a middle layer for handling complex instructions, so it's more internally organised, but that doesn't necessarily mean the underlying architecture has substantially improved.
    Coding, maths and science are all topics where handling information in a purely linguistic context by default is detrimental, so it naturally follows that it would be more effective to logically deconstruct problems. However, you might see similar improvements with any other LLM by manually creating an intermediary prompting stage.
    This is still an improvement, but remember, a significant leap ahead at this stage would mean something as groundbreaking to transformers, as transformers were to RNNs, and this is nowhere close.
    Make no mistake, this is part of the plateau. There will still be progress, and we should be looking to concentrate that towards building tools to aid developers, rather an attempt to replace them.

    • @danielrodrigues4903
      @danielrodrigues4903 Місяць тому

      We should be aiming to replace everyone. Always aim high.

  • @DeusExRequiem
    @DeusExRequiem Місяць тому +45

    2:08 the reason many people are moving over to Claude is because Claude isn't censored and is more useful for things like generating erotic content and conversations that don't sound like you're talking to HR, which is all that the majority of people care about. The o1 model is going to be great for jobs, it's a little more reliable for perfect answers, but the problem remains that corporations want something that's specifically useful and not generally useful, a lot of them have internal systems and custom setups that don't generalize, and they worry about data leaks, and would prefer the ability to run all of this in-house. The majority of AI users are fine with some generalization, can't afford to run the best ones in-house, and want it uncensored. Unless Microsoft can stay ahead, people will move on the moment something almost as good comes out that isn't censored, and Microsoft will be stuck catering to corporations who have demands.

    • @JanVerny
      @JanVerny Місяць тому +25

      You're thinking about this all wrong. Consumer software is not where the money is at. Most profitable MS divisions are all centered around business products. They obviously want to sell AI to the business first and foremost. If you thought MS expects regular consumers to buy the Copilot+ computers, you're dead wrong. They don't care if literally no one buys it. Because business will eat that shit up. And big companies will pay insane money to get as you say their own specialized AI solutions. While things like Claude, will struggle to finance anything after they run out of venture capital.

    • @esarmiento7
      @esarmiento7 Місяць тому

      I just asked Claude for erotic content and he treated me like a pervert

    • @hastyscorpion
      @hastyscorpion Місяць тому +26

      You think the reason most people use Claude is for “ erotic content” ? Dude you need to go outside and talk to actual humans more

    • @nousquest
      @nousquest Місяць тому +15

      Claude is much more censored. I can't get it to help me with the CTFs in my ethical hacking course.

    • @freeottis
      @freeottis Місяць тому +6

      In my experience Claude censors more. I tried asking it a question about what a stolen vehicle could be used for (a screenshot from a driver’s license exam) and it said nope. Chatgpt answered it.

  • @toadlguy
    @toadlguy Місяць тому +1

    The is the best overview of o1 I have seen yet 😊😊😊

  • @nawawishkid
    @nawawishkid Місяць тому +12

    3:17 I've just tried asking the o1-preview model `How many "r" in the word strawberry?`, it answered 3 "r"s correctly at first try. Then in the same chat, I switched to 4o model, it said 2. 🤷 Then switched back to o1-preview, it even apologized for the mistake in the previous answer made by 4o. Pretty smart to me. 🎉

    • @rumfordc
      @rumfordc Місяць тому +5

      then you're not very smart

  • @SpragginsDesigns
    @SpragginsDesigns Місяць тому +81

    0:23 was a legit lol moment. Oh wait, so was most of the video.

    • @SkegAudio
      @SkegAudio Місяць тому +1

      came here to take a break from coursework, that avocado bit had laughing way too loud for a library 😂

    • @SpragginsDesigns
      @SpragginsDesigns Місяць тому +3

      @@SkegAudio Nobody does the developer / comedy / memes / but still informative style he has. He's one of those "never miss a video" channels I have to watch on the spot.

  • @turolretar
    @turolretar Місяць тому +7

    A horse walks into a bar. The bartender asks - why the long face?

  • @hendrx
    @hendrx Місяць тому +27

    Remember guys, we nerfed o1 when the hype was over, but o2 is gonna make a killing

    • @otpezdal
      @otpezdal Місяць тому +1

      Please, write the same statement but for o3 in the future

    • @JimmyKrochmalska-f7p
      @JimmyKrochmalska-f7p Місяць тому +1

      @@otpezdal Don't worry guys, o8 was a flop but o9 is gonna beat us all

  • @Radblur
    @Radblur Місяць тому

    Like o1, Fireship's video production value and depth are getting better and better.

  • @HarryEdwards-zk6ok
    @HarryEdwards-zk6ok Місяць тому

    Thanks for updating us!

  • @98ahni
    @98ahni Місяць тому +8

    As long as it can't solve the _"Okay, so hear me out."_ problems the client has with all the help of _"I'm sure you'll figure it out!"_ and (of course) no further details, I think my job is pretty safe.

  • @bennythetiger6052
    @bennythetiger6052 Місяць тому +7

    I love how, by this point, people should've already realized they shouldn't freak out when new AI DLC drops, yet it all follows the same hype trend. They keep being like "oh, but this time it's for real", but until we see a real and fair example of it actually doing all these revolutionary things, it's illogical to assume things will be any different. It's not copium, it's just a matter of proof of concept

  • @naishiuan1
    @naishiuan1 Місяць тому +37

    o1ways 2 steps ahead!

  • @royaldgaming3485
    @royaldgaming3485 Місяць тому +1

    I love masking my fear of getting replaced by AI with videos about the progression of AI

  • @deleted-something
    @deleted-something Місяць тому +4

    Okay but at this rate the next model with need .05% of the worlds energy to solve a question

    • @JimmyKrochmalska-f7p
      @JimmyKrochmalska-f7p Місяць тому

      Probably not because at that point none of the models would be even public even to anyone, .05% is a lot, but yeah it is getting pretty resource dependent

  • @existenceisillusion6528
    @existenceisillusion6528 Місяць тому +13

    The core innovation driving o1 was made public about 6 months ago. And it really works, but we still have a long way to go. I tried it on 2 challenging problems, and it almost didn't suck.

  • @jonwinder6622
    @jonwinder6622 Місяць тому +6

    I almost want fireship to stop posting. This channel is scaring the shit out of me and my career. This is fucking nuts

    • @hiya2793
      @hiya2793 Місяць тому +6

      why, is your job creating tiny 300 line snake games?

    • @evolgenius1150
      @evolgenius1150 Місяць тому

      Just use it. AI has become an amazing pair programmer and conversational wiki page. I like bouncing logic off of it and getting its feedback, and its ability to answer questions id normally send to stack exchange.
      If anything is loosing its job it will be stack exchange 😂
      Just leverage the tool already.
      What it will make obsolete are low level junior programmers which no ai skills because ai fills in skill gaps. Junior devs will be expected to do more, and senior devs will be expected to do more.
      If anything AI will just make it so our jobs demand more of us, we’ll be expected faster turn around times or to ship twice as much code.

    • @jonwinder6622
      @jonwinder6622 Місяць тому

      @@hiya2793 Oh if only that were all it could do.

    • @hiya2793
      @hiya2793 Місяць тому

      @@jonwinder6622 I mean-
      Good luck throwing a 10.000 line project at chatgpt.
      As a matter of fact-
      Go and create a simple 2000 line vite project, with let's keep it small and simple and say 10 scripts vanilla js, a simple small game on an html canvas.
      No AI in the world comes even close to having enough tokens to even just read through that small af project-
      Let alone provide good additional code that doesn't suck aboslute balls without spending hours proompting - at which point you may aswell just write it yourself.
      AI is cool for stuff like: "How did flexbox go again? i'm too lazy to google, ai do it"
      or "ah crap i forgot the syntax for a switch case in some niche language - ai, you do it"

    • @divinecreation6
      @divinecreation6 Місяць тому

      Lmao dude your comment got copied and stolen by a bot.

  • @kili20394
    @kili20394 Місяць тому +51

    As a coder and developer, I have no fear of "LLMs" taking my job. A lot of the stuff I code is too specific and niche for an LLM to figure out without having hella bugs.

    • @sajeucettefoistunevaspasme
      @sajeucettefoistunevaspasme Місяць тому +14

      as a 0.1x developper I am very afraid

    • @djs-vids
      @djs-vids Місяць тому +1

      agreed, same

    • @fullstackweebdev
      @fullstackweebdev Місяць тому +25

      To replace me, the customer would need to know what they want and accurately describe it to an AI.
      I’m perfectly safe.

    • @djs-vids
      @djs-vids Місяць тому

      @@fullstackweebdev and then be able to debug the trashy code AI produces

    • @byron_00
      @byron_00 Місяць тому +4

      @@fullstackweebdev well said. I push back on the garbage requirements I receive and help point the customer in the right direction for something more sane. A.I. will happily write a clucking fsck.

  • @NeonVisual
    @NeonVisual Місяць тому +4

    When we eventually get AGI it will be so expensive to run that we will only be able to turn it on for a fraction of a second to resolve all of humanity's problems. It will then take 10 years to work through all of the data created.

  • @marked75
    @marked75 Місяць тому +1

    based on what you said, I think this confirms that they are now at the phase where they're doing clever implementations of the LLMs and being more specific in what it should generate well. In my opinion this is a sign that the technology is maturing, and the real potentially world changing products are coming. But it may also be a sign that this technology is at it's peak, when you can't go up, you go side ways

  • @strategistaow3520
    @strategistaow3520 Місяць тому +43

    If ai can replace programmers, it can replace anyone

    • @szymoniak75
      @szymoniak75 Місяць тому

      yup!

    • @CyanRooper
      @CyanRooper Місяць тому +2

      Spy from TF2: "It could replace you, it could replace me. It could even replace..."

    • @sajeucettefoistunevaspasme
      @sajeucettefoistunevaspasme Місяць тому

      @@CyanRooper "it could even be your mother !"I haven't seen it for a while

    • @reinhardt_tv
      @reinhardt_tv Місяць тому +7

      Sadly, we don't leave in fantasy world and this thing will be massively disappointing

    • @ineeddaname2
      @ineeddaname2 Місяць тому +12

      Not really. Coding has tons of sample data to train on.
      There's tons of obscure roles or tasks in the business world that could be replicated if the right training data was available but it isn't since it's only in some guys head

  • @notKhalid
    @notKhalid Місяць тому +79

    to be clear, o1 are not actually new models themselves, they're built on top of gpt-4o models with extended inference abilities.

    • @sashub2593
      @sashub2593 Місяць тому +8

      well, what you just said is
      quite obvious because if we think about it, no company is going to redesign the entire algorithm again to come up with a new model.

    • @tzardelasuerte
      @tzardelasuerte Місяць тому

      Correcto. Now in a few months gpt5 is coming out with all these advancements.

    • @David-gu8hv
      @David-gu8hv Місяць тому

      Doesn't it use feed back now? Adding "one little change" can have profound effects...

  • @genzod-i6e
    @genzod-i6e Місяць тому +12

    In those competitions were they using new challenges or old ones that the AI might have gone through during training?

    • @flarebear5346
      @flarebear5346 Місяць тому +6

      They were using old ones lmao

    • @veloce5491
      @veloce5491 Місяць тому +3

      this is always my question but the answer is always hard to find. where would they even get all these completely original coding questions to test these models on?

    • @MindBlowerWTF
      @MindBlowerWTF Місяць тому

      @@veloce5491 for GPT 4 they published a paper and they show the result for both. Can't find the paper on this model, but didn't look that hard.

  • @UnbanMeNowOfficial
    @UnbanMeNowOfficial Місяць тому

    The balance between the potential and the realistic expectations is much needed in these discussions.

  • @gavin3405
    @gavin3405 Місяць тому +2

    It did not fail the "r"s in "strawberry" challenge. You did not ask how many "r" letters are in "strawberry". You merely stated that there are two "r"s in the word "strawberry" (which is correct. There are two "r"s in "strawberry") and the model agreed with you. There is an "r" in the "strawberry", there are two "r"s in "strawberry", there are three "r"s in "strawberry". All three are correct statements. Saying there are ONLY two "r"s in "strawberry" would be incorrect.

  • @user-sb5vt8iy5q
    @user-sb5vt8iy5q Місяць тому +11

    Ok so when will they replace HR?

  • @ryzikx
    @ryzikx Місяць тому +37

    didnt expect nikocado cameo on Jeff Fireship's channel!

  • @mrkingsquid20
    @mrkingsquid20 Місяць тому +16

    so cooked I'm watching this during comp sci class

    • @n-o-i-d
      @n-o-i-d Місяць тому +5

      Paying off student loans later while having no job sure sounds like a lot of fun

    • @hvr8463
      @hvr8463 Місяць тому +4

      Too late for a refund?

    • @notme3987
      @notme3987 Місяць тому +1

      Have faith brother, see this AI scare as a good thing.

    • @turolretar
      @turolretar Місяць тому

      What’s cooking? Where’s mine

    • @NoName-cd5ft
      @NoName-cd5ft Місяць тому

      Me too 😢. Does anyone have any suggestions about how to stay relevant.

  • @JasonStJohnRules
    @JasonStJohnRules Місяць тому +1

    I mean, as a professional dev, it seems to me that 74.2% of problems are the first 10% of time spent on a project the other 90% is the other 26.8% of issues, and we're still safe there. It's actually nice that AI will get us there quicker.

  • @indluk
    @indluk Місяць тому

    Learned something new today, that is I should not sip/drink coffee while watching Fireship's videos, oh my poor keyboard😢
    Great vid as always 👍

  • @aubreygonzalez4715
    @aubreygonzalez4715 Місяць тому +4

    0 days since we talked about AI

    • @Likemea
      @Likemea Місяць тому

      ai is overhyped

  • @BrandonAaskov
    @BrandonAaskov Місяць тому +39

    lol officer hardass with that image 😂

    • @officebatman9411
      @officebatman9411 Місяць тому

      whos that?

    • @therealkon_
      @therealkon_ Місяць тому +4

      @@officebatman9411 Officer Hardass

    • @ryzikx
      @ryzikx Місяць тому +7

      @@officebatman9411 someone who got fired for doing certain activities when she shouldnt have been

    • @JM-st1le
      @JM-st1le Місяць тому

      😂

    • @bigbigdog
      @bigbigdog Місяць тому +7

      @@ryzikx doing certain activities to the whole goddamn police dept

  • @PatrickHoodDaniel
    @PatrickHoodDaniel Місяць тому +4

    My prompt for the number of "r"s in the word "strawberry" got it right.

    • @Scrubzei
      @Scrubzei Місяць тому +5

      Mine didn't

    • @PatrickHoodDaniel
      @PatrickHoodDaniel Місяць тому +1

      @@Scrubzei interesting.

    • @purplebuckwheat
      @purplebuckwheat Місяць тому +1

      Even GPT-4 legacy got that one right for me.

    • @rumfordc
      @rumfordc Місяць тому +4

      @@PatrickHoodDaniel LLM's don't give consistent answers because 1) they're rate limited and the amount of compute spent changes the answer and 2) they have a 'temperature' parameter which is effectively just RNG when selecting from the top token candidates 3) every single character you type is a completely new input so something as simple as leaving out a question mark will potentially get a different answer

  • @pkingo1
    @pkingo1 Місяць тому

    Experienced the same with coding, its initial output was impressive but I also hit that limit pretty quickly on what it could accomplish and it failed at certain tasks. So a marginal improvement from GPT-4o, which in itself is pretty impressive. Another huge leap in capabilities is still hard to imagine, but looking forward to it.

  • @nicknelson1975
    @nicknelson1975 Місяць тому +18

    It can build a game of Snake because there are thousands of open source examples online.

    • @w.mcnamara
      @w.mcnamara Місяць тому +4

      This a million times over lmfao. If only the people hyping ai through the moon knew even the most basic aspects of how llms work

    • @iraniansuperhacker4382
      @iraniansuperhacker4382 Місяць тому +6

      @@w.mcnamara I just remind them of how crazy their ideas are. I remind them that they are claiming that linear algebra and statistics have literally become living beings and can now reason like humans. The hype is just silly at this point, I just ask them how its possible literal math became conscious and I never get a reply back.

    • @micca971
      @micca971 Місяць тому +5

      @@iraniansuperhacker4382 Probably the same way a few neurons sending singals back and forth can become conscoius aka we don't know. We don't know what consciousness is, what do you need for that or how it comes to exist. Maybe even math can become conscious who knows. That said I'm not saying any AI is conscious or even that it will ever reach consciousness, just that we don't know if it is possible.

    • @iraniansuperhacker4382
      @iraniansuperhacker4382 Місяць тому +4

      @@micca971 I would go as far as to say that math being processed on a silicon chip becoming conscious is physically impossible no matter how complex of a system it is. This is like saying if we write a sufficiently advanced piece of literature it will eventually be able to think or reason in some way. It just fundamentally doesnt make any sense.

    • @micca971
      @micca971 Місяць тому

      @@iraniansuperhacker4382 that's not same at all, a piece of literature does not compute or process anything it does not receive and manipulate energy, therefore it cannot do aynthing on its own. If however you said a lot of monkeys were writing books, then possibly the entire collective of monkeys writing books (a lot of them, trillions or quadrillions at least or maybe more) can become conscious or at least exhibit intelligent behaviour as we see with the current AI. Aka it's not just about complexity, it's about manipulating energy and data using some logic. Also keep in mind this is all very hypothetical, but you can't say it is fundamentaly wrong. We just don't know.

  • @co3udatel
    @co3udatel Місяць тому +3

    3:32 Ну за фруктовый сад лайк однозначно

  • @AyushRaj-w4j
    @AyushRaj-w4j Місяць тому +3

    1:40 i never expected to wake up and see a bot having higher rating than me.

  • @rheavictor7
    @rheavictor7 Місяць тому

    OK, amazing video as always, but the "Officer Hardass" + the image you chose...
    Dead.
    10/10

  • @geovane19
    @geovane19 Місяць тому +2

    as long as LLMs can't translate the bizarre requirements from a actual client into a functioning product, we're good

  • @chr0ne692
    @chr0ne692 Місяць тому +6

    I am pretty sure GPT4 also prompts itself somewhat at least because I am remember one time it accidentally showed me it's internal prompting. It said something like "user wants to understand blah blah..." then abruptly switched to explaining what I wanted.

    • @Caphalem
      @Caphalem Місяць тому +3

      You are correct, this is something ChatGPT does. It basically tries to create a more sophisticated prompt out of your prompt before actually addressing it. However, what these new models essentially do is check their answer and try to sanity check themselves several times before giving you the final response.

    • @chr0ne692
      @chr0ne692 Місяць тому

      @@Caphalem I figured something like that. I just thought this distinction wasn't totally clear in the video, or maybe I wasn't paying enough attention. Thanks for the reply

  • @hahahano2796
    @hahahano2796 Місяць тому +10

    When given 10,000 chances it finds the one monkey who can write Shakespeare.

    • @rise9489
      @rise9489 Місяць тому +6

      Even a blind squirrel will eventually find a nut

    • @aabbvcddeeffaass6216
      @aabbvcddeeffaass6216 Місяць тому +1

      if the monkey can finish 10000 trys in 10 minutes, I don't see a problem.

  • @VeryUniqueRandomName
    @VeryUniqueRandomName Місяць тому +11

    74% might sound like a lot for a non-technical person, but for those who know what is SLA and how hard to go from 99.9 to 99.99, 74% is not even worth looking. Though I have doubts that LLM models will ever reach 99%

    • @peterhorton9063
      @peterhorton9063 Місяць тому +8

      Right being 95 percent accurate in your compute is terrible for most things. Imagine 1/20 words you speak and interpret wrong while not even knowing they were wrong. Errors would compound all over.

    • @egodreas
      @egodreas Місяць тому +6

      @@peterhorton9063 I'm sure it wouldn't be too bad. I suspect that most people would probably understand you just pineapple.

    • @Haise-san
      @Haise-san Місяць тому +3

      ​@@egodreasFor people yeah, but condoms for sure need it to be accurate for them to work and solve problems.

    • @Dorian-y3v
      @Dorian-y3v Місяць тому

      Lil' bro. Real human workers ain't pulling 99.99 success rate. What are you yapping about

  • @PS3PCDJ
    @PS3PCDJ Місяць тому +1

    It will take years for AI to plateau, sure the specific method like GPT might plateau, but not the field in general.
    We have barely started with this and I 100% believe that the improvements are going to be even faster and better now

  • @lafireteamplx3400
    @lafireteamplx3400 26 днів тому +1

    My guy, of fucking course it can solve leetcode problems, what you need is innovation

  • @beeronme7131
    @beeronme7131 Місяць тому +18

    No views, 15 comments. The boys are wild.

    • @chrisholland6366
      @chrisholland6366 Місяць тому +3

      I love eventual consistency

    • @rumfordc
      @rumfordc Місяць тому +2

      views are a thousand times more frequent than comments, so youtube counts them in batches slowly over time as opposed to comments which are just counted normally as they come in. So there is often a noticeable delay in the view count when the video is first uploaded. i believe that's what the comment above mine is referring to as well.

    • @beeronme7131
      @beeronme7131 Місяць тому +1

      @@rumfordc Thanks for explaining :) I knew how this works (Tom Scott's video about this is great).

  • @RedactedBrainwaves2
    @RedactedBrainwaves2 Місяць тому +9

    I'm amazed how we so easily trust a guy whose net worth depends on the public's perception of his own success.

  • @will_abule
    @will_abule Місяць тому +6

    But can it be monetised?

  • @aihtdikh
    @aihtdikh Місяць тому +1

    I used to love playing Dog Wars, back in the day! My favourite moment was a random-generated event with a random-generated price: some shady character in a dark alley offered to sell me a trench-coat with massively increased dog-carrying capacity, but he wouldn't accept any less than $0 for it.
    I wonder if this new AI would understand why that was hilarious?

  • @Im_Ninooo
    @Im_Ninooo Місяць тому +4

    4:06 ayo is that Bogdan? 😳

  • @cherubin7th
    @cherubin7th Місяць тому +28

    CoT is known to make GPT-4 much better and people were using it themselves already to improve it. The fact OpenAI has to resort to such well known improvement hacks is very sad.

    • @spookskxz
      @spookskxz Місяць тому +16

      that's what I was thinking, before they released gpt-4 somebody made "auto gpt" and that's basically what it did lol

    • @KeinNiemand
      @KeinNiemand Місяць тому +2

      yeah made it better by actually training the model to be better at chain of cought, all those other exisitng tools are external.

    • @gramioerie_xi133
      @gramioerie_xi133 Місяць тому +2

      Are you joking?????

    • @JavedAlam-ce4mu
      @JavedAlam-ce4mu Місяць тому +1

      @@KeinNiemand Not really, they just reprompt it continuously.

    • @KeinNiemand
      @KeinNiemand Місяць тому

      @@JavedAlam-ce4mu They did some reinforcment learning stuff.