Did OpenAI Just Create AGI with the o3 models?

Поділитися
Вставка
  • Опубліковано 27 гру 2024

КОМЕНТАРІ • 58

  • @timooothy1234
    @timooothy1234 7 днів тому +10

    The future is how creative you can be

  • @YahyaDrame
    @YahyaDrame 7 днів тому +5

    its just about creativity now

  • @DavidROliver
    @DavidROliver 7 днів тому +6

    o3 highly tuned, is around $1000 per command at the moment and this is because of hardware, now we will need to wait 6-18 months for this to come down as NVIDIA and other TPU/CPU manufacturers release products that are optimised to run these models.
    This is a very small window to get your skills sorted, so don't wait!

    • @chromotk1118
      @chromotk1118 6 днів тому

      What skill sets do you see evolving further or becoming obsolete as AI becomes more specialized?

    • @DavidROliver
      @DavidROliver 6 днів тому

      @@chromotk1118 AI is a means and not an end, someone has to type in the prompts and put systems together. Databases are still needed, integrations are still required, security and compliance are still important, so is accessibility and of course business continuity. So still lots of considerations before letting an AI lose on running a business. The Enterprise still needs a crew!

    • @vishalvs8576
      @vishalvs8576 5 днів тому

      @@DavidROliver How long do you think it would take for ai to spit out end-to-end business-grade software needs?

  • @maxziebell4013
    @maxziebell4013 7 днів тому +6

    Every model had a safety testing team. This time only they're opening it up also to third parties. The other models all had red teaming and safety testing. That is normal.

  • @maxziebell4013
    @maxziebell4013 7 днів тому +4

    Let's see, maybe we're going to have a Sora story again. They tease it, it takes ages to release, and in the meantime, Google overtakes them. I really hope that this doesn't occur, but we never know.

    • @Sekai420
      @Sekai420 7 днів тому

      Sora was only overtaken by Veo 2 because google has the access to the most videos and can curate the models to better filmography… they own UA-cam lol.
      The only other thing they’re better at is SEO and search engines, everything else ChatGPT got them beat

  • @couchtaming23
    @couchtaming23 7 днів тому +2

    Some argue that it still struggles with certain simple human tasks. However, I believe this only underscores the fact that its inner workings are fundamentally different from those of humans. These minor issues will not impede its immense potential to drive significant advancements in science and technology. I also believe that humanity may achieve immortality sooner than expected, as the development of artificial superintelligence is progressing far more rapidly than anticipated.

    • @BoominGame
      @BoominGame 7 днів тому +1

      Why are we obsessed with having the AI acting like a human, it does it's job brilliantly as point of fusion between a human brain and technology at large. It's fantastic to team with, otherwise what is the point in making them behave like humans.

    • @Ashleyreallylikesboatsbro
      @Ashleyreallylikesboatsbro 6 днів тому

      ​@@BoominGamewere not trying ti make it act like a human persay its more about making these systems general in nature and from what we can tell humans are pretty much the only other general intelligence apart from a few animals

  • @MrErick1160
    @MrErick1160 6 днів тому +1

    I can't believe how fast this is going to be honest we went from reasoners to agents in a fucking month and back to advanced godlike reasoners in another few weeks...

    • @phen-themoogle7651
      @phen-themoogle7651 6 днів тому +1

      To have true agents doing useful tasks online and the majority of human desk jobs we needed to reach this level of reasoning or even more so (they still need to learn and improve in real time), this development was expected. Stuff will speed up every 3-6 months now instead of yearly. Tons of new models coming!

  • @advaitc2554
    @advaitc2554 2 дні тому

    For me, I'll believe we have real AGI when a group of humanoid robots can successfully coach and manage a soccer team of 6 to 8 year old human kids.

  • @bestvexer
    @bestvexer 5 днів тому

    now we need an open source 'o3'

  • @greenumbrellacorp5744
    @greenumbrellacorp5744 7 днів тому +1

    Yea.... but this has to be explained again and again. Programming, software engineers, developers, whatever you wanna call it .. the act of just TYPING the code isnt that.
    The AI will TYPE the logic you tell it to type, the logic of the program you designed writting in code. The actual work, the logic, is the actual skill developers are useful for, once you have the logic figured out, whatever size the program is TYPING it is just a mechanical procedure... tedious, long... and prone to errors.... but not "hard". like doing manual math vs a calculator, but we still have mathematicians dont we?.In fact in calculator work NOTHING beats a computer. Because a mathematician is not just a human calculator. A developer isnt just someone who knows the language... a developer is someone who knows how to develop and build the program/aplication/web whatever required for the job, typing that program is just the way to get that info from our brain into the computer.... if AI can do that.. its autocomplete on steroids.

  • @ThreeChe
    @ThreeChe 7 днів тому +2

    Natural language is the future of coding languages. Within a decade we'll be speaking apps into existence.

  • @couchtaming23
    @couchtaming23 7 днів тому +1

    I think they are currently training o5.

  • @CraigLaValle
    @CraigLaValle 7 днів тому +4

    They're crowd sourcing safety testing because they cut their safety team.
    And while code competition benchmarks are cool, they're still commonly solved problems.
    We need a benchmark for solving novel problems.

    • @anandkanade9500
      @anandkanade9500 7 днів тому +3

      Its still impressive it can do that , llms are not databases so it shows level of training

    • @uGetkilled
      @uGetkilled 7 днів тому

      ^This. I use GPT4 pretty much on a daily basis for rubber duckying thoughts or to do some grunt tasks. People that think AI is going to create autonomous developers that will just 1:1 replace some engineers on your team, I think are either delusional, or they just build Hello World CRUD APIs with 0 business or domain logic. Next to that, every task seems "code" focused. As if all you do all day as a software engineer is write code. Literally the time spent on meetings, code reviews, writing your deployment logic, syncing with your BA on UATs, provisioning components in your cloud environment, hooking them into your application, etc. etc. etc. is all stuff that all GPT models suck ass at. Even Devin sucks at it. We're nowhere even close to having autonomous replacement for developers. This fearmongering and doomsaying is getting boring.

  • @Corporate_Viking
    @Corporate_Viking 7 днів тому +1

    Hey @corbin could you make a video on other people in AI that you follow or people who you think are legit and worth following in the space? I find currently there still to be a lot of hype and “fluffed” channels and also a lot of the “build a $10M app with me in 20 mins”.
    Feels like you are one of the few who actually are thinking critically and building a sound business model using AI. Would love some recs on others to follow or just about the overall hypeness of the space. Thanks again

  • @paulmuriithi9195
    @paulmuriithi9195 6 днів тому

    there is a possibility that 03 could be hosted on cerebras wafer racks to reduce inference costs. my estimates are $130 per command from February 2025. Groqs new hardware set up seems to be taken by anthropic which are expected to release their 03 competitor models in January 2025.

  • @MatthewSanders-l7k
    @MatthewSanders-l7k 7 днів тому +3

    03 model ranks 175th! Major shift from thinking AI code is subpar to recognizing its advanced capabilities. Logic-based coding is the future. Wonder what 04 will bring?

  • @maxziebell4013
    @maxziebell4013 7 днів тому +2

    But did you see the pricing though? One run can go from $10 to $1000.

  • @AlexJohnson-g4n
    @AlexJohnson-g4n 7 днів тому +3

    OpenAI 03 hitting top 175 on Codeforces is wild! AI coding is next-level now. Can't wait to see what we can build. When do you think OpenAI 04 will drop?

    • @0og
      @0og 7 днів тому +1

      considering the roughly 3 month dif, maybe 3-4 months from now? I'd imagine they'll use that time for red teaming it, then publicly release O3 mini a bit before then.

    • @phen-themoogle7651
      @phen-themoogle7651 6 днів тому

      Or skip to o5(=GPT-5) probably gonna switch back to their main model that has the ability to summon other o-models whenever it needs to and be extremely agentic

  • @NaveenReddy-p5j
    @NaveenReddy-p5j 6 днів тому

    03 model's coding leap is impressive. AI reshapes development; time to embrace the change.

  • @nephastgweiz1022
    @nephastgweiz1022 7 днів тому

    What happened to the o2 series ?

    • @tencizinec9583
      @tencizinec9583 7 днів тому +2

      O2 is a phone company. They couldn't use it.

  • @julianbruns7459
    @julianbruns7459 7 днів тому

    1:30 im not sure what your argument is here, gpt 4 had 6 months of safety testing...

  • @BoominGame
    @BoominGame 7 днів тому

    GPT 3.5 was way better than me at coding already.

  • @jacksontwilliams
    @jacksontwilliams 7 днів тому +1

    🙌

  • @CharlotteLopez-n3i
    @CharlotteLopez-n3i 6 днів тому +3

    AI climbs the ranks! The 03 model just made coding less about typing, more about logic. Ready for this revolution?

    • @RobertFletcherOBE
      @RobertFletcherOBE 6 днів тому +1

      coding was always about logic, it was never about typing. what are you on about?

  • @genkidama7385
    @genkidama7385 6 днів тому

    havent been impressed by any model's coding capabilities they just do bs and misconception about everything damn instruction. like it will make square wheeled car because you forgot to specify that cars need round wheels, whats even the point.

  • @peehi2
    @peehi2 6 днів тому

    What you haven't told is that a more efficient version is $20 per prompt.

  • @Recuper8
    @Recuper8 5 днів тому

    Bye bye human jobs

  • @mybocks3
    @mybocks3 7 днів тому

    *_Google got em so shook they skipped 02 and went straight to 03_* 😂

    • @reddddzzz
      @reddddzzz 7 днів тому

      Glazing google lol 😂😂 it's copyright issues not because of Google you genius

    • @mybocks3
      @mybocks3 7 днів тому

      @reddddzzz is that you, Sam?

    • @reddddzzz
      @reddddzzz 7 днів тому

      @@mybocks3 nope not Sam just saying some thing that is obvious

  • @sartipablo
    @sartipablo 7 днів тому

    so companies will rather pay $1000 to run these models which will get cheaper over time so wtf not a good time to be a developer.

  • @jonprall
    @jonprall 7 днів тому

    no because it isn't done, isn't shipped, isn't validated, and you just made a video on vaporware. congrats for not much.