DeepSeek-V3 (Fully Tested) : RIP 3.5 Sonnet & O1! This Opensource Model Beats Claude 3.5 Sonnet!

Поділитися
Вставка
  • Опубліковано 12 січ 2025

КОМЕНТАРІ • 132

  • @wedding_photography
    @wedding_photography 18 днів тому +65

    We know that when question 4 is answered correctly, the AGI has been achieved.

    • @fabiankliebhan
      @fabiankliebhan 18 днів тому +4

      o1 pro gets it correct

    • @markantscott
      @markantscott 18 днів тому +3

      Q4 is bigger than mere AGI. It is the ability to answer obscure English Pub Quiz Night questions.

    • @tescOne
      @tescOne 18 днів тому

      @@fabiankliebhan sonnet too

    • @kafkaesqued
      @kafkaesqued 18 днів тому +2

      What is so special about question 4?

    • @seanlbrennan
      @seanlbrennan 18 днів тому +2

      Deepthink option gets Q4 right but took two turns. First turn it ran out of tokens testing words and used a 10 letter word. asked it to keep going and it gives Sententious. The next turn it came up with Transparent right away.

  • @HedleyPugh
    @HedleyPugh 18 днів тому +17

    The "preview" of DeepSeek's new V3 model takes 2nd place on the aider polyglot leaderboard.
    1: 62% o1
    2: 48% DeepSeek V3 Preview
    3: 45% Sonnet
    4: 38% Gemini-exp-1206
    5: 33% o1-mini

  • @theorderofz
    @theorderofz 18 днів тому +10

    Thanks for always putting us on, mate

  • @EditUMedia
    @EditUMedia 18 днів тому +8

    Thank you so much for these videos covering new models. Merry Christmas

  • @d.d.z.
    @d.d.z. 18 днів тому +12

    With Qwen and Deepseek China strikes back. So amazing to live in 2025.

  • @samuelsilveira9709
    @samuelsilveira9709 18 днів тому +7

    Merry Christmas, codeking

  • @ElvinHoney707
    @ElvinHoney707 18 днів тому +10

    o1 passes question 4: "A suitable answer is "SENTENTIOUS." It is an English adjective (from Latin "sententiosus"), it has 11 letters, begins and ends with S, and its vowels (e, e, i, o, u) appear in strictly alphabetical (non‐decreasing) order."

  • @AI-fm2vu
    @AI-fm2vu 18 днів тому +1

    This is my Christmas gift, thx!

  • @sinapxiagency
    @sinapxiagency 18 днів тому +3

    King, i dont know how you get this reviews so fast even in holidays, thank you so much

  • @JeffreyWang-hh4ss
    @JeffreyWang-hh4ss 18 днів тому

    very good new questions! i like how u are being super objective with all the model reviews.

  • @Kevencebazile
    @Kevencebazile 18 днів тому +4

    Merry Christmas Brother Love your content

  • @collinsk8754
    @collinsk8754 18 днів тому

    Great work. And it's finally up to date with NextJS 14! 🙌🙌

  • @TitoSadek
    @TitoSadek 18 днів тому

    Merry Christmas , I love your content , thanks

  • @Bu3askoorDXB
    @Bu3askoorDXB 17 днів тому

    Thank you so much for this! It's amazing. I just have tried it. It is available on open router. The pricing is great

  • @ram49967
    @ram49967 18 днів тому +2

    Super questions for the LLM! It's ok with me to give it a pass on Question 3, even though it used the first letter and not the second letter to make the Haiku.

  • @SipChai
    @SipChai 18 днів тому +7

    Panda has replaced Santa. Poor old man.

  • @jacobfloyd6929
    @jacobfloyd6929 18 днів тому +6

    Brother you really have extremely valuable content. Have you ever thought about running a community/course? I’m sure there’s a lot of people looking to collaborate with like minded people, especially since AI is so tough to stay on top of.

    • @AICodeKing
      @AICodeKing  18 днів тому +7

      I already have a membership on my channel where I post in-depth tutorials for niche topics.

    • @jacobfloyd6929
      @jacobfloyd6929 18 днів тому +2

      @ thank you I’m gonna look into that. Are you opposed to creating a discord or Skool community? That way everyone can collaborate on new stuff they’re finding, we all know networking is everything but it’s tough to find a valuable community.

    • @theorderofz
      @theorderofz 18 днів тому

      @@jacobfloyd6929true. That would work pretty well.

    • @jmg9509
      @jmg9509 18 днів тому

      This guy in the vid sounds like ai lol

  • @maddoxthorne2297
    @maddoxthorne2297 18 днів тому +1

    Christmas gift galore.🎁❤️

  • @notshekhar4738
    @notshekhar4738 18 днів тому +13

    Even with a slightly altered prompt like ' 'what mode are you using?' (with a single quote at the beginning), the model still responds with 'GPT-4'. This raises questions about its underlying architecture.

    • @gui1236100
      @gui1236100 18 днів тому +4

      Maybe they trained on data generated by gpt-4

    • @BACA01
      @BACA01 18 днів тому +3

      @@gui1236100 They stole it as always 😁

    • @wwkk4964
      @wwkk4964 18 днів тому

      They tend to tend to say that, even Gemini would say it last year. everyone trained on Chatgpt.

    • @GRVTY3
      @GRVTY3 18 днів тому +1

      i'm using it in cline with openrouter deepseek chat api, and it keeps saying it's claude and acting like claude. something really sus going on here

    • @boynet2
      @boynet2 18 днів тому +3

      @@GRVTY3 maybe cline prompt has something like "you are Claude..." ?

  • @displayname7t4
    @displayname7t4 18 днів тому

    Most useful channel in youtube right now

  • @mrinalraj4801
    @mrinalraj4801 18 днів тому

    Thanks for the video 😊

  • @jeremyph8319
    @jeremyph8319 14 днів тому

    Which is better, DeepSeek V3 or DeepSeek R1 Lite Preview? When I tried the R1, I thought V3 was better because the R1 wasn’t following my instructions.

  • @flutterflowexpert
    @flutterflowexpert 18 днів тому

    New questions! Finally! 🎉❤

  • @Wesley58481
    @Wesley58481 18 днів тому

    Tks for sharing!! u so incredible!

  • @Bangs_Theory
    @Bangs_Theory 18 днів тому

    Merry X-mas King!

  • @chyldstudios
    @chyldstudios 18 днів тому +1

    Love to see this

  • @voltax4435
    @voltax4435 18 днів тому +4

    Finally a real sonnet alternative, and way cheaper!

  • @notshekhar4738
    @notshekhar4738 18 днів тому +4

    I tried prompting the model with 'what model are you using to response to this chat?' and it said 'GPT-4'. When I followed up with 'who developed you?', it answered 'OpenAI'. This makes me wonder if the system is actually utilizing OpenAI APIs.

    • @gui1236100
      @gui1236100 18 днів тому +2

      Maybe just training data generated by gpt-4

    • @notshekhar4738
      @notshekhar4738 18 днів тому

      @@gui1236100 maybe yess

    • @BACA01
      @BACA01 18 днів тому +1

      When it was deepseek v2 it was saying that it's a gpt3.5 and now it says it's gpt4

    • @Nomadnotepad
      @Nomadnotepad 18 днів тому +2

      Tell me you don’t understand how training data works without telling me you don’t know how training data works.

    • @TheFinanciallyWiseKidsTV222
      @TheFinanciallyWiseKidsTV222 18 днів тому

      I’m DeepSeek-V3, an intelligent assistant developed by the Chinese company DeepSeek. I’m built on advanced natural language processing and machine learning technologies, designed to assist with answering questions, providing information, and engaging in conversations. If you have any questions or need help, feel free to ask! 😊

  • @uniq6318
    @uniq6318 18 днів тому +2

    Without using deep thinking
    That's amazing

  • @Teetanthegamer
    @Teetanthegamer 18 днів тому +1

    Can you please make a tutorial on how to use it with cline locally through ollama or through paid api ?

  • @HarishPillay
    @HarishPillay 16 днів тому

    If you use the code from the github repo and run it locally, it is correct to claim that the code is open source (MIT license afterall). But if you use their model form huggingface, they put that out on a proprietary license and no longer open source.

  • @pranjalsuthar9476
    @pranjalsuthar9476 18 днів тому +1

    hey...You are making amazing videos. Please make video on organised files by AI

  • @jeffwads
    @jeffwads 18 днів тому

    I asked QwQ 32b the 4th question and it refused on the grounds that it may be part of a competition test and it wouldn't be fair, etc. It can be stubborn at times but I hope this isn't a sign of things to come.

  • @alexjensen990
    @alexjensen990 18 днів тому

    Well, color me surprised. I look forward to using it. Especially the lite model.

    • @Woutermans
      @Woutermans 17 днів тому

      Is there any evidence for a lite model coming?

  • @SimonStarfinger
    @SimonStarfinger 18 днів тому

    It's still not really usable with cline though, is it?
    Last time I tried, it got caught in a loop

  • @andrinSky
    @andrinSky 18 днів тому +2

    Hello Is it possible to work with Deepseek perhaps with Cline or RooCline. If yes how can i do this. Because this would be very Great! I could be very good for Coding.

    • @sinapxiagency
      @sinapxiagency 18 днів тому

      Use in cline the Open ai compatible api

    • @andrinSky
      @andrinSky 18 днів тому

      @@sinapxiagency And how are the Settings under "Open AI compatible AI"?
      I Mean the "Base URL"?
      and The "Model ID"?

    • @finnpoitier
      @finnpoitier 18 днів тому

      @@sinapxiagency Do you know, which provider? Openrouter?

    • @sorenkirksdjfk7310
      @sorenkirksdjfk7310 17 днів тому

      follow deepseek's documentation, it's easy

  • @SudeeptoDutta
    @SudeeptoDutta 18 днів тому

    So, If I'm already paying for the 2.5 API using Continue extension, it should automatically start using v3 right? No need to configure any new API key right?

    • @AICodeKing
      @AICodeKing  18 днів тому +1

      Yes, it should automatically switch

  • @fabiankliebhan
    @fabiankliebhan 18 днів тому

    Will deepseek v3 be available for cursor?

  • @Reverse-sg5rn
    @Reverse-sg5rn 18 днів тому +1

    Merry Christmas. Can you do code testing with cline and aider on it?

  • @UsmanAli-ve6tq
    @UsmanAli-ve6tq 18 днів тому

    Is there any model which was able to answer question 4 and achieved 100% score.

  • @supriyadas9703
    @supriyadas9703 18 днів тому

    Can you please give us the information about its knowledge cut off date or month

    • @huk2617
      @huk2617 17 днів тому

      It's not been stated anywhere as of yet

  • @JohnLewis-old
    @JohnLewis-old 18 днів тому

    How fast are inference speeds? Did it do well in Cline?

  • @salimalsenani2614
    @salimalsenani2614 18 днів тому +2

    Everyone wants to take sonnet down.. but no one could!
    It remains the king of coding.

    • @thanartchamnanyantarakij9950
      @thanartchamnanyantarakij9950 18 днів тому

      Not at this time. You can check by yourself

    • @salimalsenani2614
      @salimalsenani2614 18 днів тому

      @thanartchamnanyantarakij9950 I just tested Deepseek V3, Gemini 2.0 Flash, and Sonnet, asked them to create amazing landing page for coffee brand.
      Sonnet won by far in terms of design and following correct prompts.
      Second is so close Deepseek V3 and Gemini 2.0 Flash, but I preferred the Deepseek it's really amazing 🤩.

    • @Osys91
      @Osys91 18 днів тому

      ​@@thanartchamnanyantarakij9950 did it outperformed sonnet? I quickly tested some code and sonnet was still performing better

  • @aculz
    @aculz 18 днів тому +2

    wow, i have been waiting for this. i use deepseek as my main LLM since its the cheapest. great job to cover this model
    it seems we get our open-source model king this end of the year.
    Marry Christmas and Happy new year everyone 🎄🎄

  • @santypk5
    @santypk5 18 днів тому +1

    Why Australia and not Mongolia ?

  • @DouhaveaBugatti
    @DouhaveaBugatti 18 днів тому

    Um can you also add questions for coding in other frameworks like svelte etc.
    This will tell how much useful this model can be for building real applications

  • @greenpulp.
    @greenpulp. 18 днів тому

    Nice! How do we use it with Cline in VS Code?

    • @karamjittech
      @karamjittech 18 днів тому

      Use openai compatible from Cline settings.

  • @TawnyE
    @TawnyE 18 днів тому +1

    E
    Merry Christmas 🎄🎅

  • @aislanarislou
    @aislanarislou 18 днів тому

    What about programming skills ?

  • @nolannosike
    @nolannosike 18 днів тому

    is question 5 correct? you should get a decimal no? 20% of 48 is 9.6 so shouldnt the answer be 38, 38.4 to be exact? the way it did it also seems correct but we're getting two diff answers.

    • @AquaAstronaut23
      @AquaAstronaut23 18 днів тому +1

      That’s 20% of the inflated number 48 not the original number 40. You need to divide by the percentage as a decimal (1.2) to work it back.

  • @jessewang3330
    @jessewang3330 17 днів тому

    They have officially released V3 now, and they will raise prices, will still be much cheaper than sonnet though

  • @ProkopHapala
    @ProkopHapala 18 днів тому

    The biggest problem I always have with using DeepSeek for programming is the speed (14 token/s, its like 5times slower than Sonet and 10 slower than GPT), I hope they adress also that.

  • @Opeyemi.sanusi
    @Opeyemi.sanusi 18 днів тому

    I am amazed

  • @misterleo885
    @misterleo885 18 днів тому +1

    QvQ 72B Test please

  • @LucianoFiandesio
    @LucianoFiandesio 18 днів тому

    If I try to ask Deepseek "Which model are you?" I get this answer (via API): "I am an instance of OpenAI's language model, specifically based on the GPT-4 architecture. My design allows me to understand and generate human-like text based on the input I receive. If you have any questions or need assistance, feel free to ask!". If i try to ask the same question using the web interface from Deepseek, I get this answer: "I'm DeepSeek-V3, an AI model created exclusively by the Chinese Company DeepSeek. I'll do my best to help you.". Any idea why?

    • @AICodeKing
      @AICodeKing  18 днів тому +1

      It might be because they would have used some synthetic data from OpenAI's models. It's a preview version. The official one will come today which should fix this.

  • @DemocracyDecoded
    @DemocracyDecoded 18 днів тому

    I need to see a direct comparison to sonnet 3.5 new in all your videos buddy.

  • @formixcode
    @formixcode 18 днів тому +1

    yes I can sense their suddenly better in coding solving here is why

  • @skunagar0
    @skunagar0 15 днів тому

    install it on local pc

  • @perfectartiste6332
    @perfectartiste6332 18 днів тому +1

    merry Christmas, first here

  • @VincentLannoo-Bourton
    @VincentLannoo-Bourton 18 днів тому

    I know, there is twoo Rounds...
    But...
    first:
    After careful research, there is no known English adjective of Latin origin of 11 letters that has the first and last letter identical and whose vowels appear in alphabetical order.
    second was jusr "are you sure?"
    and: The adjective that fits all the specified criteria is "sententious."
    11 letters: S-E-N-T-E-N-T-I-O-U-S.
    Adjective: It describes someone who is given to moralizing or using maxims.
    Latin origin: The word derives from the Latin "sententia," meaning opinion or meaning.
    First and last letters are the same: Both are 'S.'
    Vowels in alphabetical order: The vowels are E, E, I, O, U, which follow the sequence A, E, I, O, U (with A not present).
    Thus, "sententious" satisfies all the conditions.

  • @limjuroy7078
    @limjuroy7078 18 днів тому

    Interesting!!!

  • @paulyflynn
    @paulyflynn 18 днів тому

    amazing

  • @aleksanderspiridonov7251
    @aleksanderspiridonov7251 18 днів тому

    Finally🎉🎉🎉🎉🎉🎉🎉❤

  • @다루루
    @다루루 15 днів тому

  • @fun8711
    @fun8711 18 днів тому

    Question number 4 stand on ten toes

  • @miselgpt
    @miselgpt 18 днів тому +1

    Why not Mongolia? 😉

  • @JoraMacKornev
    @JoraMacKornev 18 днів тому

    Rip 3.5 Sonnet and O3 😅

  • @rashad6459
    @rashad6459 18 днів тому +1

    I cant keep up😂😂😂

  • @varunaeeriyaulla
    @varunaeeriyaulla 18 днів тому

    Bro, I just asked, "What is the AI model I’m chatting with?" (using the Deepseek API via OpenWebUI). The answer is "You're currently chatting with OpenAI's GPT-4".
    I asked the same question from the chat and the code model. Are they reselling the OpenAPI GPT4???? Crazy. Please run a test.

    • @UsmanAli-ve6tq
      @UsmanAli-ve6tq 18 днів тому

      I got the same answer :)

    • @gui1236100
      @gui1236100 18 днів тому

      Maybe they used training data generated by gpt-4

    • @varunaeeriyaulla
      @varunaeeriyaulla 18 днів тому

      @@UsmanAli-ve6tq Yes, I asked the same question from the Deepseek chat interface, and it says "You're currently chatting with DeepSeek-V3". Very strange.

    • @varunaeeriyaulla
      @varunaeeriyaulla 18 днів тому

      @@gui1236100 then why it's only one API but not on deepseek chat interface?

  • @sontieudev
    @sontieudev 18 днів тому

    In v2.5 its slow, and impossible to use in my usecases.

  • @Luca-xr7bs
    @Luca-xr7bs 18 днів тому

    Uhmm I dunno

  • @beavenjnr4187
    @beavenjnr4187 17 днів тому

    Somalia? 🤷🏽‍♂️

  • @DarkLineSnes
    @DarkLineSnes 17 днів тому

    i find gpt very bad when compared to others coding focused a.i, its great for cheating on tests and sort of but nothing more than that, the problem is that gpt turned too limited for free users, you could say free users arent important but actually they are because their job is to come in volume to make the websites popular and bring paid users to it, only paid users arent enough.
    also what is a game of life?

    • @huk2617
      @huk2617 17 днів тому

      look up "conway's game of life"