Gemini 2.0 Pro vs Deepseek R1 vs Openai o3 mini | Who will win? : Arch AGI Bench

Поділитися
Вставка
  • Опубліковано 6 лют 2025

КОМЕНТАРІ • 22

  • @tantzer6113
    @tantzer6113 7 годин тому +5

    R1’s answer at @8:46 can be considered as correct, since the question has more than one right answer. Here’s the alternative answer: the color of the boundary of the rectangle with the largest length. The problem with this benchmark is that the questions don’t always have unique solutions. Updated result: tie between o3 mini and DeepSeek.

  • @Trinity_TF2
    @Trinity_TF2 13 годин тому +9

    im excited to see gemini 2.0 pro thinking

    • @YJxAI
      @YJxAI  13 годин тому +1

      yes me too..

  • @Trinity_TF2
    @Trinity_TF2 13 годин тому +10

    btw if gemini 2.0 pro is that good without thinking then it will be good

  • @videosclips_
    @videosclips_ 7 годин тому +1

    Thanks 🎉❤

    • @YJxAI
      @YJxAI  6 годин тому

      ❤️

  • @gemini_537
    @gemini_537 10 годин тому +2

    Gemini 2.0 Pro ❤

  • @bodethoms8014
    @bodethoms8014 11 годин тому +5

    Gemini has thinking now

    • @michaelspoden1694
      @michaelspoden1694 10 годин тому +2

      Yeah I was wondering why you didn't use that it actually has two thinking models at the moment

    • @YJxAI
      @YJxAI  10 годин тому +2

      there's only flash thinking guess.

  • @Family_Guy_12
    @Family_Guy_12 10 годин тому +2

    okay ❤

  • @imqqmi
    @imqqmi 7 годин тому +1

    Who will win?
    The machines of course!

  • @abhrodipsingharoy4508
    @abhrodipsingharoy4508 13 годин тому +4

    I think you should scold o3 mini-high for better response

    • @YJxAI
      @YJxAI  13 годин тому +1

      🤣 yeah.

  • @darksidedevelopment
    @darksidedevelopment 10 годин тому

    I've been on the fence about Gemini. It always gives me different results than any other model. I've sorta just dismissed it entirely, but now that they have deep think. I may have to revisit Gemini as a potential tool.

    • @YJxAI
      @YJxAI  6 годин тому +1

      I think you are talking about the gemini right not ai studio.

    • @darksidedevelopment
      @darksidedevelopment 5 годин тому +1

      @@YJxAI Yes, I was referring to the Gemini Chat. It has been very hit and miss, more miss than hit :)

    • @YJxAI
      @YJxAI  3 години тому

      ​@ yeah it is becoming compelling lately.

  • @tvk-ox2my
    @tvk-ox2my 13 годин тому +1

    what program you are using to generate those promt

    • @YJxAI
      @YJxAI  12 годин тому +1

      I made a common prompt like understand the below question and each time new question comes it is placed in the placeholder. and I can copy it.

  • @wilsonbotlero2363
    @wilsonbotlero2363 12 годин тому +1

    Isn't there a thinking model from google too in AI studio?

    • @YJxAI
      @YJxAI  12 годин тому +1

      there is did a video on it. Please do check :)

  • @hamloji
    @hamloji 10 годин тому +1

    Do manim coding challenge