Is Gemini Flash 2.0 Worth the hype?

  • Published 23 Jan 2025

COMMENTS • 12

  • @shreyam1008
    @shreyam1008 1 month ago +1

    Great video, but wouldn't Claude 3.5 Sonnet be a better comparison, since it's their latest model, at the base model fee? I would like to see a similar test with Sonnet, plus more general daily-usage problems from different fields.

  • @JEHOASJY
    @JEHOASJY 1 month ago

    Very good analysis.

  • @parimalarenga92
    @parimalarenga92 1 month ago +1

    First off, I don't have much in-depth knowledge of deep learning...
    Can you add xAI Grok to your comparisons? I'm also an LLM enthusiast, and I'm impressed with its reasoning abilities. It is more consistent at generating results compared with Gemini/Claude/GPT, its code generation and reasoning are way more powerful, and it's now free to use.
    What I'm about to say might stir up some controversy😂, but from my POV...
    my list is:
    1) Grok/Claude
    2) Copilot/GPT
    3) Gemini
    I'll make Grok my go-to LLM tool. Note: I'm not an Elon fanboy🙂
    You're so underrated, you need more recognition.

    • @YJxAI
      @YJxAI 1 month ago +1

      Thanks, that means a lot. There is nothing wrong with having a list of your own. That is why I say don't take my results as ultimate fact.
      If the LMSYS guys can keep GPT-4 above o1, then I think our lists are way better than that.

  • @davidcampos8952
    @davidcampos8952 1 month ago +1

    Can you *please* publish that *LLM Test* so we can see it and use it to test models ourselves?

    • @YJxAI
      @YJxAI 1 month ago

      Yes bro, working on it. I'll have to make some changes to make it more dynamic, but I will surely do it.

    • @davidcampos8952
      @davidcampos8952 1 month ago

      @YJxAI thank you so much!

  • @iamboring2535
    @iamboring2535 1 month ago +1

    How can you get consistency when you are using a temperature of 1 and a top-p value of 0.95? If you want consistency, you must set it to a low value.

    • @YJxAI
      @YJxAI 1 month ago

      I wanted to test it with the default values.
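
As a rough illustration of the point raised in this thread, here is a minimal sketch of how temperature and top-p reshape a next-token distribution. The logits are made-up toy values and `sampling_distribution` is a hypothetical helper, not part of any Gemini SDK or of the test harness used in the video.

```python
import math


def sampling_distribution(logits, temperature=1.0, top_p=1.0):
    """Return token probabilities after temperature scaling and top-p filtering.

    Hypothetical helper for illustration only; real model servers may
    implement these steps differently.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")

    # Temperature scaling: divide the logits before the softmax.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Top-p (nucleus) filtering: keep the smallest set of most likely
    # tokens whose cumulative probability reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break

    # Renormalize over the kept tokens; everything else gets zero.
    filtered = [0.0] * len(probs)
    for i in kept:
        filtered[i] = probs[i] / mass
    return filtered


# Toy scores for four candidate tokens (assumed values, not model output).
toy_logits = [2.0, 1.0, 0.5, 0.0]

default_like = sampling_distribution(toy_logits, temperature=1.0, top_p=0.95)
low_temp = sampling_distribution(toy_logits, temperature=0.2, top_p=0.95)
high_temp = sampling_distribution(toy_logits, temperature=2.0, top_p=0.95)

for name, dist in [("T=1.0", default_like), ("T=0.2", low_temp), ("T=2.0", high_temp)]:
    print(name, [round(p, 3) for p in dist])
```

With these toy numbers, the low-temperature setting puts all of the probability on the single most likely token, so repeated runs would give the same output, while a higher temperature spreads probability across more tokens and makes repeated runs diverge more often.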

  • @saikatkarmakar6633
    @saikatkarmakar6633 1 month ago +1

    Change the temperature from 1 to 2 in Gemini Flash, and then see the accuracy.