Great video, but wouldn't comparing with Claude 3.5 Sonnet be a better comparison, since it's their latest model with the base model fee? Would like to see a similar test with Sonnet, plus a more general daily-usage problem test from different fields.
Very good analysis.
Firstly, I don't have much in-depth knowledge of deep learning things...
Can you add xAI's Grok to your comparisons? I'm also an LLM enthusiast, and I'm impressed with its reasoning abilities; it's more consistent at generating results compared to Gemini/Claude/GPT, and its code generation/reasoning is way more powerful. Plus, it's free to use now.
What I'm going to tell you might create some controversy for me 😂, but from my POV, here's my list:
1) Grok / Claude
2) Copilot / GPT
3) Gemini
I'll make Grok my go-to LLM tool. Note: I'm not an Elon fanboy 🙂
You're so underrated; you need more recognition.
Thanks, means a lot. There's nothing wrong with having a list of your own. That's why I say don't take my results as the ultimate fact.
If the LMSYS guys can keep GPT-4 above o1, then I think our lists are way better than that.
Can you *please* publish that *LLM Test* so we can see it and use it to test models ourselves?
Yes bro, working on it. I'll have to make some changes to make it more dynamic, but I'll surely do it.
@YJxAI thank you so much!
How can you get consistency when you're using a temperature of 1 and a top-p value of 0.95? If you want consistency, you must set them to low values.
I wanted to test it out on the default values.
Change the temperature from 1 to 2 in Gemini Flash, and then see the accuracy.
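For anyone curious how much these sampling settings affect consistency, here is a minimal sketch assuming the `google-generativeai` Python SDK; the model name, prompt, and parameter values are placeholders for illustration, not the exact setup used in the video:

```python
# Minimal sketch: comparing low vs. default-like sampling settings with the
# google-generativeai SDK (assumed). Model name and prompt are placeholders.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # placeholder API key
model = genai.GenerativeModel("gemini-1.5-flash")  # assumed model name

prompt = "What is 17 * 24? Answer with just the number."

# Low temperature + low top_p -> more deterministic, repeatable answers.
deterministic = {"temperature": 0.1, "top_p": 0.1}
# Roughly the defaults discussed above (temperature=1.0, top_p=0.95) -> more varied outputs.
default_like = {"temperature": 1.0, "top_p": 0.95}

for label, config in [("low", deterministic), ("default", default_like)]:
    outputs = [
        model.generate_content(prompt, generation_config=config).text.strip()
        for _ in range(3)  # a few repeats to eyeball consistency
    ]
    print(label, outputs)
```

Running the same prompt a few times under each config usually makes the difference obvious: the low-temperature run tends to repeat the same answer, while the default-like settings vary more.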