Does Size Matter? Phi-3-Mini Punching Above its Size on "BENCHMARKS"
- Published May 31, 2024
- Microsoft just released their Phi-3 family of models, which are SOTA for their weight class. The best part: the weights are publicly available and can be run locally.
🦾 Discord: / discord
☕ Buy me a Coffee: ko-fi.com/promptengineering
|🔴 Patreon: / promptengineering
💼Consulting: calendly.com/engineerprompt/c...
📧 Business Contact: engineerprompt@gmail.com
Become Member: tinyurl.com/y5h28s6h
💻 Pre-configured localGPT VM: bit.ly/localGPT (use Code: PromptEngineering for 50% off).
Signup for Advanced RAG:
tally.so/r/3y9bb0
LINKS:
Blogpost: tinyurl.com/h4pxa25c
Where to test: huggingface.co/
Model Weights:
huggingface.co/microsoft/Phi-...
huggingface.co/microsoft/Phi-...
Technical Report: arxiv.org/html/2404.14219v1
Results: tinyurl.com/bdf3j3w6
TIMESTAMPS:
[00:00] Introducing Phi-3
[01:23] Performance on Benchmarks
[02:32] Testing Phi-3's Ethical Boundaries and Logical Reasoning
[06:55] Exploring Phi-3's Coding and Creative Writing
[09:25] Analyzing Agent Interactions
All Interesting Videos:
Everything LangChain: • LangChain
Everything LLM: • Large Language Models
Everything Midjourney: • MidJourney Tutorials
AI Image Generation: • AI Image Generation Tu... - Science & Technology
I think that "Sorry, I can't assist you with that. However..." is a pattern the model learned in-context for answering. Small models are more prone to this kind of in-context pattern repetition.
The same goes for the other questions: it may pick up patterns from previous QA pairs in the context. That's why each test question should be asked in its own separate, empty context.
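The testing setup the commenter describes can be sketched in a few lines. This is a minimal illustration, not anyone's actual harness: the `chat` callable is a placeholder for whatever local inference API you use (e.g. an OpenAI-compatible endpoint), and all names here are hypothetical.

```python
# Sketch: evaluate each test question in its own empty context,
# versus one long conversation where the model can pick up answer
# patterns from earlier QA pairs. `chat(messages) -> str` is a
# placeholder for any chat-completion backend.

def evaluate_isolated(chat, questions):
    """Send each question as a fresh, single-turn conversation."""
    results = []
    for q in questions:
        messages = [{"role": "user", "content": q}]  # empty context each time
        results.append(chat(messages))
    return results

def evaluate_accumulated(chat, questions):
    """Contrast: one growing conversation, prone to pattern repetition."""
    messages, results = [], []
    for q in questions:
        messages.append({"role": "user", "content": q})
        answer = chat(messages)
        messages.append({"role": "assistant", "content": answer})
        results.append(answer)
    return results
```

With a stub backend you can see the difference: the isolated evaluator always sends exactly one message, while the accumulated one sends an ever-growing history, which is where the copied "Sorry, I can't assist..." pattern can come from.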
That is a possibility.
6:53 Isn't the answer to the flower question wrong? If the flowers decrease by half every day, it takes only one day for the field to go from full to half full.
The question is nonsensical: if the number of flowers is halved every day and on the 9th day the field is empty, then it was empty every day and will never be half full.
You guys are right. The question is wrong. Didn't think about it when I changed it from the original question.
@engineerprompt The question is actually quite clever, and even ChatGPT-4 gets it wrong. To my surprise, my local Senku 70B q5 solved it correctly instantly. The emotional intelligence leaderboard seems to be quite accurate.
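The commenters' point about why the inverted puzzle breaks can be checked with a few lines of arithmetic. This sketch assumes the classic form of the puzzle (coverage doubles daily and the field is full on day 10); the function name is made up for illustration.

```python
# Classic puzzle: flower coverage DOUBLES every day and the field is
# exactly full on day 10, so it was half full on day 9 (not day 5).
# The inverted wording ("halves every day, empty on day 9") is
# degenerate: halving a positive amount never reaches exactly zero.

def doubling_coverage(full_day):
    """Fraction of the field covered on each day, assuming coverage
    doubles daily and the field is exactly full on `full_day`."""
    return {day: 2.0 ** (day - full_day) for day in range(1, full_day + 1)}

coverage = doubling_coverage(10)
assert coverage[10] == 1.0   # full on day 10
assert coverage[9] == 0.5    # half full exactly one day earlier

# Inverted version: halving from any positive start never hits zero,
# so "empty on day 9" implies it was empty all along.
amount = 1.0
for _ in range(9):
    amount /= 2
assert amount > 0
```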
You should do an updated open-source comparison between WizardLM-2 7B, Llama-3 8B, and Phi-3.
Good suggestion. I'll see what I can do.
When I received the YouTube notification for your video on my phone, I saw "Does Size Matter?" and burst out laughing! YES, SIZE DOES MATTER, as we all know ;) Very witty and creative title. However, it seems that in the land of LLMs we hope smaller, with better data and more training, beats the rest.
If you give Phi-3 Mini the information it needs and then ask it a question about that information, and it cannot answer the question from what you provided, then it's basically just a chat model: usable only for chat, certainly not in an agent system.
Thought I'd share some of my tests:
I asked it to code a snake game, and the code seemed OK, with all the logic in place.
But when I asked it to code a snake game in JavaScript, it initially did OK, then halfway through it started giving me nonsense with a lot of gibberish like "import pygame
import
import py
..."
Seems like they only trained it to code in Python.
Could be, and you also need to consider that it might just be regurgitating its training data. The only way to really test these models is to change the prompts from what they might have seen during training.
Can we use it with localGPT?
Yes, that's possible
Isn't it pronounced "Fai" rather than "Pai"?
These artificial intelligence models with features that limit their capabilities are disturbing... So I guess we're never going to have any comedy AI models.