Looking forward to the RAG series 🎉
you're the real mvp... your videos are ace, you break down these other tools into low-level code.
Surely you could have an agent running Opus or GPT-4o perform the testing and generate a matrix for you against your expected answers.
The point is it's local: more secure (data isn't being sent out or used as training data) AND you're not paying for it. ... AnnnD, you can expand it into your own workflows etc.
@@EddieAdolf he's the real MVP for sure.
Hi, I like the content - very informative. I have an idea for the next video and a very good showcase: agent automation (Selenium-based) to log in to any portal or interact with any page, e.g. eBay, searching for a specific item (a book), but not via an API, just a tool based on the Selenium library or similar. That would set a foot in the door of RPA ;)
I like this idea
Did you get a bigger background to accommodate a bigger llama model? 😆 Ok, let me get serious and actually watch this thing ...🤣.
There are whole teams that fundraised on being able to do this very task.
And this guy gave them a run for their money.
@@MrAhsan99 it points to how overvalued some of these startups are
@@matten_zero absolutely
impressive work!
How is it supposed to determine which city? You didn't specify "in the UK", so is Birmingham the largest city north of London in the whole world?
I would hope it could figure it out. I'm going to try the same thing with GPT-3.5-Turbo and 4o.
For context, Perplexity has a $50M+ valuation.
Context windows shouldn't just be cut off. They should slide and summarize as they go.
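One way to sketch that sliding idea: when the history exceeds the budget, fold the oldest turns into a running summary instead of hard-truncating. This is just an illustration, not anyone's actual implementation; `summarize` is a hypothetical stand-in for a call into whatever LLM you're running.

```python
# Sliding context window sketch: oldest messages get compressed into a
# summary so the recent turns always fit the budget.

def summarize(text: str) -> str:
    # Placeholder: a real version would ask the model to compress `text`.
    return text[:100]

def slide_window(history: list[str], budget: int, summary: str = ""):
    """Drop the oldest messages into the summary until history fits."""
    while len(history) > 1 and sum(len(m) for m in history) > budget:
        oldest = history.pop(0)
        summary = summarize(summary + " " + oldest)
    return summary, history
```

The budget here is in characters for simplicity; a real version would count tokens with the model's tokenizer.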
Hi, I see you have been doing quite some LLMing and RAGging. I was just fiddling with it, and the problem is that sometimes it generates real garbage, like GitHub pages which don't exist, or lines and lines of nothing, or it starts repeating. How do you prevent that, or catch it and stop generating? If you have some examples or videos, that would be helpful.
If you're using open source LLMs, you need to be aware of the prompt formats and stop tokens. I have a video on deploying a basic llama 3 chatbot that goes into a bit more detail.
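To make the prompt-format and stop-token point concrete, here's a minimal sketch using the Llama 3 Instruct chat template (the special token strings come from Meta's published template; the truncation helper is just an illustration of catching runaway generation):

```python
# Build a Llama 3 Instruct prompt by hand and cut output at its
# end-of-turn token, which is what prevents the model from rambling
# into garbage or repeating itself.

def build_llama3_prompt(system: str, user: str) -> str:
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

STOP_TOKENS = ["<|eot_id|>", "<|end_of_text|>"]

def truncate_at_stop(text: str, stops=STOP_TOKENS) -> str:
    """Cut generated text at the first stop token, if any appears."""
    cuts = [text.find(s) for s in stops if s in text]
    return text[: min(cuts)] if cuts else text
```

Most serving stacks let you pass these stop strings directly as a `stop` parameter, which is usually cleaner than post-hoc truncation.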
You should add Snowflake Arctic to the comparison! Apparently its 128 experts are less prone to hallucinating
I was wondering how a MoE architecture might do. I have Mixtral coming up, but will consider Snowflake Arctic too!
What temperature is it set to, and what quantized version are you running? The free version on Groq managed to get the final question right, though it struggled with the Aruba one. I'm sure with more tweaking we can get Llama 70B to do quite well on these tasks.
The temperature setting is 0 and the model is the 16-bit version. I think the Llama models are published in 16-bit anyway, so completely unquantized.
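For anyone wanting to reproduce this through Ollama rather than a cloud deployment (an assumption; the setup discussed here may differ), the relevant knobs look like this. Field names follow Ollama's `/api/generate` schema; note the default `llama3:70b` tag on Ollama is 4-bit quantized, so you'd need an fp16 tag to match the unquantized 16-bit weights mentioned above.

```python
# Sketch of an Ollama-style request that pins temperature to 0 for
# repeatable runs.
import json

payload = {
    "model": "llama3:70b",
    "prompt": "What is the largest city north of London in the UK?",
    "stream": False,
    "options": {
        "temperature": 0,   # greedy decoding: same output every run
        "num_ctx": 8192,    # Llama 3's native context length
    },
}
body = json.dumps(payload)  # POST this to the local /api/generate endpoint
```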
What GPU are you using? I followed your video before to deploy on RunPod, but I cannot connect to the host on port 8000. Or does it just take more time to start? Please let me know!
My bad, I just needed to wait a little bit.
I haven't gone back and looked at your implementation of your agent workflow, but since you're talking about restrictions in context windows with your scraping: are you using RAG with the large documents you're scraping?
From what I recall in the previous videos, the content is fed in full into the context, leaving the LLM to extract the information from the whole page.
That's why some pages don't fit in the 8K-token window.
RAG would work better by chunking and retrieving only the relevant text from the page, but it would also make the project's code quite a bit more complex, unless you rely on a framework.
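A rough sketch of that chunk-and-retrieve step. A real pipeline would score chunks with embeddings (e.g. from an embedding model served locally); here a plain word-overlap score stands in so the example is self-contained. Everything below is illustrative, not the project's actual code.

```python
# Chunk a scraped page with overlap, then keep only the chunks most
# relevant to the query, so the whole page never has to fit in context.

def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Split into overlapping word-windows of `size` words."""
    words = text.split()
    step = size - overlap
    return [" ".join(words[i:i + size]) for i in range(0, len(words), step)]

def score(query: str, chunk: str) -> float:
    # Stand-in for embedding similarity: fraction of query words present.
    q, c = set(query.lower().split()), set(chunk.lower().split())
    return len(q & c) / (len(q) or 1)

def top_chunks(query: str, page: str, k: int = 3) -> list[str]:
    chunks = chunk_text(page)
    return sorted(chunks, key=lambda c: score(query, c), reverse=True)[:k]
```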
@@supercurioTube aye, a RAG stage is needed then for large contexts or some form of hierarchical context summariser.
RAG would be better though
Yes, this is pretty much it. Would probably add some latency too because you would have to create the embeddings for each webpage each time you did a new search.
@@Data-Centric good point about the latency.
Ollama recently added the ability to keep several models loaded at the same time, which would help.
Otherwise swapping between the embedding model and an 8B LLM would slow things down significantly.
@@Data-Centric aye, but you could check first to see if it's already indexed