Superfast RAG with Llama 3 and Groq
- Published 7 Jul 2024
- Groq API provides access to Language Processing Units (LPUs) that enable incredibly fast LLM inference. The service offers several LLMs including Meta's Llama 3. In this video, we'll implement a RAG pipeline using Llama 3 70B via Groq, an open source e5 encoder, and the Pinecone vector database.
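The pipeline described above can be sketched in a few lines: retrieved chunks are packed into a grounded prompt, then Llama 3 70B answers via the Groq API. This is a minimal sketch, not the video's exact code; the prompt template and the `build_prompt`/`rag_answer` helper names are my own assumptions (the model id `llama3-70b-8192` is Groq's, and the call needs `GROQ_API_KEY` set).

```python
# Sketch of the RAG flow: retrieved chunks -> grounded prompt -> Llama 3 70B
# on Groq. build_prompt and the template are illustrative assumptions.

def build_prompt(query: str, docs: list[str]) -> str:
    """Pack retrieved chunks into a context-grounded prompt."""
    context = "\n---\n".join(docs)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )

def rag_answer(query: str, docs: list[str]) -> str:
    """Send the grounded prompt to Llama 3 70B via Groq (needs GROQ_API_KEY)."""
    from groq import Groq  # deferred import; pip install groq
    client = Groq()  # reads GROQ_API_KEY from the environment
    res = client.chat.completions.create(
        model="llama3-70b-8192",
        messages=[{"role": "user", "content": build_prompt(query, docs)}],
    )
    return res.choices[0].message.content
```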
📌 Code:
github.com/pinecone-io/exampl...
🌲 Subscribe for Latest Articles and Videos:
www.pinecone.io/newsletter-si...
👋🏼 AI Consulting:
aurelio.ai
👾 Discord:
/ discord
Twitter: / jamescalam
LinkedIn: / jamescalam
#artificialintelligence #llama3 #groq
00:00 Groq and Llama 3 for RAG
00:37 Llama 3 in Python
04:25 Initializing e5 for Embeddings
05:56 Using Pinecone for RAG
07:24 Why We Concatenate Title and Content
10:15 Testing RAG Retrieval Performance
11:28 Initialize connection to Groq API
12:24 Generating RAG Answers with Llama 3 70B
14:37 Final Points on Why Groq Matters
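The "Why We Concatenate Title and Content" chapter can be sketched as a small formatting step: prepending the document title to each chunk gives the embedding more context, and e5 models additionally expect `passage:`/`query:` prefixes on their inputs. The `to_passage`/`to_query` helper names are illustrative assumptions, not the video's code.

```python
# Concatenate title + content so each chunk carries its document's context,
# and add the passage/query prefixes that e5-family encoders expect.

def to_passage(title: str, content: str) -> str:
    """Format a chunk for e5: prefix, then title prepended to the content."""
    return f"passage: {title}\n{content}"

def to_query(text: str) -> str:
    """Queries get the matching e5 prefix."""
    return f"query: {text}"

# Embedding these strings (deferred; pip install sentence-transformers):
# from sentence_transformers import SentenceTransformer
# model = SentenceTransformer("intfloat/e5-base-v2")
# vecs = model.encode([to_passage("Llama 3", "Meta's open-weight LLM ...")])
```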
Hi James, Microsoft just open-sourced their GraphRAG technology stack; might be cool to take a look and see how we can leverage/combine them both.
Nice thing is that you can use Groq with LangChain as well
Yes very true
What are your thoughts on adding a short summary description of the document or paper in each chunk, including the title?
It's a good idea. I haven't tried it before, but it seems sensible. You'd need to find a balance between too much summary, which might "overpower" the meaning of the chunk itself, and getting enough summary in there to be useful. But if you get that balance right, it feels like a great idea imo
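The idea from the exchange above can be sketched as a tiny formatting helper: prepend the title and a capped summary to each chunk before embedding. This is a sketch of one possible balance, and the `max_summary_chars` cap is an arbitrary knob to keep the summary from "overpowering" the chunk, as the reply warns.

```python
# Prepend title + a capped document summary to each chunk before embedding.
# max_summary_chars limits how much the summary can dominate the chunk text.

def chunk_with_summary(title: str, summary: str, chunk: str,
                       max_summary_chars: int = 200) -> str:
    """Add document-level context to a chunk without letting it dominate."""
    return f"{title}\n{summary[:max_summary_chars]}\n\n{chunk}"
```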
Groq is insanely fast
Yeah it’s wild
Is there any OSS embedding model you'd recommend over e5 for real/prod use cases? I've just used OpenAI so far
gte-base or bge-base are good in benchmarks, but you've gotta really test them on your use case. You should also fine-tune the embeddings with your use case data.
E5 has been good, I like Jina's embedding model, and I've heard some good things about BAAI bge-m3 too for hybrid search
@jamesbriggs maybe in some future video you could cover bge-m3 :)) this model sounds pretty cool (especially dense/multi-vector/sparse retrieval)
You in Bali nice! I am looking for an online job mate. I'm pretty desperate at this point
You can tell? But yes, here for a while - just work on AI stuff, get yourself out there a bit, it does take time though
Is this reusable in such a way that we can switch from calling Groq to calling OpenAI GPT-4o or other models?
Yeah it's pretty simple to swap them out, they use a similar (maybe even the same) API
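Since Groq exposes an OpenAI-compatible endpoint, the swap in the answer above can be as small as changing the `base_url` and model name passed to the OpenAI SDK. A minimal sketch; the `PROVIDERS` table and `make_client` helper are my own illustrative names, and the Groq `base_url` assumes its documented OpenAI-compatible endpoint.

```python
# Swap Groq <-> OpenAI by changing only base_url and model: Groq serves an
# OpenAI-compatible API, so the same SDK and call shape work for both.

PROVIDERS = {
    "groq": {
        "base_url": "https://api.groq.com/openai/v1",
        "model": "llama3-70b-8192",
    },
    "openai": {
        "base_url": "https://api.openai.com/v1",
        "model": "gpt-4o",
    },
}

def make_client(provider: str, api_key: str):
    """Return an OpenAI-SDK client pointed at the chosen provider, plus model."""
    from openai import OpenAI  # deferred import; pip install openai
    cfg = PROVIDERS[provider]
    return OpenAI(base_url=cfg["base_url"], api_key=api_key), cfg["model"]
```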