Advanced RAG with ColBERT in LangChain and LlamaIndex
- Published May 31, 2024
- ColBERT is a fast and accurate retrieval model, enabling scalable BERT-based search over large text collections in tens of milliseconds. This can be used as a potential alternative to Dense Embeddings in Retrieval Augmented Generation. In this video we explore using ColBERTv2 with RAGatouille and compare it with OpenAI Embedding models.
🦾 Discord: / discord
☕ Buy me a Coffee: ko-fi.com/promptengineering
🔴 Patreon: / promptengineering
💼Consulting: calendly.com/engineerprompt/c...
📧 Business Contact: engineerprompt@gmail.com
Become Member: tinyurl.com/y5h28s6h
💻 Pre-configured localGPT VM: bit.ly/localGPT (use Code: PromptEngineering for 50% off).
Signup for Advanced RAG:
tally.so/r/3y9bb0
LINKS:
Find the Notebook here: github.com/PromtEngineer/Yout...
ColBERTv2 with RAGatouille Video: • Supercharge Your RAG w...
ColBERTv2 Paper: arxiv.org/pdf/2112.01488.pdf
ColBERT Github: github.com/stanford-futuredat...
RAGatouille: github.com/bclavie/RAGatouill...
TIMESTAMPS:
[00:00] Introduction
[00:29] Use ColBERT in LangChain
[08:46] Use ColBERT in LlamaIndex
All Interesting Videos:
Everything LangChain: • LangChain
Everything LLM: • Large Language Models
Everything Midjourney: • MidJourney Tutorials
AI Image Generation: • AI Image Generation Tu...
If you are interested in learning more about how to build robust RAG applications, check out this course: prompt-s-site.thinkific.com/courses/rag
Can you make a video on how to evaluate RAG and compare different RAG approaches?
I would also be interested in this :) especially with open-source LLMs and embeddings. Tried a lot and can't figure out which is the best one.
@@joxxen I'm also waiting for that; if you find any resource, let me know.
Navigating the landscape of storytelling and video experimentation, VideoGPT silently empowers my creative journey, adding a layer of sophistication to my content.
Thanks for sharing
@engineerprompt, can we use a persistent vector DB like Chroma, Qdrant, and others with RAGatouille? So that I can embed the documents once and reuse them for inference later.
It supports only FAISS at the moment for persisting it to disk
I’d like to do RAG over a medical textbook. What strategies would you recommend for chunking? I’m thinking a hierarchical graph structure makes intuitive sense. What are your thoughts on this?
Can't find the Google Colab notebook? Would love to copy it across to my own account and have a play. Not sure if I'm overlooking it? I just see the GitHub link?
Is the Plaid DB persistent? As in, if I do this, how do I connect to that particular DB again?
I am working on a machine that is running Ubuntu and connected to 4x 80GB A100 GPUs. The issue I face is that the RAG.index cell runs forever on this machine, whereas the same code on the Google Colab free tier finishes within seconds. Any insights on how this can be resolved would be helpful. Thanks :)
Is your env able to see the GPUs? Check that torch is actually using the GPU.
@@engineerprompt yes, I run LLMs in the same notebook and it is able to load them onto the GPU. I checked via the nvidia-smi command.
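The check suggested above can be done from Python itself, since nvidia-smi only shows that the driver sees the cards, not that PyTorch can use them (a quick sanity-check sketch):

```python
import torch

# Does PyTorch's CUDA build see any GPU at all?
print(torch.cuda.is_available())

# How many devices are visible? (should be 4 for a 4x A100 box;
# CUDA_VISIBLE_DEVICES can silently hide some of them)
print(torch.cuda.device_count())

if torch.cuda.is_available():
    # Name of the first visible device, e.g. an A100 variant.
    print(torch.cuda.get_device_name(0))
```

If `is_available()` is False while nvidia-smi works, the installed torch wheel is likely CPU-only or built for a mismatched CUDA version.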
How can we use approaches like ColBERT with other languages, such as Portuguese? Thanks!
I think you will have to fine-tune the model for the language first.
Of course the last result is more accurate: you gave it almost 50% more context (5 chunks instead of 3). When using multiple approaches to achieve the same goal, please use the same amount of data; otherwise it is hard to compare the outputs.
On the topic of the number of chunks given to RAG: why define that at all? What if one does not know how many parts may contain relevant information?
How can I monetize whatever is being said as a beginner..?
If you are interested in learning more about the Advanced RAG course, sign up here: tally.so/r/3y9bb0