I've been waiting for a break down like this to help me wrap my head around ColPali. Thanks!!
Haha same here
I'm currently building with Qdrant (love the binary quantization and multi-vector approach to scale retrieval with ColPali). I was wondering if a JS/TS example exists, because that's primarily our tech stack. If not, I'll try to put something out eventually.
Thanks! We don’t have a JS/TS example yet, but we'd love to see what you create if you decide to put one together!
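In the meantime, here's a minimal TypeScript sketch of the late-interaction (MaxSim) scoring that ColPali-style retrieval is built on: for each query-token embedding, take the max dot product over a page's patch embeddings, then sum. This is plain-array math only (no Qdrant client, all names illustrative); in a real setup the embeddings would come from a ColPali model and the search would run server-side.

```typescript
// Late-interaction (MaxSim) scoring over multi-vector embeddings.
type Vec = number[];

function dot(a: Vec, b: Vec): number {
  return a.reduce((s, x, i) => s + x * b[i], 0);
}

// queryTokens: [numQueryTokens][dim], docPatches: [numPatches][dim].
// For each query token, take its best-matching patch, then sum.
function maxSim(queryTokens: Vec[], docPatches: Vec[]): number {
  return queryTokens.reduce(
    (sum, q) => sum + Math.max(...docPatches.map((p) => dot(q, p))),
    0
  );
}

// Rank pages (each a set of patch embeddings) by MaxSim score, best first.
function rankPages(queryTokens: Vec[], pages: Vec[][]): number[] {
  return pages
    .map((patches, idx) => ({ idx, score: maxSim(queryTokens, patches) }))
    .sort((a, b) => b.score - a.score)
    .map((r) => r.idx);
}
```

In production you wouldn't score every page client-side like this; the point of storing multi-vectors in a vector DB is that the MaxSim comparison happens during the search itself.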
I've used Tesseract OCR to get text from images and the result is just OK, certainly nowhere near what you've shown ColPali can do. Will certainly give it a try.
What approach did you take? Did you extract the images and store their text summaries? Then, during retrieval, did you use these summaries along with the original images to get the answer?
Towards the end, she passed the whole image to a large 90B Llama or GPT-4o. What's the point if we have to pass the whole image instead of patches? It would be better if we could get the patches retrieved using ColPali and run some small vision model to extract the answer.
You can, using ColPali's attention mask
@@EvgeniyaSukhodolskaya what does that mean? How do you do it?
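One common way to interpret this (a hedged sketch of the idea, not the exact method from the video): since ColPali scores each page patch against each query token, you can turn those per-patch similarities into a relevance map and crop only the top-scoring patches for a small vision model. Plain arrays here; real patch embeddings come from the model's output and map to a grid over the page image.

```typescript
// Per-patch relevance: each patch's score is its max similarity to any
// query token. Top-scoring patch indices can be mapped back to image
// regions and cropped. Illustrative only.
type Vec = number[];

const dot = (a: Vec, b: Vec): number =>
  a.reduce((s, x, i) => s + x * b[i], 0);

function patchRelevance(queryTokens: Vec[], docPatches: Vec[]): number[] {
  return docPatches.map((p) =>
    Math.max(...queryTokens.map((q) => dot(q, p)))
  );
}

// Indices of the k most relevant patches, best first.
function topPatches(scores: number[], k: number): number[] {
  return scores
    .map((score, idx) => ({ idx, score }))
    .sort((a, b) => b.score - a.score)
    .slice(0, k)
    .map((r) => r.idx);
}
```

This is essentially the same math behind the "similarity map" visualizations people generate for ColPali: it shows which parts of the page drove the retrieval score.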
This is amazing. Punches down OCRs.
Why is this better than asking GPT-4o to read the image?
1) It's a free model
2) It's optimized for retrieval
You can't ask GPT-4o to read 100k pages each time your user wants to find some answer among them :)
@@EvgeniyaSukhodolskaya You can just ask GPT to extract the information you want and save it to the vector DB. You don't have to analyse the image every time.
@@haralc6196 well, then you need to think about every possible question you could answer with this one PDF page, and ask GPT-4o to generate all of them & answer all of them. It's not 100% certain that will cover everything, and if you're doing VRAG and need the PDF to be retrieved regardless (say, to look at the graph/chart), you'll have to save it in the DB anyway. IMO it doesn't make much sense, unless it's a specific Q&A use case / you really want to use GPT-4o for the sake of using it / volumes are too big for ColPali.