Your videos are always clear and really well explained. Thanks. Keep going like this!😇
Great video. I am a college student, and your videos are helping me do my projects. Thank you for such content.
Thank you and glad you are finding it useful.
Awesome, I’m subscribed
This seems like a great technique to also help with entity confusion during retrieval. Sometimes I've noticed that embedding models don't really capture nuanced but important differences between chunks that talk about one company vs another, and that ends up confusing the LLM as well.
Late chunking seems very cost-efficient compared to other approaches, thanks for sharing!
Super interesting once again 👍
Interesting! But I still think Naive RAG is a bit underrated. To properly build contextual retrieval, or any RAG system for that matter, a naive approach lays the foundation. It is also cheap and fast, and if done correctly it works very well. The only thing is that Naive RAG works quite badly for tables, but for text it can work very well.
Use a specific agent for SQL and use a router :)
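Something like this toy sketch of the router idea: send table/number questions to a SQL agent and everything else to the regular RAG pipeline. The keyword check is just a stand-in for whatever classifier you'd actually use, and the handler names are made up.

```python
# Toy router: table-ish questions go to a SQL agent, the rest to RAG.
# The keyword heuristic is a placeholder for a real classifier.
def route(question: str) -> str:
    table_hints = ("sum", "average", "total", "how many", "per quarter")
    if any(hint in question.lower() for hint in table_hints):
        return "sql_agent"    # hypothetical handler for tabular data
    return "rag_pipeline"     # hypothetical handler for free text

print(route("What is the total revenue per quarter?"))  # sql_agent
print(route("Summarize the methodology section."))      # rag_pipeline
```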
Most documents for RAG would be more than 50 pages, so I don't think there's any embedding model with such a huge context window. Please correct me if I'm wrong. I don't see this approach being effective for RAG systems.
I think this needs some clarification. The 8k max token limit doesn't mean you can only embed a document when it's shorter than that many tokens. If you have a document longer than 8k tokens, you can divide it into batches and process each one the way you would for chunking. There might be some discontinuity, but overlap is again your friend here. Hope this clarifies how you would use it.
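Here's a rough sketch of what I mean in plain Python. The `window_size` and `overlap` names are mine, not any particular library's API:

```python
# Split a long token sequence into overlapping windows so each window
# fits the model's context limit. Overlap softens the discontinuity
# at window boundaries.
def overlapping_windows(tokens, window_size=8192, overlap=512):
    step = window_size - overlap
    windows = []
    for start in range(0, len(tokens), step):
        windows.append(tokens[start:start + window_size])
        if start + window_size >= len(tokens):
            break
    return windows
```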
If we chunk into batches, even with overlap, the main property of late chunking, holding the semantic meaning of the whole context, is lost, which makes it less useful. It's an intermediate solution, because the embedding limit is still a challenge.
@engineerprompt That defeats the very purpose of late chunking, doesn't it?
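For anyone following the thread, this is the core of late chunking as I understand it: embed the whole document once at the token level, then mean-pool the token embeddings over each chunk's span. A minimal sketch, assuming the jina-embeddings-v2 model; how you compute the chunk spans is up to you and omitted here.

```python
# Late chunking sketch: one forward pass over the full document, then
# pool token embeddings per chunk. Each chunk vector is built from
# tokens that already attended to the whole document.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "jinaai/jina-embeddings-v2-base-en"  # assumed long-context model
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(model_name, trust_remote_code=True)

def late_chunk(text, spans):
    """spans: list of (start_token, end_token) pairs defining chunks."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        token_embs = model(**inputs).last_hidden_state[0]  # (seq_len, dim)
    # Mean-pool each chunk's token embeddings into one chunk vector.
    return [token_embs[s:e].mean(dim=0) for s, e in spans]
```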
Does the embedding dimension refer to the output length of the response?
Yes, I have the same question.
It's the length of the vector. So a dimension of 3 would be a vector like [2, 3, 5].
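Quick way to check it yourself, assuming you have sentence-transformers installed. The dimension is fixed by the model, no matter how long the input text is:

```python
# The "dimension" is the length of the vector the model returns,
# independent of input length.
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
vec = model.encode("any input text")
print(len(vec))  # 384 for this model, regardless of input length
```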
So how would you combine that with VRAG and context extension in localVisionGPT?
Could we use hybrid search with late chunking? Or is late chunking enough?
@Prompt Engineering
Hybrid will always help. It's hard to beat BM25 :) It's usually really helpful when you have a lot of keywords in your dataset.
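A minimal sketch of what hybrid scoring can look like: BM25 for keywords plus dense vector similarity, blended with a weight. The 50/50 weight and the tiny corpus are arbitrary examples, and you'd bring your own embeddings for the vectors.

```python
# Hybrid scoring sketch: blend normalized BM25 keyword scores with
# dense similarity scores. alpha controls the keyword/dense balance.
import numpy as np
from rank_bm25 import BM25Okapi  # pip install rank-bm25 (assumed)

docs = ["acme q3 revenue report", "globex q3 revenue report"]
bm25 = BM25Okapi([d.split() for d in docs])

def hybrid_scores(query, query_vec, doc_vecs, alpha=0.5):
    kw = np.array(bm25.get_scores(query.split()))
    if kw.max() > 0:
        kw = kw / kw.max()        # normalize keyword scores to [0, 1]
    dense = doc_vecs @ query_vec  # cosine similarity if vectors are unit-norm
    return alpha * kw + (1 - alpha) * dense
```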
Is there an application here where this can enhance knowledge graph generation?
Thanks!
Thanks
Ah, chunking. I love the late chunking idea, but personally I've found optimizing my document formatting for a specific RAG to be the best approach, making sure it gets chunked sensibly. A pain in the ass, frankly, though it can be largely avoided with fractal structuring. But you usually can't do that. Sigh.
I totally agree with this approach and have been advocating for it for a while now with the clients I work with. None of this is magic. You have to spend time with your data to understand it and then build on top of it. The unfortunate part is that people mostly don't want to do the dirty work.