37% Better Output with 15 Lines of Code - Llama 3 8B (Ollama) & 70B (Groq)
- Published 12 Jul 2024
- To try everything Brilliant has to offer, free for a full 30 days, visit brilliant.org/AllAboutAI. You'll also get 20% off an annual premium subscription.
GitHub Project:
github.com/AllAboutAI-YT/easy...
👊 Become a member and get access to GitHub and Code:
/ allaboutai
🤖 Great AI Engineer Course:
scrimba.com/learn/aiengineer?...
📧 Join the newsletter:
www.allabtai.com/newsletter/
🌐 My website:
www.allabtai.com
In this video I try to fix a known problem when using RAG with a local model like Llama 3 8B on Ollama. This local RAG system was improved by adding just around 15 lines of code. Feel free to share and rate on GitHub :)
00:00 Llama 3 Improved RAG Intro
02:01 Problem / Solution
03:05 Brilliant.org
04:26 How this works
12:05 Llama 3 70B Groq
15:12 Conclusion
Another approach to this is to just ask a small LLM to hallucinate an answer to the current question. That answer will not be correct, but it will probably contain the phrases the RAG system needs to find the relevant excerpts. There's a technical term for this idea that I can't remember, but I came across it on the TwoSetAI channel, which has a lot of similar tricks
HyDE, Hypothetical Document Embeddings. Works very well and is easy to implement. A similarity search on a vector database using a hallucinated answer to the question, instead of the question itself, usually gives better similarity
yes this is nice, thnx :)
RAG is a bit too much of an exact match because it is based on concepts and similar concepts. Therefore: no match, no return. HyDE makes the search a bit more fuzzy by expanding the query and introducing more concepts. It would be good to have an evaluator to check the faithfulness of the retrieval and the relevance of the outputs to the original query.
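The HyDE trick the comments above describe can be sketched in a few lines. This is a toy illustration, not the video's code: the bag-of-words "embedding" and the hard-coded hallucinated answer are stand-ins for a real embedding model and a real LLM call.

```python
import math
from collections import Counter

def embed(text):
    # Toy bag-of-words "embedding" used only to illustrate the mechanism;
    # a real pipeline would use a sentence-embedding model instead.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query_text, docs, top_k=1):
    # Rank documents by similarity to the query text.
    q = embed(query_text)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:top_k]

docs = [
    "the llama 3 70b model served on groq responds with very low latency",
    "ollama lets you run llama 3 8b locally on your own machine",
]

question = "which setup answers fastest"

# HyDE step: have a model hallucinate a plausible answer first.
# Hard-coded here; in practice you would call the LLM for it.
hypothetical_answer = "the groq api serves llama 3 70b with very low latency"

# The raw question shares no vocabulary with the docs, but the
# hallucinated answer does, so searching with it finds the right excerpt.
print(retrieve(hypothetical_answer, docs)[0])
```

The point is only the swap: embed the hallucinated answer, not the question, before doing similarity search.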
Dolphin-llama3 & Groq-llama3
are awesome! Well done!
how are they different?
dolphin-llama3:8b-v2.9-fp16 is so good as an assistant!
best AI python coding channel hands down
thnx a lot :D
@AllAboutAI the issue is it assumes the question is related to the content passed in, which is not always the case in a conversation. If you suddenly talk about something else, say "How are you", it will be rewritten to align with the preceding context, which is not what you want. Then you need to implement some extra mechanism, or tweak your prompt to only rephrase when the question seems linked to the past. There are many discussions about this..
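The gating idea in this comment, only rewriting the question when it actually relates to the conversation, could look something like this. The word-overlap heuristic and the 0.2 threshold are illustrative assumptions; a real system might instead ask the LLM itself to classify relatedness.

```python
def related_to_context(question, context, threshold=0.2):
    # Crude relatedness check: fraction of question words that also
    # appear in the prior context. Purely illustrative, not the video's code.
    q = set(question.lower().split())
    c = set(context.lower().split())
    if not q:
        return False
    return len(q & c) / len(q) >= threshold

def maybe_rewrite(question, context):
    if related_to_context(question, context):
        # Placeholder for the real context-aware rewrite call to the LLM.
        return f"{question} (in the context of: {context})"
    return question  # e.g. "how are you" passes through untouched

context = "llama 3 rag retrieval with ollama embeddings"
print(maybe_rewrite("how does the rag retrieval work", context))
print(maybe_rewrite("how are you", context))
```

Small talk is left alone, while on-topic follow-ups still get expanded with the conversation context before retrieval.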
👍👍👍Thanks! Useful information.
direct, didactic, almost textbook explanation. excellent
Great job
thnx :)
Bruuuuuuh, just found this channel, you sure you're human?!?! Wish I had 5% of your brain.... thank you so much for your work! I'm learning so much!!
You just need a bigger neck beard. It’s all in the neck beard.
Based on your experience, why is Ollama better than LM Studio?
How is the retrieval so fast? Did you cut the loading time for context out of the video?
can you please give the code of 70b model?
💎💎🌟💎💎💎💎
first
What about doing the same for the output? One pass is the internal voice; compare it to the prompt to see if it matches up, and a second pass for any corrections. Like giving LLMs an inner voice like we have.
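That two-pass "inner voice" suggestion is essentially draft, critique, revise. A minimal sketch of the control flow, assuming `llm()` is a stub standing in for a real model call (e.g. via the Ollama API); nothing here is from the video's code:

```python
def llm(prompt):
    # Stub standing in for a real model call; returns a tagged echo
    # so the control flow can be followed without a running model.
    return f"[model output for: {prompt[:40]}...]"

def answer_with_inner_voice(question):
    # Pass 1: draft an answer (the "internal voice").
    draft = llm(f"Answer the question: {question}")
    # Compare the draft against the original prompt.
    critique = llm(
        f"Question: {question}\nDraft: {draft}\n"
        "Does the draft fully answer the question? List any corrections."
    )
    # Pass 2: rewrite the draft applying the corrections.
    final = llm(
        f"Question: {question}\nDraft: {draft}\nCritique: {critique}\n"
        "Rewrite the draft applying the corrections."
    )
    return final

print(answer_with_inner_voice("How does HyDE improve RAG retrieval?"))
```

The trade-off is latency: three model calls per answer instead of one, which matters more on a local 8B model than on Groq.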
interesting
the problem and solution is that your setup is stateless
interesting, will look into it
@AllAboutAI LLMs, such as those built on transformer architectures, are fundamentally stateless, meaning they do not inherently maintain information about previous inputs across separate input sequences the way recurrent neural networks do. However, they can emulate state-like behavior through positional and specialized embeddings that incorporate contextual information within a given sequence. While processing data in a stateless manner, the autoregressive nature of many LLMs lets them generate text by sequentially predicting the next token based on the accumulated outputs, mimicking a form of statefulness and allowing them to handle extensive and complex sequences effectively, though each processing step inherently lacks a continuous internal state beyond its immediate inputs.
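The practical consequence of that statelessness is that any chat-style setup must carry the state itself and resend the full history on every turn. A minimal sketch, with `reply_fn` as a stub standing in for a real model call:

```python
def reply_fn(prompt):
    # Stub standing in for a real (stateless) model call.
    return f"[reply to prompt of {len(prompt)} chars]"

class StatelessChat:
    def __init__(self):
        self.history = []  # the only "state" lives on the client side

    def send(self, user_msg):
        self.history.append(("user", user_msg))
        # Rebuild the full prompt from scratch on every turn; the model
        # sees earlier messages only because we resend them here.
        prompt = "\n".join(f"{role}: {text}" for role, text in self.history)
        reply = reply_fn(prompt)
        self.history.append(("assistant", reply))
        return prompt, reply

chat = StatelessChat()
chat.send("hello")
prompt, _ = chat.send("what did I just say?")
print(prompt)  # contains the earlier "hello" turn
```

This is also why the video's fix works at the prompt level: with no internal state to patch, the only place to inject context is the text you send.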