Amazing content!! You deserve more subscribers. This is exactly the kind of tutorial I've been looking for, but NO ONE EXPLAINS THIS. There are hundreds of videos on deploying open-source LLMs locally, but almost no in-depth, high-quality info on deploying to remote servers on AWS, especially for the masses. You earned a new subscriber with this one, keep making great tutorials!
Thanks for sharing ❤️👍
Underrated channel, this is fire content!
So glad you liked this video!! Thanks so much for the kind words and support! 🤩🔥💯🙏
Let's goo. Keep pushing content. Great stuff brother.
Thanks so much, bro!! 🤩🔥 This was a really fun video to put together and I learned a ton in the process! 💡
That's brilliant... I was looking for this!
Outstanding.
Thanks, very interesting bro ❤
Thanks so much for checking out this build! Really glad you enjoyed the AI/ML content! 🙏🔥
Let’s gooo
Wooo! 🔥
I'm trying to create 100,000 reliable tutorials for hundreds of complex software packages like Photoshop, Blender, DaVinci Resolve, etc. Llama and GPT don't give reliable answers, unfortunately. Do you think fine-tuning Llama 7B would be enough (compared to 70B)? Do you know how much time/data that would take?
I also heard about embeddings but couldn't get them to work on a large dataset. Would that be a better option? We have at least 40,000 pages of documentation and I don't know which approach is better.
Really interesting use case (lots to share on this below... 👀)! Llama 13B (which I used in my tutorials) is pretty solid. Jumping to 70B might be overkill in terms of time and resources, especially if you're initially testing feasibility. I'd say test the waters with something smaller like 7B, or 13B like I used, and then decide. There's an inherent trade-off between model size and quality: Llama 70B will generally outperform Llama 7B due to its larger parameter count, but the improvements may be marginal beyond a certain point, and the cost in computation and time can be disproportionately higher for the 70B model. That's where 13B could be the happy medium for testing. Once you've settled on what you want to test for, maybe quickly run a 70B build for a bit and see if the performance is any different. Just keep costs in mind, of course!
Related to embeddings - I've seen the debates too! RAG is awesome, but there are some quirks, especially when handling broad queries. Augmenting LLMs with RAG is particularly effective for specific tasks, but it comes with inherent challenges like the ones you shared. It's all about how you chunk and index your data: make sure your tutorial chunks are bite-sized to get the most out of RAG. RAG handles localized info retrieval well, but struggles with broader queries that require scanning the entire dataset, especially at the scale you're describing (in the 100,000s).
Overall, I'd say start small, test it out, then scale. And with your 40k pages of docs, you've got a goldmine to work with! 💎 Please let me know how you get along with this! Curious to hear how it goes and what you build! 🛠
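To make the chunking point above concrete, here's a minimal sketch (pure standard library, just for illustration) of splitting docs into overlapping, bite-sized chunks - the step that tends to matter most for RAG retrieval quality. In a real pipeline you'd feed these chunks to an embedding model and a vector store; `chunk_size` and `overlap` here are arbitrary example values you'd tune for your docs.

```python
# Word-based chunking with overlap: neighboring chunks share `overlap`
# words so that retrieval doesn't miss answers that straddle a boundary.

def chunk_text(text, chunk_size=200, overlap=40):
    """Split text into overlapping word-based chunks."""
    words = text.split()
    if not words:
        return []
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break
    return chunks

doc = ("word " * 500).strip()  # stand-in for one page of documentation
chunks = chunk_text(doc, chunk_size=200, overlap=40)
print(len(chunks), "chunks; each shares 40 words with its neighbor")
```

The overlap costs some extra storage and embedding compute, but it's usually worth it for dense docs like software manuals.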
Can you explain how many concurrent requests a g5.12xlarge instance can handle when using the Llama 2 7B or 13B model? What would be a solution for such scenarios?
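Not a definitive answer, but a rough back-of-envelope sketch: a g5.12xlarge has 4x NVIDIA A10G GPUs (24 GB each, 96 GB total). With fp16 weights (~2 bytes per parameter), the memory left after loading the model goes to per-request KV cache, which is what bounds concurrency. This ignores framework overhead, activation memory, and sharding inefficiency, so treat the numbers as loose upper bounds:

```python
# Hedged capacity estimate: concurrent 2048-token sequences that fit
# in GPU memory on a g5.12xlarge, assuming fp16 weights and KV cache.

GPU_MEM_GB = 4 * 24  # g5.12xlarge: 4x NVIDIA A10G, 24 GB each

MODELS = {
    # name: (params in billions, transformer layers, hidden dim)
    "llama2-7b": (7, 32, 4096),
    "llama2-13b": (13, 40, 5120),
}

def max_concurrent(name, context_len=2048, bytes_per_param=2):
    params_b, layers, hidden = MODELS[name]
    weights_gb = params_b * bytes_per_param  # fp16 weights, in GB
    # KV cache per token: 2 (K and V) * layers * hidden * 2 bytes (fp16)
    kv_per_seq_gb = 2 * layers * hidden * 2 * context_len / 1e9
    free_gb = GPU_MEM_GB - weights_gb
    return int(free_gb // kv_per_seq_gb)

for name in MODELS:
    print(name, "~", max_concurrent(name), "concurrent 2048-token sequences")
```

Real throughput will be well below these ceilings, and a serving stack with continuous batching (e.g. vLLM or TGI) makes a bigger practical difference than the raw memory math.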
I have used a smaller instance and I'm running into issues with multiple requests, as the instance memory is insufficient to handle them.
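One common mitigation when memory can't hold many simultaneous requests: cap in-flight inference with a semaphore and let excess requests queue, instead of every request hitting the model at once. A minimal standard-library sketch - `MAX_CONCURRENT` is a made-up value you'd tune to what the instance's memory actually supports:

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

MAX_CONCURRENT = 2                 # made-up cap; tune to instance memory
gate = threading.Semaphore(MAX_CONCURRENT)
lock = threading.Lock()
in_flight = 0
peak = 0

def handle_request(prompt):
    """Run 'inference' only when a slot is free; excess requests wait."""
    global in_flight, peak
    with gate:                     # blocks while MAX_CONCURRENT are running
        with lock:
            in_flight += 1
            peak = max(peak, in_flight)
        time.sleep(0.01)           # stand-in for model inference latency
        result = f"reply to {prompt}"
        with lock:
            in_flight -= 1
    return result

# 8 worker threads fire 10 requests, but at most 2 run inference at once.
with ThreadPoolExecutor(max_workers=8) as pool:
    replies = list(pool.map(handle_request, [f"q{i}" for i in range(10)]))

print("peak concurrency:", peak)
```

This trades latency for stability: requests beyond the cap wait in line rather than crashing the instance with an out-of-memory error.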
Amazon SageMaker is good for a few users, but when the number of users reaches 10,000 or 100,000 it is no longer useful.