Deploy Llama 2 on AWS SageMaker using DLC (Deep Learning Containers)

  • Published 21 Aug 2024
  • In this tutorial video, I'll show you how to effortlessly deploy the Llama 2 large language model on AWS SageMaker using Deep Learning Containers (DLC). We'll walk through each step, from accessing pre-built DLC images to configuring SageMaker for Llama 2 deployment, designed to make the process smooth and understandable whether you're new to Generative AI or experienced in the field. (A minimal deployment sketch follows the links below.)
    AWS SageMaker DLC: github.com/aws...
    AI Anytime GitHub: github.com/AIA...
    #ai #llm #python
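
    A minimal sketch of the kind of deployment the video walks through, using the SageMaker Python SDK's Hugging Face LLM (TGI) DLC helpers. The model ID, instance type, and environment values are illustrative assumptions rather than exact values from the video; the non-gated NousResearch/Llama-2-7b-chat-hf mirror is used here, as suggested in the comments below, to avoid the gated-repo issue.

        # Sketch: deploy Llama 2 7B Chat on a SageMaker real-time endpoint via the Hugging Face LLM DLC.
        # Assumes the sagemaker SDK is installed and the code runs with a SageMaker execution role available.
        import sagemaker
        from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

        role = sagemaker.get_execution_role()                     # execution role with SageMaker + S3 access
        image_uri = get_huggingface_llm_image_uri("huggingface")  # pre-built Hugging Face LLM DLC image

        env = {
            "HF_MODEL_ID": "NousResearch/Llama-2-7b-chat-hf",  # non-gated Llama 2 7B Chat mirror
            "SM_NUM_GPUS": "1",                                # number of GPUs to shard across
        }

        model = HuggingFaceModel(role=role, image_uri=image_uri, env=env)

        predictor = model.deploy(
            initial_instance_count=1,
            instance_type="ml.g5.2xlarge",                   # assumption: a single-GPU instance
            container_startup_health_check_timeout=600,      # allow time to download model weights
        )

        # Quick smoke test against the deployed endpoint.
        print(predictor.predict({"inputs": "What is Amazon SageMaker?"}))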

COMMENTS • 42

  • @ashleymavericks
    @ashleymavericks 1 year ago +1

    Waiting for GGML quantised model deployments. Btw, thanks for your videos.

  • @49_jaypandya40
    @49_jaypandya40 4 months ago

    the content is amazing

  • @yashsrivastava4878
    @yashsrivastava4878 5 months ago +1

    Thank you. Can you please make a video on how to fine-tune Mistral 7B on AWS SageMaker with S3 and boto3 (in the form of async jobs)?

  • @shumon29
    @shumon29 11 months ago +2

    I am not able to find the gists. The attached repository has only a LICENSE and README file. Could you please share the repo or gist links?

  • @dchuguashvili
    @dchuguashvili 11 months ago +2

    What is the advantage, if any, of using this approach instead of deploying the Llama 2 model directly from SageMaker JumpStart?

    • @Digitalsmb
      @Digitalsmb 11 months ago

      Would love to know the answer to this too.

  • @user-iu4id3eh1x
    @user-iu4id3eh1x 1 year ago

    So simple.... Thank you

  • @danielmz99
    @danielmz99 1 year ago +3

    Hi, thanks for your videos. Would it be possible to get a video on GGML models being deployed on SageMaker? It is unclear what requirements they have. The fact that they are CPU optimized will help adoption, as many small businesses can't really afford the $40/day hosting cost of a g5.2x LLM plus running costs if all they need is a private LLM. Local deployment might not be an option either, since getting a decent outcome needs a 13B+ model, and even as GGML that requires significant dedicated hardware. I see private cloud GGML deployments as the perfect compromise between cheap running costs and decent functionality for a very large number of use cases. I think it would be a great video. Thanks for your efforts.

    • @AIAnytime
      @AIAnytime 1 year ago +3

      On GGML deployment: soon... Please stay tuned.

    • @ashleymavericks
      @ashleymavericks 1 year ago

      I totally resonate with your viewpoint; I'm exploring similar possibilities for a low-cost setup.

    • @ashleymavericks
      @ashleymavericks 1 year ago +1

      @AIAnytime It would be great if you could deploy a GGML model on AWS compute instances with a REST API compatible with the OpenAI specification (you could leverage the LocalAI project).

  • @sohailhosseini2266
    @sohailhosseini2266 11 months ago

    Thanks for sharing!

    • @AIAnytime
      @AIAnytime 11 months ago

      Thanks for watching!

  • @avijit_barua
    @avijit_barua 1 year ago

    very helpful video!

  • @kaarthikandu
    @kaarthikandu 1 year ago

    Can we use spot instances when deploying the models? Have you tried?

    • @AIAnytime
      @AIAnytime 11 months ago

      You can, but spot instances can be interrupted.

  • @amangrover9343
    @amangrover9343 11 months ago

    I am getting the error "RuntimeError: weight model.layers.0.self_attn.rotary_emb.inv_freq does not exist" while using the Phind/Phind-CodeLlama-34B-v2 model.

  • @mohammadkashif6072
    @mohammadkashif6072 1 year ago +1

    What IAM roles to assign for the first time in AWS SageMaker?

    • @AIAnytime
      @AIAnytime 1 year ago

      SageMaker full access
      S3 full access
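
      A hedged sketch of setting up such a role with boto3, following the answer above. The role name and trust policy are illustrative assumptions, and in practice narrower permissions than the two full-access managed policies are preferable.

          # Sketch: create a SageMaker execution role and attach the two managed policies mentioned above.
          import json
          import boto3

          iam = boto3.client("iam")

          # Trust policy letting the SageMaker service assume this role.
          trust_policy = {
              "Version": "2012-10-17",
              "Statement": [{
                  "Effect": "Allow",
                  "Principal": {"Service": "sagemaker.amazonaws.com"},
                  "Action": "sts:AssumeRole",
              }],
          }

          role = iam.create_role(
              RoleName="sagemaker-llama2-execution-role",   # illustrative role name
              AssumeRolePolicyDocument=json.dumps(trust_policy),
          )

          for policy_arn in (
              "arn:aws:iam::aws:policy/AmazonSageMakerFullAccess",
              "arn:aws:iam::aws:policy/AmazonS3FullAccess",
          ):
              iam.attach_role_policy(
                  RoleName="sagemaker-llama2-execution-role",
                  PolicyArn=policy_arn,
              )

          print(role["Role"]["Arn"])  # use this ARN as the SageMaker execution role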

  • @sravantipris3544
    @sravantipris3544 3 months ago

    Is a GPU required, or can it run on CPU only?

  • @rohitleo9712
    @rohitleo9712 3 months ago

    Hi, can we do this for summarization purposes?

  • @Ankur-be7dz
    @Ankur-be7dz 11 months ago

    While we use the Hugging Face token and secret key, does Hugging Face charge us money? Or is it free?

    • @AIAnytime
      @AIAnytime 11 months ago +1

      No, they don't charge; it's free. They do have an API rate limit, but for you it won't be a problem. Feel free to use it.
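
      To show where that (free) token actually goes in this workflow: if you deploy a gated repo such as meta-llama/Llama-2-7b-chat-hf instead of a non-gated mirror, the token is typically passed to the container as an environment variable. A small sketch, assuming the Hugging Face LLM DLC setup from the description at the top of the page; the token value is a placeholder.

          # Sketch: container environment for a gated Hugging Face model.
          env = {
              "HF_MODEL_ID": "meta-llama/Llama-2-7b-chat-hf",  # gated repo: requires accepted license + token
              "HUGGING_FACE_HUB_TOKEN": "hf_xxx",              # free read token from your Hugging Face account
              "SM_NUM_GPUS": "1",
          }
          # Pass this env dict to HuggingFaceModel(..., env=env) as in the deployment sketch above.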

  • @efexzium
    @efexzium 9 months ago

    How can we deactivate this endpoint?
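
    A common way to shut down and clean up a SageMaker real-time endpoint, sketched here as one possible answer; the endpoint name is a placeholder, and deleting the endpoint stops the billing for the underlying instance.

        # Sketch: tear down a SageMaker endpoint and its associated resources with boto3.
        import boto3

        sm = boto3.client("sagemaker")
        endpoint_name = "llama2-endpoint"  # placeholder: your actual endpoint name

        sm.delete_endpoint(EndpointName=endpoint_name)                # stops the instance and billing
        sm.delete_endpoint_config(EndpointConfigName=endpoint_name)   # SDK deployments usually reuse the endpoint name
        # sm.delete_model(ModelName=...)  # optionally remove the model object as well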

  • @karamjittech
    @karamjittech 1 year ago

    Awesome video. But how can we fine-tune and use a RAG approach?

    • @AIAnytime
      @AIAnytime 1 year ago +4

      Coming soon... Will use the same deployed LLM for a RAG-based application.

  • @user-ie9hr5sl8h
    @user-ie9hr5sl8h 1 year ago

    Can you show how to do it on AWS EC2 instances?

  • @PrasadPrasad-hi7pl
    @PrasadPrasad-hi7pl 1 year ago

    Could you please make a tutorial on deploying a chatbot for PDF files using SageMaker? Thank you in advance.

    • @AIAnytime
      @AIAnytime 1 year ago +2

      Yes, I will use the same deployed model for this use case. This will be my next two videos: next will be the Lambda function and API Gateway, and then the chatbot for your knowledge base.

  • @VenkatesanVenkat-fd4hg
    @VenkatesanVenkat-fd4hg 1 year ago

    Highly appreciated, thanks for your videos. I have got an error:
    "AWS SageMaker Endpoint Failed. Reason: The primary container for production variant AllTraffic did not pass the ping health check. See the CloudWatch logs." This happens even though I have run the same code as in the Hugging Face deploy video for Llama 2 7B, but Falcon 7B runs fine. Any help...

    • @AIAnytime
      @AIAnytime 1 year ago +1

      Thank you! The issue is the gated model... Can you use this model: NousResearch/Llama-2-7b-chat-hf? It's the same but not gated... This should deploy fine.

    • @VenkatesanVenkat-fd4hg
      @VenkatesanVenkat-fd4hg 1 year ago

      @@AIAnytime Thanks for your kind response. I have successfully deployed 7B just today, but 13B needs an AWS quota increase... (I found the related error.) Can I try a quantized version of 13B without the AWS quota problem? Kindly reply...

    • @mydsworld3130
      @mydsworld3130 1 year ago

      @@VenkatesanVenkat-fd4hg It's throwing the same error you wrote about before (for the 7B model). I am not able to figure it out. Can you please share how you figured it out?

    • @VenkatesanVenkat-fd4hg
      @VenkatesanVenkat-fd4hg 1 year ago

      @@mydsworld3130 Check the CloudWatch logs...
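
      For the "check the CloudWatch logs" step: SageMaker writes each endpoint's container logs to a log group named /aws/sagemaker/Endpoints/<endpoint-name>. A small boto3 sketch for reading them; the endpoint name is a placeholder.

          # Sketch: print the endpoint container logs that the health-check error points to.
          import boto3

          logs = boto3.client("logs")
          group = "/aws/sagemaker/Endpoints/llama2-endpoint"  # placeholder endpoint name

          for stream in logs.describe_log_streams(logGroupName=group)["logStreams"]:
              events = logs.get_log_events(logGroupName=group, logStreamName=stream["logStreamName"])
              for event in events["events"]:
                  print(event["message"])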

  • @jayasuriyap8748
    @jayasuriyap8748 7 months ago

    Kindly make a video on how to deploy in Azure.