I am loving these tutorials! I would like to see you do an in-depth one on using vLLM as an API endpoint for serving LLMs on an Azure Kubernetes cluster. It would be so useful to the community, as we could then use quantized Llama 3 70B models on very cheap GPUs to help serve applications. It would be just amazing for the community, and you could then use that to help make agents in the LangGraph tutorials, bro. I would love it.
Great content! By the way, do you have any videos on deploying a LangGraph app to the cloud (Google, AWS, Azure)?
Thanks, your videos really show the potential of this open-source framework.
I’m really enjoying your channel and content.
Thanks 😊
Thank you!
You bet!
10/10