Deploying a knowledge-based chatbot with RAG in production
Вставка
- Опубліковано 21 вер 2024
- On this webinar Boris Popov, CSA at Nebius AI, talk about the deployment of a knowledge-based chatbot using RAG in a production environment. This implementation leverages open source technologies and is powered by NVIDIA® H100 Tensor Core GPUs. We will also discuss the integration of Kubernetes, Cuda, Triton Server, TensorRT, Milvus, PyTorch, and Llama2.
During this session, we will cover:
- Techniques for deploying RAG in a production setting using open source tools.
- The foundational architecture of RAG, customized for efficient scalability in production environments.
- A live demonstration of the chatbot deployment, emphasizing practical deployment strategies and operational considerations.
nebius.ai/