Deploying a knowledge-based chatbot with RAG in production

Поділитися
Вставка
  • Опубліковано 21 вер 2024
  • On this webinar Boris Popov, CSA at Nebius AI, talk about the deployment of a knowledge-based chatbot using RAG in a production environment. This implementation leverages open source technologies and is powered by NVIDIA® H100 Tensor Core GPUs. We will also discuss the integration of Kubernetes, Cuda, Triton Server, TensorRT, Milvus, PyTorch, and Llama2.
    During this session, we will cover:
    - Techniques for deploying RAG in a production setting using open source tools.
    - The foundational architecture of RAG, customized for efficient scalability in production environments.
    - A live demonstration of the chatbot deployment, emphasizing practical deployment strategies and operational considerations.
    nebius.ai/

КОМЕНТАРІ •