Це відео не доступне.
Перепрошуємо.

Azure OpenAI Service - Rate Limiting, Quotas, and throughput optimization

Поділитися
Вставка
  • Опубліковано 15 сер 2023
  • This video explains how Azure OpenAI Service's rate limiting and quota configuration works and shows suggestions for optimizing the throughput for a given model.
    Blog post: clemenssiebler.com/posts/unde...
    #azure #openai #gpt4

КОМЕНТАРІ • 7

  • @jonathanbarton5243
    @jonathanbarton5243 2 місяці тому

    Most concise explanation - thank you

  • @Stateoftheheart
    @Stateoftheheart 6 місяців тому +1

    Thank you, Clemens, very helpful! Keep them coming :)

  • @jagadeeskumarlenin5517
    @jagadeeskumarlenin5517 7 місяців тому

    Is it only supported for round robin only ?

  • @jagadeeskumarlenin5517
    @jagadeeskumarlenin5517 7 місяців тому

    Thanks for this video. May i know what is the user hit limt for 240k token. (Per second or per minute)

    • @Leavinggermany
      @Leavinggermany 7 місяців тому +1

      It’s in TPMs, so Tokens per Minute. There’s now also a dynamic quota feature that allows to go over that limit in case there is capacity. 👍🏻

  • @nclub976
    @nclub976 6 місяців тому

    Hello. I want to use Chatgbt4 Turbo vision for my application however I am not sure about the charges I am paying the way of calculation is very confusing to me. Does anyone know for sure what is paid on Azure open ai for using the Chatgbt 4 Turbo vision model, is it just spent tokens or something extra,host? Thank you

    • @clemenssiebler
      @clemenssiebler  6 місяців тому

      Azure OpenAI just charges you for the tokens you consume when you use pay as a you go! 👍🏻