Creating APIs For Machine Learning Models with FastAPI

Поділитися
Вставка
  • Опубліковано 25 сер 2024

КОМЕНТАРІ • 26

  • @omegasigma4500
    @omegasigma4500 4 місяці тому +15

    I'm glad you uploaded a video about FastAPI. We prefer it over Flask.
    There are 2 topics where we need some help.
    1.) Hosting - How to deploy the app so that other can access it via web? And how to manage the cloud infrastructure?
    2.) Frontend - There are now plenty of frameworks and libraries. The standard approach is probably JavaScript, HTML and CSS. But I'm wondering what you think about pure Python libraries like Taipy, FastUI and reflex. What do you think is the best approach here? We would highly appreciate your input. Thanks!
    Keep up the great work! 💪💪👍👍

  • @khandoor7228
    @khandoor7228 4 місяці тому +2

    This was excellent, the capabilities this opens up is really powerful. Good job as always.

  • @TheDigitalSight
    @TheDigitalSight 4 місяці тому +10

    We use FastAPI more than django and flask, can you please create video on langchain and fastapi as well?

  • @dipeshsamrawat7957
    @dipeshsamrawat7957 4 місяці тому +1

    You are making requested videos. Thank you 💯

  • @cheukmingau983
    @cheukmingau983 4 місяці тому +3

    In production the async endpoint should not be used. An async function (coroutine) will be executed in the main thread event loop, and like the event loop in JS inside the browser, it can only execute one coroutine at a time. Running the synchronous, cpu intensive `model.predict` inside the async endpoint will make your prediction endpoint frozen and wait for the underlying cpu predicting the images, so the QPS of your handler is at most one.
    Better options could be: 1) Using a synchronous function as the inference endpoint, 2) create a threadpoolexecutor outside of the async function, and use `loop.run_in_executor()` with the threadpoolexecutor declared as it will run the model inside the thread, or 3) use poolexecutor similar to option 2. The problem for option 3 is that multiprocesses requires pickling and you might have to tweak your model case by case.
    Also, pickling the model and deserialize in the application api server doesn't reveal the identity and method signatures of that model. If you are the only one who train and deploy that might not be a big problem, but in production you might want to use some inferencing frameworks like Onnxruntime which you just serialize your trained model to the preferred format first (onnxruntime has a very small package size compared to other DL libraries which makes the deployment dependency smaller). Lastly, running scikit-learn model in python doesn't utilize the all the cores in your cpu, whereas other packages usually have higher utilization proportion.
    I understand that the model in this video is small in size and is a POC, so with the small size running async and pickling is fine. However, for just some even better CV and NLP models (e.g. BERT) it is nearly impossible to adopt the same approach as in this tutorial.

    • @alexandrosmaragkakis737
      @alexandrosmaragkakis737 4 місяці тому

      @cheukmingau983 hello, I'm currently facing a problem very similar to what you described, is there somewhere I can message you to get more info on this?

    • @cheukmingau983
      @cheukmingau983 4 місяці тому

      @@alexandrosmaragkakis737 perhaps here? You can state your situation just with the minimal details

  • @timothyelems1357
    @timothyelems1357 4 місяці тому

    Exactly what I was looking for! Thanks man!

  • @NomadicBrian
    @NomadicBrian Місяць тому

    I've been using fastAPI for a couple of years now. Just starting with AI models. I had planned on calling models with fastAPI. See if I can do that with a ViT model I've been working with.

  • @smstudio1035
    @smstudio1035 4 місяці тому +1

    Can we see a hosting video off the same

  • @thelifehackerpro9943
    @thelifehackerpro9943 4 місяці тому +2

    why not use model directly instead of pickle?

    • @tthcan8038
      @tthcan8038 4 місяці тому +5

      Pickle makes your model remember the weights, so you need to fit only once

  • @hardline_fc
    @hardline_fc 4 місяці тому

    For me it correctly guesses only numbers 4, 6. For the rest it says they're 7 or 5.

  • @zedcodes
    @zedcodes 4 місяці тому

    Why do I have this error: 'module 'PIL.Image' has no attribute 'ANTIALIAS''? @10:41

    • @bobfreeman7349
      @bobfreeman7349 4 місяці тому +2

      ANTIALIAS was removed in Pillow 10.0.0 (after being deprecated through many previous versions). Now you need to use PIL.Image.LANCZOS or PIL.Image.Resampling.LANCZOS.
      (This is the exact same algorithm that ANTIALIAS referred to, you just can no longer access it through the name ANTIALIAS.)

    • @zedcodes
      @zedcodes 4 місяці тому

      @@bobfreeman7349 That worked, thank you. I used this and it worked:
      pil_image = pil_image.resize((28, 28), PIL.Image.LANCZOS)

  • @DougSteinberg7410
    @DougSteinberg7410 2 місяці тому

    Great video! I'm a web developer and new to ML. Do you have the source code for this project in a Github repo? I would really love to try this out locally.

  • @ebenezervictor7297
    @ebenezervictor7297 2 місяці тому

    I like this guy

  • @Lyphnet
    @Lyphnet 4 місяці тому

    Please ensure that your Discord server remains joinable. Thanks!

  • @systembreaker4651
    @systembreaker4651 4 місяці тому +1

    What is your daily Linux distro ❤

    • @Wanhatoman
      @Wanhatoman 4 місяці тому

      popOs

    • @samstar1290
      @samstar1290 4 місяці тому

      Arch linux hyprland

    • @kmano2915
      @kmano2915 4 місяці тому +1

      He is using Linux mint . I am using ubuntu

  • @tcgvsocg1458
    @tcgvsocg1458 4 місяці тому

    i am not entire sur i understand how that worst but thx a lot for the video