[UPDATED] ViViT & NaViT papers: How Sora encoded space-time patches | Shawn's ML Notes

  • Published 30 Oct 2024

COMMENTS • 12

  • @xiaosean
    @xiaosean 2 months ago +1

    Thank you for the excellent video! Your explanations were very clear, and I really appreciated how you covered so many concepts in just one video. The slides were also very well-organized and intuitive. I’m particularly curious about what tool you used to create the slides. Could you share that with us? I’m looking forward to your future videos!
    Thanks again!

    • @yuxiangwang9624
      @yuxiangwang9624  2 months ago

      Thank you for your kind words! It's just PowerPoint, haha. I used a lot of Morph transitions, though, along with its built-in recording functionality!

  • @xiaojinyusaudiobookswebnov4951
    @xiaojinyusaudiobookswebnov4951 4 months ago +1

    I learned a lot from your videos. Please keep them coming. They are worth all the time and effort it takes to produce them.

  • @matin2021
    @matin2021 5 months ago

    Hi
    I am very happy that I was able to find your channel on YouTube
    I hope you will make more videos about computer vision
    Keep going ✌

  • @Kamlin001
    @Kamlin001 3 months ago +1

    Amazing video! Thanks. I was wondering how a fusion model might work with NaViT feeding into the ViViT factorised encoder, given different resolutions? Perhaps you can feed just the tokens?

    • @yuxiangwang9624
      @yuxiangwang9624  2 months ago

      Thank you! Sorry that I just saw your message. Yes I would think so! NaViT already handles multi-resolution well in managing how tokens are attended, so that would work?

  • @abhranilchandra2775
    @abhranilchandra2775 6 months ago

    This is such a great explanation! Do you plan to cover the "DiT: Scalable Diffusion Models with Transformers" paper sometime soon?
    Thanks a lot for such wonderful and insightful explanations...

    • @yuxiangwang9624
      @yuxiangwang9624  6 months ago

      Thank you for the kind words! That's a good idea and let me look into it. :)

  • @SpaceTime8285
    @SpaceTime8285 6 months ago

    Great voice. For fun, audition for a voice actor gig. Would look great on resume. Or on a date or at a conference. Lol