Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Поділитися
Вставка
  • Опубліковано 17 гру 2024

КОМЕНТАРІ • 54

  • @osmanmamudu471
    @osmanmamudu471 Рік тому +1

    Great content on SOTA model architectures! Thank you

  • @ashimasingla103
    @ashimasingla103 10 місяців тому

    Dear Aarohi
    Your channel is very knowledgeable & helpful for all Artificial Intelligence/ Data Scientist Professionals. Stay blessed & keep sharing such a good content.

  • @mvi29
    @mvi29 Рік тому

    Best video ever!! Clear explanations. Thanks a lot.

  • @shitizgoel5027
    @shitizgoel5027 5 місяців тому

    Really very helpful video Ma'am to understand the concept of Swin Transformer.

  • @Sunil-ez1hx
    @Sunil-ez1hx 10 місяців тому

    Worth watching it again

  • @soravsingla6574
    @soravsingla6574 Рік тому +1

    Hello Ma’am
    Your AI and Data Science content is consistently impressive! Thanks for making complex concepts so accessible. Keep up the great work! 🚀 #ArtificialIntelligence #DataScience #ImpressiveContent 👏👍

  • @johnthomas6457
    @johnthomas6457 9 місяців тому

    Great explanation. Keep adding new videos :)

  • @salvadornunez23
    @salvadornunez23 5 місяців тому

    muy bueno el contendio de tus tutoriales, gracias x compartir

  • @luiscoelho3523
    @luiscoelho3523 Рік тому

    Great video! Congratulations!

  • @Sunil-ez1hx
    @Sunil-ez1hx Рік тому +1

    Thank you soo much mam for this amazing video

  • @muhammadatique4293
    @muhammadatique4293 Рік тому

    You are just amazing by explaining it so simple

  • @nandiniloku7747
    @nandiniloku7747 10 місяців тому

    mam can you please share me the link where i can get pre traine weights for swin transformers

  • @jeffg4686
    @jeffg4686 9 місяців тому

    Nice. Is this like the new way for vision transformers, or is this specific for certain tasks?

    • @CodeWithAarohi
      @CodeWithAarohi  9 місяців тому +1

      The SWIN Transformer is designed for visual recognition tasks, particularly suited for processing high-resolution images efficiently. It breaks down images into patches and processes them hierarchically, making it useful for tasks like classification, object detection, and segmentation.

  • @deeper_learning
    @deeper_learning 7 місяців тому

    Thanks for the clear explanation. I just couldn't understand how the Masked MSA works. If I could find out, I'll write back.

    • @deeper_learning
      @deeper_learning 7 місяців тому

      we have already done the patch partitioning. So, after the window shift, pixels in one local window may come from several local windows that are not adjacent. What should we do? so a masking mechanism is employed to limit self-attention computation to within each sub-window. Then the computed values are return to their original positions.

  • @marzi869
    @marzi869 Рік тому

    Thanks a lot, I can't believe there is just 40 likes!!!!!!!

  • @deepalikumari30
    @deepalikumari30 Рік тому

    superb explanation

  • @harharmahadev8624
    @harharmahadev8624 Рік тому

    Wow, great explanation...... 👍

  • @anantmohan3158
    @anantmohan3158 Рік тому

    Nicely Explained..!

  • @SS-zq5sc
    @SS-zq5sc Рік тому

    This was extremely helpful thank you very much. I subscribed.

  • @sadafsolangi4194
    @sadafsolangi4194 Рік тому

    Hello, mam I'm working on the diagnosis of skin diseases and I have done implementation by using Vit, so I wanna ask if should i use Swin transformer and concatenate both Vit and Swin transformer together to make novelty? I need your suggestion, please.

    • @CodeWithAarohi
      @CodeWithAarohi  Рік тому

      Yes, you can experiment with combining ViT and Swin Transformer using ensemble methods to potentially improve skin disease diagnosis, but carefully evaluate individual performances and consider computational resources.

    • @FinalProject-rw1yf
      @FinalProject-rw1yf 6 місяців тому

      Hi, have you done the ensemble of vit and swin transformer?

  • @sanjoetv5748
    @sanjoetv5748 Рік тому

    please make a landmark detection here in vision transformer. i greatly in need for this project to be finished and the task is to create a 13 landmark detection using vision transformer. and i cant find any resources that teaches how to do a landmark detection if vision transformer. this channel is my only hope.

  • @aliafkari874
    @aliafkari874 Рік тому

    Keep up the good work😊

  • @palurikrishnaveni8344
    @palurikrishnaveni8344 Рік тому

    Awesome madam, everytime eagrly waiting yours video, way of explanation is very clear and every one can understood easily
    Waiting for implementation video also
    One request I saw yours all gans videos, but if possible can you make conditional dcgan implementation video for any color images.
    Happy learning

  • @soravsingla6574
    @soravsingla6574 Рік тому

    Awesome

  • @pifordtechnologiespvtltd5698
    @pifordtechnologiespvtltd5698 9 місяців тому

    Amazed

  • @vasut6047
    @vasut6047 Рік тому

    Hi mam, If possible do a video on how to implement Meta-DeTR

  • @tikendraw
    @tikendraw Рік тому

    I understand you studied it thoroughly, but can you implement the transformer, I am asking because I myself find it difficult to create this transformers in papers. Can you? Do you? Good work btw .

    • @CodeWithAarohi
      @CodeWithAarohi  Рік тому +1

      Next video will be a implementation of Swin Transformer.

  • @soravsingla6574
    @soravsingla6574 Рік тому

    Code with Aarohi is Best UA-cam channel for Artificial Intelligence
    #BestChannel #UA-camChannel #ArtificialIntelligence #CodeWithAarohi #DataScience #Engineering #MachineLearning #DataAnalysis #BestLearning #LearnDataScience #DataScienceCourse #AytificialIntelligenceCourse #Codewithaarohi #CodeWithAarohi