CUDA Programming on Python

Поділитися
Вставка
  • Опубліковано 27 сер 2024

КОМЕНТАРІ • 770

  • @aishahoura2619
    @aishahoura2619 Рік тому +160

    Thank you so much for responding to my request for making a CUDA programming. I have donated 0.1 BTC to your account as a way to thank you. My professor has done so many hours trying to explain CUDA and none of my classmates really understood. I just can not believe that you do all this for free and that is why me and my classmates have decided to collect some funds to donate to you.
    Thanks for all that you do and please keep going.

  • @sksk-lo8kc
    @sksk-lo8kc Рік тому +91

    That was very well explained. I have only have taken one course, and you made it clearer than my professor or fellow students ever did.

  • @denizart2255
    @denizart2255 Рік тому +47

    You just opened my eyes to parallel programming. Thanks for the quick overview.

    • @mdrubelahmed6434
      @mdrubelahmed6434 Рік тому

      Too hard to find high -quality content like this these days. Thank you so much

  • @Drex.Yt1
    @Drex.Yt1 Рік тому +79

    Too hard to find high-quality content like this these days. Thank you so much

  • @nilsu1941
    @nilsu1941 Рік тому +106

    12:36 This guy is a God !

  • @leonelaguilera9059
    @leonelaguilera9059 Рік тому +72

    this was such an excellent video

  • @aoungamingyt3160
    @aoungamingyt3160 Рік тому +69

    Thank you so much. Probably the best introdution to CUDA with Python. The example you use, while very basic, touches on usage of blocks, which is usually omitted in other introduction-level tutorials. Great stuff! Hope you return with some more videos. I have subscribed!

  • @tatldunyas2471
    @tatldunyas2471 Рік тому +1

    Just did my research and this guy is at one of the most prestigious universities in the world ! No wonder why his lectures come up neat !

  • @sedat3218
    @sedat3218 Рік тому

    I have been looking into gpu programming using numba and python for a while, this seems to be the best tutorial I was able to find so far.. . thank you

  • @RAVIShankar-bm4ou
    @RAVIShankar-bm4ou Рік тому +5

    Thank you so much for this series! It's so clear and easy to follow

  • @alizz3984
    @alizz3984 Рік тому

    I feel like Cuda has been demystified. Very glad I found your series.

    • @user-sh6jw1ce3m
      @user-sh6jw1ce3m Рік тому

      Thanks for the comment... contact me for information and profitable investment strategies..⤴️

  • @apogeetheboss9999
    @apogeetheboss9999 Рік тому

    as a data scientist +2 years of experience, i ALWAYS learn something new with your content! please nich, never stop doing this things, and also, never cut your smile in your face, even if your are having bugs!!

  • @calskanadam1001
    @calskanadam1001 Рік тому

    wanted to comment that the information in this presentation is very well structured and the flow is excellent.

  • @erenbasak7694
    @erenbasak7694 Рік тому

    Hey this is super useful! I elected High Performance Computing and Microprocessors and Embedded Systems modules for my degree, and this channel has become my go-to guide.

  • @maher9422
    @maher9422 2 місяці тому +15

    يعطيك العافية اخوي احمد
    بدي اطلب منك طلب بسيط انه تعمل نفس الكورس عربي عارف مش حيكون كثير المشاهدات. ولكن
    اخوانك يحتاجوك اكثر من الاجانب
    انا بفهم عليك بس فيه غيري بيحبوا المجال وبيحبوا يتعلموه بلغتهم.
    اذا انت ما تملك الوقت اسمحلي اترجم الفيديو واشرحه عندي بلايك من عندك على انك موافق.

  • @LouieVon
    @LouieVon Рік тому

    This is the best introduction to CUDA I've seen, thanks a lot !

  • @Fiekriekd
    @Fiekriekd Рік тому

    and that's what I call a great tutorial. Thankyou sir. I wish you make more tutorials.

    • @user-sh6jw1ce3m
      @user-sh6jw1ce3m Рік тому

      Thanks for the comment... contact me for information and profitable investment strategies...⬆️

  • @astaadxofficials7813
    @astaadxofficials7813 Рік тому

    I have no idea what kind of videos i am watching ... but i sure will learn

  • @todotasks7645
    @todotasks7645 Рік тому

    Too hard to find high-quality content like this these days. ⚡

  • @Kvmizo
    @Kvmizo Рік тому

    This was by far one of the most enlightening videos you have put up on your channel. Thanks and keep up the good work!!

  • @fatihcalidkan2254
    @fatihcalidkan2254 Рік тому

    holy shit, i was looking into this to speed up my mandelbrot-zooms and they are what you use as an example! This is a dream come true!

  • @aspheronn
    @aspheronn Рік тому

    Wow It is really awesome! It is much better than a tutorial from university! Thanks!

  • @cavansirmahmudov217
    @cavansirmahmudov217 Рік тому

    You saved me, i had to read the PointNet2 implementation for my BCS thesis. this made the job much easier!

  • @adeeshaamabidu9616
    @adeeshaamabidu9616 Рік тому

    Love the channel Nicholas, have recently graduated from an NLP Master's degree and seeing you explain stuff in a simpler way and your coding challenges is really helping me connect with the material I've learned! Keep it up and I'll keep watching!

  • @mustafasamet2783
    @mustafasamet2783 Рік тому

    Ayyyy, so glad you like it @Patrick. For the last two weeks I've just been making videos on stuff I find hard or want to get my head around I figure it's not just me staring there at some of these concepts like huh?!? Thanks for checking it out!!

  • @mrfcbs1251
    @mrfcbs1251 Рік тому

    Oh Ahmad , your tutorials are incredible and inspiring....

  • @muradhesenov5245
    @muradhesenov5245 Рік тому

    the essence of Deep learning in a few lines of code... awesome

  • @excolabirbuyuyecek9438
    @excolabirbuyuyecek9438 Рік тому

    LOL. Loved the graphic at 6:23! Brought tears to my eyes.

  • @user-yu4sr1ni3l
    @user-yu4sr1ni3l Рік тому

    OHHHH MANNN, I thought about doing that but I was debating whether I'd hit the 15 minute deadline already. Good suggestion @Julian!

  • @tajaochrisciamae4121
    @tajaochrisciamae4121 Рік тому

    what a passionate tutorial! I wish you were my professor for my parallel programming course. Well done!

  • @notachannel2601
    @notachannel2601 Рік тому

    It is effectively a very easy approach to harness the power of cuda in simple python scripts.

  • @merthanozer2964
    @merthanozer2964 Рік тому

    Ohh, yes, Thank you, and the documentation at nvidia site about CUDA is highly professionally written. Thank you.

  • @beratcansamur1517
    @beratcansamur1517 Рік тому

    An insanely underrated series!!!

    • @user-sh6jw1ce3m
      @user-sh6jw1ce3m Рік тому

      Thanks for the comment... contact me for information and profitable investment strategies..⤴️

  • @a.s.m.rashedchowdhury784
    @a.s.m.rashedchowdhury784 Рік тому

    opened my eyes to parallel programming

    • @user-sh6jw1ce3m
      @user-sh6jw1ce3m Рік тому

      Thanks for the comment... contact me for information and profitable investment strategies..⬆️

  • @lixaxel6815
    @lixaxel6815 Рік тому

    Excellent example of vector addition of using for loop and using CUDA

  • @Animals-vi5wt
    @Animals-vi5wt Рік тому

    Woah congrats @Ally 🎊 🎉 glad you’re enjoying the challenges, plenty more to come!!

  • @teamchanel4384
    @teamchanel4384 Рік тому

    I'm doing an internship in a research lab and I'll have to program some kernels to implement Blas primitives, this video really helps :)

    • @user-sh6jw1ce3m
      @user-sh6jw1ce3m Рік тому

      Thanks for the comment... contact me for information and profitable investment strategies..⤴️

  • @peloizol8947
    @peloizol8947 Рік тому

    I like how you did the website for documenting the video notes for reference later

  • @rithusvlogtime3342
    @rithusvlogtime3342 Рік тому

    Fantastic tutorials on CUDA. You deserve more followers.

    • @user-sh6jw1ce3m
      @user-sh6jw1ce3m Рік тому

      Thanks for the comment... contact me for information and profitable investment strategies..⤴️

  • @theatlantisreport1595
    @theatlantisreport1595 Рік тому

    This reminds me a lot of the computer tutorial tapes from the 90s

  • @TheTurksxayers
    @TheTurksxayers Рік тому

    Thank you for this great introduction to numba and more specifically numba+cuda.

  • @uniquevlogsbyadil
    @uniquevlogsbyadil Рік тому

    This was oddly intense. Great job Nicholas! Even though you ran out of time, this video is still a win to me. 😉

  • @hgmalani21
    @hgmalani21 Рік тому

    It's very informative and a good intro to CUDA programming. Thanks very much!

  • @totallycz6819
    @totallycz6819 Рік тому

    You are a lifesaver @Spencer, will do it next time i'm on the streaming rig!

  • @yasincaferzade8069
    @yasincaferzade8069 Рік тому

    Great video, I like this kind of video where you code some AI task counterclock, you teach us the concepts and show us the reality of implementing it👏

  • @-oof1016
    @-oof1016 Рік тому

    Ahmad , thanks for taking time to create these videos. It is unfortunate that people view your videos and then feel inspired to complain about a free gift. Folks could just keep it moving or add helpful insights.

  • @bununsemid8739
    @bununsemid8739 Рік тому

    Perfect Video! Saw was revealing to me to understand how it works. Thank you! I am a new subscriber of your channel. Regards from Buenos Aires, Argentina

  • @turkceraplyrics6701
    @turkceraplyrics6701 Рік тому

    Hey Ahmad , I love watching your videos because of the way you tell the story. Great graphics mate. Love the reference to rocket man too... lol keep up the good work.

  • @bangzayy520
    @bangzayy520 Рік тому

    I was needing this!!! Thanks a lot, Sir!!!!

  • @denizugurbiltekin622
    @denizugurbiltekin622 Рік тому

    Thank you so very much. This is the exact kind of material I was looking for on this very specific subject. Kudos.

  • @mipxello7749
    @mipxello7749 Рік тому

    Very well explained. The best CUDA explaination I have come across uptil now 😊😊. Keep up the spirits sir.👍👍

    • @user-sh6jw1ce3m
      @user-sh6jw1ce3m Рік тому

      Thanks for the comment... contact me for information and profitable investment strategies...⤴️

  • @vanshd6884
    @vanshd6884 Рік тому

    PS. I really so moved for your stock price episode. thank you so sosososo much.

  • @besttwitcher4569
    @besttwitcher4569 Рік тому

    YESSSS, right?! Glad you liked it Miguel!

  • @pusher7051
    @pusher7051 Рік тому

    glad to see you take it as a feedback and not as a hate comment

  • @bodyprodaction9718
    @bodyprodaction9718 Рік тому

    Well just built a new rig with a 980ti and a 4790k so I'm gonna put that to test. Thank you for your wonderful explanation :D

  • @blackiselia8054
    @blackiselia8054 Рік тому

    i need to say this: you are the gamechanger here!!

  • @yusifhsnov1802
    @yusifhsnov1802 Рік тому

    So stoked you liked it 🙏

  • @ricardomilos5889
    @ricardomilos5889 Рік тому +1

    This was a great video to me, I have very limited C++ experience and was looking for an explanation of CUDA. Another video like this could easily have been 70-80% over my head. This one was only about 15% whoosh. And now I actually find C++ interesting again!

  • @crystalannringor1025
    @crystalannringor1025 Рік тому

    Thank you so much for this video. It has helped me massively to prepare for my computer science exam.

  • @weds8296
    @weds8296 Рік тому

    Thanks for making all these topics very approachable!

  • @emirliemirxan2687
    @emirliemirxan2687 Рік тому

    Sir,make more detailed sessions on CUDA,your explanation is great

  • @berkozdeser8460
    @berkozdeser8460 Рік тому

    this is extremely helpful. you did an amazing job explaining the foundations

    • @user-sh6jw1ce3m
      @user-sh6jw1ce3m Рік тому

      Thanks for the comment... contact me for information and profitable investment strategies..⤴️

  • @difenbahya3465
    @difenbahya3465 Рік тому

    Thanks for the video, I found the first half and the wrap up really excellent.

  • @maishayasuperstar3699
    @maishayasuperstar3699 Рік тому

    The Knowledge of Ahmad knows no bounds.

  • @prietjepruck
    @prietjepruck Рік тому

    Thank you very much for this tutorial. I would love to have the code available because typing it in myself from the video is a bit hard especially with the atocomplete on all the time. Keep up the good work.

  • @kelechijames5577
    @kelechijames5577 Рік тому

    This is very helpful. Most people don't realize the overheads and code refactoring necessary to take advantages of the GPUs. I am going to refactor a simple MNIST training propgram I have which currently uses only Numpy. See if I can get meaningful improvements in training time.

  • @aliyensagaltc3121
    @aliyensagaltc3121 Рік тому

    You are bloody watching a master at work xD

  • @mehmetak4349
    @mehmetak4349 Рік тому

    What makes the CPU better than the GPU is that each core is clocked at a faster speed and has many built-in instructions like SSE, allowing data to be processed faster. This provides a tremendous benefit to programs that only run on 1 core. In rendering where multiple cores can be used, you would need the CPU to process pixels about 5+x faster to match the GPU's performance.

  • @likky2229
    @likky2229 Рік тому

    The video was very helpful for me. Many thanks to the author for developing his audience with interesting and useful content

  • @mjmlangenihd8706
    @mjmlangenihd8706 Рік тому +1

    yes, you could do this by hand, which would be a great challenge in distributed computing to code by hand. Another option is to use a framework/platform like AWS Sagemaker to do distributed kmeans. Most organizations will do this.

  • @imsoumyajitbag
    @imsoumyajitbag Рік тому

    Awesome video !! It's preety cool to see such theoretical concepts coded and explained like this. Keep going Nich !!

  • @yigitboran5565
    @yigitboran5565 Рік тому +1

    That's mostly how it works. It's more like sorting the stones by its color and pattern and counting each variety. Using the CPU way, you would need to count each variety separately. If you have 100 different colors and patterns, that would take a long time to count (even if you could count extremely accurate and fast, similar to how the CPU makes up for it's lack of parallelism). The GPU way lets many people count them. Given 100 people (like the GPU), each person would count each variety at the same time.

  • @MTHHC
    @MTHHC Рік тому

    Hey, thanks for explanation! Very well done 👍 I am downloading CUDA 💪

  • @emircoltu875
    @emircoltu875 Рік тому

    Once you initialized lr to 0.0, I knew you were going to forget to change it lol. Love the challenges tho, keep doing them, I think it would be cool to see how you implement a neural network from scratch

  • @caneruc8534
    @caneruc8534 Рік тому

    This was really good. Thanks for posting this!

  • @mendes.02
    @mendes.02 Рік тому +1

    This is an academic example that shows the process of copying data to the GPU, doing a vectorized operation, then showing the results. Actually what makes sense on the GPU vs CPU is something I didn't cover, and am hoping other can figure out some cool ideas.

  • @burakcetin317
    @burakcetin317 Рік тому

    Excellent explanation, keep going with this content man ;)

  • @prof.gilang
    @prof.gilang Рік тому

    This is amazing! Thank you for taking effort to make it!

  • @Voicemelod
    @Voicemelod Рік тому

    Amazing! I'm learning so much watching you code. Thank you for sharing.

  • @aliarda9719
    @aliarda9719 Рік тому

    Thanks a million @Lakshman!! I try to keep it pretty tight so it’s a good challenge otherwise I know I’ll just talk for 22 minutes anyway😅

  • @agusexclusife2577
    @agusexclusife2577 Рік тому

    Can't wait to see Juan's better tutorial that he's definitely going to release :') lmao. Great video Ahmad .

  • @kadirylmaz356
    @kadirylmaz356 Рік тому

    This is really helpful for my computing. Thank you.

  • @enescakmak6699
    @enescakmak6699 Рік тому

    It works on both AMD and NVIDIA. If you have CUDA code, you can convert it to HIP with their automated tool, there is very little CUDA specific that can't be just translated over.

  • @ramazandemir6848
    @ramazandemir6848 Рік тому

    wold love to see a video on what are a few CUDA programming challenges

  • @SiTacuissem
    @SiTacuissem Рік тому +1

    Interesting, but two remarks:
    Example 1: on my setup (3080Ti, i7-8700K, running in WSL2 with Ubuntu 22.04) vector multiplication runs actually *faster* on CPU (if you either use the vectorized formulation in MultiplyMyVectors with target "cpu" or, simply, a*b instead of the unnecessary for loop in the CPU code). IMO that is mostly due to the overhead of copying the data to the GPU memory.
    Example 2: to get a fair comparison, you should also use the JIT for FillArrayWithouGPU, decorating with @jit(target_backend="cpu"). Then, GPU array filling is still faster, but only by a factor of 2.

  • @kuzeyylmaz8726
    @kuzeyylmaz8726 Рік тому

    Many thanks for the lucid explanation.

  • @gularif1
    @gularif1 Рік тому

    On the PC side Matrox was the first company to introduce GPU's . This was followed by ATI . NVidia came into the scene after the success of these 2 Canadian companies. Matrox's original 3D board was a 3 setboard with custom asics. I believe NVIDA actually acquired ATI. So yes, NVIDIA was not the first but they are the biggest in the space now. Matrox is still around but more involved in the industrial and nice markets.

  • @pikutheboss576
    @pikutheboss576 Рік тому

    Great explanation! Fascinatingly clear

  • @alidoruk5588
    @alidoruk5588 Рік тому

    This guy is so underrated.

  • @_jack_cruusc_tech_5688
    @_jack_cruusc_tech_5688 Рік тому

    Awesome! learning never stops.

  • @mehmetplgx8025
    @mehmetplgx8025 Рік тому

    Love your videos. Please don't stop!

  • @burakkose3703
    @burakkose3703 Рік тому

    Damn you are such a great teacher dude.

  • @madkar988
    @madkar988 Рік тому

    This is excellent material! Thank you!

  • @vipyt8550
    @vipyt8550 Рік тому

    It can be found in O(1). As far as I remember the formula is derived using LDU decomposition or Diagonalising a matrix, for matrix exponentiation.

  • @taimonn
    @taimonn Рік тому

    Amazing, clear, and understandable!

  • @TheGameboyTheDream123
    @TheGameboyTheDream123 Рік тому

    @nvidia I personally think the way you did the demonstration was perfectly sufficient. IMO, fancy graphics are unnecessary. Good job.

  • @dragnn1430
    @dragnn1430 Рік тому +100

    Dear Ahmad, you are 30 years old only doing post-doc ? I'm sorry but this to me sounds very underrated. Postdocs are not always well compensated for their work but spend a lot of time working and doign research. If i were you, i'd invest more time on my youtube channel, rather than doing something that does not compensate well.

  • @keremkipri9436
    @keremkipri9436 Рік тому

    Thanks for the video, subscribed! A suggestion : this small change to your code would demonstrate a real-world gradient descent solution for linear regression with noisy data. E.g. :

  • @arifefe4264
    @arifefe4264 Рік тому

    UA-cam questions are typically the best way to handle an issue or an issue request on Github on a project demo. I will do my best to answer when I have time.

  • @crazygamingpointSv
    @crazygamingpointSv Рік тому

    the zoom in on the unsaved icon was personal 💀