Data science in Python: pandas, seaborn, scikit-learn

Поділитися
Вставка
  • Опубліковано 24 гру 2024

КОМЕНТАРІ •

  • @dataschool
    @dataschool  3 роки тому +9

    Having problems with the code? I just finished updating the notebooks to use *scikit-learn 0.23* and *Python 3.9* 🎉! You can download the updated notebooks here: github.com/justmarkham/scikit-learn-videos

    • @aryanterrance6092
      @aryanterrance6092 3 роки тому

      I know im randomly asking but does any of you know of a method to log back into an instagram account?
      I was stupid forgot the account password. I appreciate any help you can give me!

    • @stetsondavian5756
      @stetsondavian5756 3 роки тому

      @Aryan Terrance instablaster ;)

    • @aryanterrance6092
      @aryanterrance6092 3 роки тому

      @Stetson Davian thanks for your reply. I found the site through google and im trying it out now.
      I see it takes quite some time so I will get back to you later when my account password hopefully is recovered.

    • @aryanterrance6092
      @aryanterrance6092 3 роки тому

      @Stetson Davian It did the trick and I now got access to my account again. I'm so happy:D
      Thank you so much you really help me out :D

    • @stetsondavian5756
      @stetsondavian5756 3 роки тому

      @Aryan Terrance no problem :)

  • @prachinainawa3055
    @prachinainawa3055 3 роки тому +6

    I'm a beginner but your way of teaching makes me love machine learning, I feel it's so easy. Even you make me understand how the algo is working behind the scene. Love from India...

    • @dataschool
      @dataschool  3 роки тому +1

      That's awesome to hear! 😊

  • @Emmaizam
    @Emmaizam 5 років тому +22

    This is the best ML tutorials I have ever seen! Thank you very much Sir.

  • @LekanMakanju
    @LekanMakanju 2 роки тому +2

    This is unreal! I literally abandoned my datacamp machine learning course for this one and no regret at all. I especially like that you taught the underlying mathematical concept of how these codes come to be. You also speak clear and understandable English plus the sound system is top notch. I've taken your Data science course and your and prof Allen's remains my best to date with Hugo's coming in a distant 3rd. And to think you recorded this more than 7 years ago makes you conclude that this is way ahead of its time

    • @dataschool
      @dataschool  2 роки тому

      Thank you so much for your kind words, Moruf! 🙏

  • @tissues2441
    @tissues2441 6 років тому +1

    You're a way better instructor than my college professors. The syntax is fairly simple and the explanation of the statistical intuition behind the metrics made this enjoyable.

    • @dataschool
      @dataschool  6 років тому

      Thanks very much for your kind words! Really appreciate it!

  • @TheBurningofSolomon
    @TheBurningofSolomon 7 років тому +17

    MANY THANKS!!!
    All other data science tutorials (for beginners) go by way to quickly. Some people may find you going slowly a nuisance, but I found it to be EXTREMELY HELPFUL. THANK YOU! Subbed ^__^

    • @dataschool
      @dataschool  7 років тому +4

      Awesome! That's so great to hear... thanks very much for your comment!

    • @XaccountFr
      @XaccountFr 5 років тому +2

      @@dataschool yes very good explanation for the beginner like me

  • @pratikdhumal3975
    @pratikdhumal3975 8 років тому +11

    I was searching for appropriate videos on ML from long time. After following this series i can say that it is the best which i have ever seen.Each and every concept is covered with great detail. Same applies for study material and links. Thanks Data School .....!!!!

    • @dataschool
      @dataschool  7 років тому +1

      That is great to hear, thanks so much for your very kind words!!

  • @Superdooperhero
    @Superdooperhero 7 років тому +3

    I watch way too much training videos and I would like to say that I wish you were the presenter in all of them. You rule at this training thing!

  • @lakswin
    @lakswin 5 років тому +5

    Kinda complete one, putting together all at-once! The best, I have watched until now!

  • @dataschool
    @dataschool  6 років тому +20

    *Note:* This video was recorded using Python 2.7 and scikit-learn 0.16. Recently, I updated the code to use Python 3.6 and scikit-learn 0.19.1. You can download the updated code here: github.com/justmarkham/scikit-learn-videos

    • @rael213rd
      @rael213rd 5 років тому +2

      Can we please get a video about ensemble learning (bagging and boosting)

  • @mukulkathpalia6924
    @mukulkathpalia6924 7 років тому +2

    These are the best tutorial series on machine learning.

  • @ossaiaishat18
    @ossaiaishat18 5 років тому +1

    To be candid, this is the best video I've ever watched on scikit-learn. Thumbs up!!!

    • @dataschool
      @dataschool  4 роки тому

      That's awesome to hear... thank you! 🙏

  • @terryhenyo9216
    @terryhenyo9216 5 років тому

    Your video tutorial is outstanding! You can simplify complex concepts in an elegant manner. And unlike other instructors you don't show-off on how smart you are. That's why we know that you're really a smart guy :)

    • @dataschool
      @dataschool  4 роки тому

      Thank you SO MUCH for this kind comment! I truly appreciate it.

  • @injypal
    @injypal 5 років тому +1

    Please add more videos to the series. It is really helpful and amazing to watch your videos. You are a great teacher.

    • @dataschool
      @dataschool  5 років тому

      Thanks for your suggestion, and for your kind words!

  • @AntonioAugustoVianaS
    @AntonioAugustoVianaS 9 років тому

    More pandas please! And more Seaborn!
    A large part of Machine Learning is "messing" with the data BEFORE you apply any of the algorithms on it, and pd and sns are really good at that.
    Also, I think it'd be interesting (maybe latter in the series) that you could go on an all out example, like working with the titanic dataset from Kaggle, and giving hints on how to visualize, understand the data and choose the best algorithm for it.
    As a final note, I'm already a bit familiar with the techniques you use, but your comments and clear explanations makes everything clearer and helps me fixate some of these techniques.
    Thank you for that! Excellent series, and keep on the good work.

    • @dataschool
      @dataschool  9 років тому

      Antonio Augusto Santos Thanks for the feedback! I am planning to cover more examples later in the series, probably using a Kaggle competition. And, I appreciate your kind words! I was hoping to reach both users new to machine learning and those with some machine learning familiarity, so it's nice to hear that it's working :)

  • @BluntAmericanHistory
    @BluntAmericanHistory 9 років тому +1

    Your videos are fantastic, for people with random gaps in their knowledge you explain things very clearly.

    • @BluntAmericanHistory
      @BluntAmericanHistory 9 років тому

      +Siddharth Gupta For people who have random chunks of exposure to certain aspects of sklearn/pandas/etc: watch the video at 1.25 or 1.5x speed. You can get through the lesson faster, and the increased speed will actually have a counterintuitive effect of making you focus more. Also when you start losing focus or miss a concept, you will notice right away because you will suddenly be totally lost, so you will know to rewind.

    • @dataschool
      @dataschool  9 років тому

      +Siddharth Gupta Thanks for your kind comments!

  • @JackSimpsonJBS
    @JackSimpsonJBS 9 років тому +3

    Thank-you so much for your explanations of sk-learn, it finally makes sense to me! I'm already pretty familiar with Pandas so I'd love to learn more about sk-learn, because I feel there are so many other machine learning algorithms I'd love to get my head around.

    • @dataschool
      @dataschool  9 років тому

      ***** Nice! I love to hear that my explanations are helping things to "click" for people. Thanks for your comment!

  • @andrewsanchez4349
    @andrewsanchez4349 7 років тому

    Definitely one of the best tutorials I've ever watched. Can't wait to work through the 3 hour presentation at the end of this. Thank you!

    • @dataschool
      @dataschool  7 років тому

      Thanks so much for your very nice comment! You're very welcome! :)

  • @priyaponnus8620
    @priyaponnus8620 3 роки тому

    Thank you for the awesome videos. I am currently learning Machine Learning as part of a course. I don't have previous knowledge of Python (currently learning an introduction to Python as well), I am really struggling to understand; this is my midterm break; I found one of your videos while I was searching, I am one of the fortunate to found your videos. Thanks for your effort.

    • @dataschool
      @dataschool  3 роки тому

      You're very welcome! Glad I could help!

  • @aegystierone8505
    @aegystierone8505 4 роки тому

    Really appreciate that you also explain the algorithms and how to find the coefficient governing the equations. Thank you so much!

  • @AngryCanadian3
    @AngryCanadian3 8 років тому

    This is the best video tutorial series on Machine learning I have seen. You have hooked me up! Thanks for creating the series and you are an amazing teacher. Keep it up!

    • @dataschool
      @dataschool  8 років тому

      +Aashish Kumar You're very welcome, and thanks for your kind words!

  • @guptaachin
    @guptaachin 8 років тому

    You are undeniably the best tutor i have ever had. Thank you for teaching DS precisely. :)

    • @dataschool
      @dataschool  8 років тому

      Wow, thank you! I'm glad my teaching style works well for you :)

  • @umashankarverma3179
    @umashankarverma3179 5 років тому

    Your teaching methodology is best,you step by step teaching method is very helpful for me to understand.You are the best.

  • @lubojurciak2525
    @lubojurciak2525 5 років тому +1

    I wish you were my data analysis lecturer... Thank you very much for this.

    • @dataschool
      @dataschool  5 років тому

      Thanks very much for your kind words!

  • @kennyl7542
    @kennyl7542 9 років тому +16

    wonderful videos! I would like you to focus on scikit-learn, and your style of teaching which combines hands-on with scikit-learnt, real examples, explanation of ML techniques are very helpful!

    • @dataschool
      @dataschool  9 років тому

      +Kenny L Thanks for your kind comments and your feedback!

    • @nackyding
      @nackyding 7 років тому +1

      Word! I agree with you!

    • @joancolon635
      @joancolon635 6 років тому

      Kenny L i

  • @MrChristian331
    @MrChristian331 5 років тому

    Say one thing....you are an excellent teacher. My teachers at engineering school and on Udemy don't explain things half as well as you do! That should tell you a lot!
    I wish I could hire you personally.

    • @dataschool
      @dataschool  5 років тому

      Thanks so very much for your kind words! You might be interested in joining my membership community: www.patreon.com/dataschool

  • @dataschool
    @dataschool  8 років тому +22

    Want to learn more pandas? I have a new video series about it: ua-cam.com/play/PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y.html

  • @igorfigueredo5040
    @igorfigueredo5040 7 років тому

    Hi, im a begginer in data science and your videos are helping me a lot of, thanks.

  • @danielandreasen2293
    @danielandreasen2293 9 років тому +1

    As for an answer for your question: I would like to learn more about sklearn. Pandas is amazing, and I'm just starting to learn it, but there are already a lot of nice tutourials out there. Keep up the good job :)

    • @dataschool
      @dataschool  9 років тому

      Daniel Andreasen Good point! There are lots of Pandas tutorials already out there.

  • @doupanpan7271
    @doupanpan7271 6 років тому

    really thankful for your video series. it is straightforward and easy to understand, highly recommend to other guys who are interested in python, machine learning etc.

    • @dataschool
      @dataschool  6 років тому

      Awesome! Thanks for sharing it with others :)

  • @vamsikrishna1131
    @vamsikrishna1131 6 років тому +1

    Lots of great information at the end and links in the description. Very valuable. Really appreciate it!

  • @DenzilJoseph
    @DenzilJoseph 7 років тому

    Excellent description of the end-to-end ML flow. Thank you.

  • @flamboyantperson5936
    @flamboyantperson5936 7 років тому

    You are the best teacher in the world. I learned something very important to me in this video. Thank you so much. Please keep the good work going.

    • @dataschool
      @dataschool  7 років тому

      Wow! Thank you so much for the very kind comment! Good luck to you :)

  • @gauravmitra3683
    @gauravmitra3683 8 років тому +1

    This is one of the best available online resource for introduction to data science. Thank you for these amazing videos. Its teachers like you who inspire students like me :)

    • @dataschool
      @dataschool  8 років тому

      Wow, what a kind comment! Thank you so much!

    • @arjunbakshi810
      @arjunbakshi810 4 роки тому

      Gaurav, Im having trouble reading advertisemets.csv
      Can you help ma?

  • @The2002962
    @The2002962 7 років тому

    Tutorial content is pretty cool. adding humor while explaining will add good experience for learners. :)

  • @your_buddy_11
    @your_buddy_11 5 років тому +1

    Thank you very much
    Your teaching methodology is awesome making things crystal clear.

  • @v_b_r_1996
    @v_b_r_1996 7 років тому

    Very good content. I have tried so many video series for data science and this is by far the best! Thanks!

    • @dataschool
      @dataschool  7 років тому

      That's great to hear - thanks so much for your kind comment!

  • @robindong3802
    @robindong3802 7 років тому +1

    you made it so easy to learn. you lead me to ML right here. Thank you so much.

  • @julians.2597
    @julians.2597 5 років тому

    Wow, one of the best YT tutorials about this topic, thank you!

  • @samkumargupta2536
    @samkumargupta2536 6 років тому +1

    Really Awesome tutorials sir...
    Its very easy to understand...Better that other ML tutorials I have watched...☺☺☺

    • @dataschool
      @dataschool  6 років тому

      Thanks for your kind comment!

  • @raghug2073
    @raghug2073 6 років тому

    Very very great way teaching. I really liked the speed and pronounce you do, the possible mistakes which you cover, also explanation. This is great series and you are a great tutor. Fan of you and subscribed. Please make a separate series on Machine Learning (Bit more detailed), Deeplearning, AI, Data Science. I am not sure which one should be learnt first and how. I decided you are the best guru for me to make me some good level in all these skills. Please help.

    • @dataschool
      @dataschool  6 років тому

      Thanks for your suggestions! I'll consider them for the future :)

  • @sribastavrajguru304
    @sribastavrajguru304 7 років тому

    Great work,please upload more tutorials lyk these,really helpful to get started.
    Before watching this tutorial i was not at al aware of ML,but now after watching 4/5 videos i've got a good overview ,thank you

    • @dataschool
      @dataschool  7 років тому

      Great to hear! Thanks for your kind comment.

  • @serdarb8995
    @serdarb8995 6 років тому +1

    Hi Kevin, First of all thank you very much for those great videos. If you have a chance to make tutorial regarding deep learning it would be great. You are the best instructor, I've ever seen in this field. You are the best

    • @dataschool
      @dataschool  6 років тому

      Thanks so much for your kind words, and for your suggestion!

  • @genaugenaugenau
    @genaugenaugenau 7 років тому

    This guy is great at teaching. Much appreciated!

    • @dataschool
      @dataschool  7 років тому

      Thanks for your kind comment!

  • @unstatic_electronics
    @unstatic_electronics 9 років тому

    Excellent and straight to the point content again. Thanks a lot for the videos and also the additional references you provide. It's always good to know where to go next :)
    And please continue on with scikit-learn rather than pandas/seaborn.

    • @dataschool
      @dataschool  9 років тому

      Romain Lepert Thanks for the feedback! :)

  • @JoannaChmielewska_uk
    @JoannaChmielewska_uk 8 років тому

    Thank you for making the effort to produce these videos. It's a great resource and your delivery is superb.

    • @dataschool
      @dataschool  8 років тому

      Wow, what a kind compliment, thank you so much!

  • @JCRMatos
    @JCRMatos 9 років тому

    Another excellent video.
    Please continue to focus on ML and scikit-learn.

    • @dataschool
      @dataschool  9 років тому

      João Matos Thanks for your feedback, much appreciated!

  • @dianawilliams9470
    @dianawilliams9470 6 років тому

    Thank you! Your videos are helping to make the concepts click! This is the best resource I have found

  • @RajeshSriMuthu
    @RajeshSriMuthu 6 років тому +1

    தலைவரே - (tamil language) Thalaiva you are great.....

  • @AvivProg
    @AvivProg 8 років тому

    Watched all your videos. Your teaching skills are amazing, thank you for compiling those videos.
    I'm looking forward to your next videos about machine learning using sklearn.

    • @dataschool
      @dataschool  8 років тому

      +AvivProg Wow, thank you! You are very welcome -- I enjoyed creating the videos.
      Here is the playlist containing the entire video series: ua-cam.com/play/PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A.html

  • @RicardoFerrazLeal
    @RicardoFerrazLeal 9 років тому

    Pretty amazing video!
    +1 for sk-learn as next video in this series. I also think that plotting stuff helps a lot. Whenever possible it would be nice to show seaborn in action.
    Great job and looking forward to the next one.

    • @dataschool
      @dataschool  9 років тому

      Ricardo Ferraz Leal Thanks for the feedback!

  • @sofjakovalevskaya1446
    @sofjakovalevskaya1446 6 років тому +1

    Really perfect explanation and walk through. Thanks a lot!

  • @Dexter01
    @Dexter01 4 роки тому

    I am answering your question 5 years later but I would love to see more video tutorials from you about scikit-learn (e.g Neural network models (supervised)) or
    scikit-multilearn if you want!! :) Thnx a lot Kevin!

    • @dataschool
      @dataschool  4 роки тому

      Thanks for your suggestions!

  • @WillGoesMeta
    @WillGoesMeta 7 років тому

    Thank you so much for having this series!

  • @fritz0199
    @fritz0199 8 років тому +1

    This series is amazing, thank you!

    • @dataschool
      @dataschool  8 років тому

      You're welcome! Thanks for your kind words!

  • @shivbalaji8286
    @shivbalaji8286 7 років тому +2

    You are doing a great job !!!!!! Thank you very much for all your valuable videos !!! They are really helping me !!!! Thanks again :-)

    • @dataschool
      @dataschool  7 років тому

      That's great to hear! I'm glad the videos are helpful to you!

  • @yffzju3405
    @yffzju3405 8 років тому +1

    Cool video!I just finish your pandas video series, but I thought pandas should be learned before the sklearn, well, anyway thank you for making such great videos for us.

    • @dataschool
      @dataschool  7 років тому

      Great! I also have a scikit-learn video series: ua-cam.com/play/PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A.html

  • @bogdanjcnd
    @bogdanjcnd 7 років тому

    I totally agree, the excellent guide for data learning , visualisation and machine learning.Great work

    • @dataschool
      @dataschool  7 років тому

      Thanks for your kind comment!

  • @harveysummers3175
    @harveysummers3175 9 років тому

    These videos are outstanding. Am new to data science and many of the videos are too simple or too hard. You have found the goldilocks zone of data science. I also like that they are on youtube where I can speed them up to 1.5x to match my comprehension rate.Vimeo can't do that.
    I would like you to focus on Scikit, but use Pandas as most of use will be using both. I think a single lesson on how to use Pandas, as well as how to customize Ipython/Jupyter, would also be useful. I'd also like to see a video focused on data sources and on how to approach complex problems (ala kaggle challenges)
    Improvement suggestions: 1. Focus on technnical quality. Use basic stage lighting (difussed above, side, front, w/ reflector) and a condensor mic to better pic up your voice w/o echo. 2) put a whiteboard or suchsimple background behind you - way to much background clutter.
    And I think you are missing an opportunity to end with marketing your courses at data school, your book, etc.Not that I love ads, but... marketing!

    • @dataschool
      @dataschool  9 років тому

      Harvey Summers Thanks for all of the suggestions, and your kind comments! Very helpful. Building up to more complex problems is definitely on the list. And, it's nice to know that I'm hitting the "sweet spot" in terms of difficulty level.

  • @gabrielreilly7010
    @gabrielreilly7010 3 роки тому

    Thanks!

    • @dataschool
      @dataschool  2 роки тому

      Wow, thank you so much! That is truly kind of you! 🙏

  • @eturkoz
    @eturkoz 5 років тому +1

    Your explanations are wonderful. Thank you.

  • @reassassinator
    @reassassinator 7 років тому

    Your videos really helped me understand the sklearn basics easily. It would be great if you could do a similar video series on SVMs using scikit-learn and its applications. Your explanations and methods are great!
    Thanks a lot!

    • @dataschool
      @dataschool  7 років тому

      Thanks for your suggestion as well as your kind words! I appreciate it :)

  • @TheGautamj
    @TheGautamj 4 роки тому +3

    The csv file does not load up. Has the url changed?

  • @_SoundByte_
    @_SoundByte_ 7 років тому

    Thanks for your lessons :-)
    Clear, detailed and to the point.

    • @dataschool
      @dataschool  7 років тому

      Thanks for your kind comments!

  • @sebastianpinedaarango8239
    @sebastianpinedaarango8239 9 років тому

    Great video!! Thanks for that. I'd like to keep learning about Scikit-learn. Although, Pandas is also definitely a powerful Python data analysis toolkit.

    • @dataschool
      @dataschool  9 років тому

      Sebastian Pineda Arango Glad you liked it! Thanks for the feedback.

  • @MrMmahesh007
    @MrMmahesh007 7 років тому

    amazing videos. Very streamlined and easy to understand.

  • @BadriNathJK
    @BadriNathJK 8 років тому +5

    You are the best.

    • @dataschool
      @dataschool  8 років тому

      Thank you, I'm glad this content is useful to you!

  • @rahulmanna5730
    @rahulmanna5730 5 років тому +5

    Currently the url for the dataset is : faculty.marshall.usc.edu/gareth-james/ISL/Advertising.csv

    • @dataschool
      @dataschool  4 роки тому

      Thanks for sharing! I also have it on GitHub: github.com/justmarkham/scikit-learn-videos/tree/master/data

  • @transportation-talk
    @transportation-talk 9 років тому +4

    Great video once again. I think the focus of this series should be on ML and Scikit learn. You can explain the relevant pandas code wherever required as you did in this video.
    One question: Is there any algorithm in ML which can select the most relevant / explanatory predictor variables (features) from the data set (instead of user using trial and error approach)? I think this is critical for the data sets with high number of features

    • @dataschool
      @dataschool  9 років тому +1

      umair durrani Great question! There is no "silver bullet" for feature selection, meaning no single strategy that will always tell you which variables to keep in your model. Domain understanding, data exploration, and human intuition are key.
      That being said, the Random Forests model will give you a measure of "variable importance" (on a scale of 0 to 1), and you could use that to guide the selection. As well, regularized linear models will shrink coefficients down to zero as the "penalty term" increases, effectively performing feature selection. Just keep in mind that both need to be tuned to perform properly, and features need to be scaled when performing regularization. scikit-learn has some more guidance on feature selection here: scikit-learn.org/stable/modules/feature_selection.html
      Thanks again for your kind and helpful comments!

    • @darronfuller5297
      @darronfuller5297 9 років тому +1

      umair durrani Umair, there are several useful techniques for feature selection that I recommend you look into. Statistical methods such as forward- and backward-elimination are perfectly suited for determining the most predictive variables in a regression model and easy to understand and implement. Decision Trees inherently perform feature selection in that the variable splits are deemed significant and automatically chosen by the algorithm. A bit more on the complex side are Principle Component Analysis (PCA) and Association Rules which I believe PCA is in sci-kit-learn. Good luck! Darron. www.linkedin.com/in/votefordata

    • @sabr9906
      @sabr9906 8 років тому

      +Data School Could you please advise in another course more about Feature Selection? Which models are more suitable for several cases etc. Like for example, sorting features' scores from RandomizedLasso, or by ranking from RecursiveFeatureElimination, or by selecting K best?

    • @dataschool
      @dataschool  8 років тому

      +Sabr Tasbolatov Thanks for the suggestion! I'll consider it for the future.

    • @dataschool
      @dataschool  6 років тому

      I just released a video about feature selection which might be helpful to you! ua-cam.com/video/YaKMeAlHgqQ/v-deo.html

  • @loaiabdallatif4947
    @loaiabdallatif4947 7 років тому

    very useful video on liner regression
    thanks very much Mr.
    Kevin Markham

  • @harshrajj9995
    @harshrajj9995 5 років тому +1

    Such great content you provide sir! Thank you so much.

  • @saranemohan
    @saranemohan 7 років тому

    It's wonderful tutorial ever I seen regarding machine learning. I expect more videos related to machine learning. if you made some video regarding some optimization technique of linear regression, then it should be more beneficial. ( like bfgs etc )

    • @dataschool
      @dataschool  7 років тому

      Thanks so much for your kind words! I'll take your suggestion under consideration.

  • @Tony770jr
    @Tony770jr 9 років тому

    Cool stuff, would like to see more pandas integrated with scikit learn.

    • @dataschool
      @dataschool  9 років тому

      Tony770jr Thanks for the suggestion!

  • @siming07
    @siming07 9 років тому

    Thank you so much for the video, really great introduction to Pandas and SKlearn, I hope you can focus more on the sklearn with pandas dataframe, again, thanks for the great video!

    • @dataschool
      @dataschool  9 років тому

      +Siming Zhao You're very welcome, and thanks for your comment!

  • @libardomm.trasimaco
    @libardomm.trasimaco 7 років тому

    I absolutely love what you do!. Thank you very very much!

    • @dataschool
      @dataschool  7 років тому +1

      You are very very welcome!

  • @suemareverton7756
    @suemareverton7756 8 років тому +1

    These videos helped me a lot! Thank you so much!!

    • @dataschool
      @dataschool  8 років тому

      Great, I'm glad the series is helpful to you!

  • @aracelyssunico8116
    @aracelyssunico8116 7 років тому

    Super Helpful! Your explanation are clear and clean :) thanks

  • @musabosman2843
    @musabosman2843 3 роки тому +1

    Nicely presented and delivered. Thank you!. I have subscribed to your channel!

  • @abdelrhmanshokr7546
    @abdelrhmanshokr7546 4 роки тому

    dude you're one of the best

  • @sharlene1176
    @sharlene1176 9 років тому

    Thank you very much for this video series!!! This is really helpful!

    • @dataschool
      @dataschool  9 років тому

      +shalin LUO You're very welcome!

  • @troywalters6106
    @troywalters6106 9 років тому

    Great tutorial!! After watching this and looking at the sklearn docs, it seems as if the LinearRegression() object has only coef_ and intercept_ attributes. Does sklearn not provide metrics such as standard errors, t-statistics, p-values, and R-squared? If not, what is the reasoning behind it ? Thanks.

    • @dataschool
      @dataschool  9 років тому +1

      Troy Walters Thanks for your comment! You can indeed compute R-squared using the r2_score function in the sklearn.metrics module. Regarding the others, I think the scikit-learn contributors would argue that those metrics belong in a statistics library, not a machine learning library. Here is a relevant discussion from the scikit-learn mailing list: www.mail-archive.com/scikit-learn-general%40lists.sourceforge.net/msg13102.html

  • @HossainRabin
    @HossainRabin 6 років тому

    Fantastic tutorial series for PYTHON beginners ...Can you please start teaching us deep learning and neural network?
    I learn PANDAS, Numpy from your tutorial..
    Thanks a lot man

    • @dataschool
      @dataschool  6 років тому

      Thanks for your suggestion!

  • @Drithin69
    @Drithin69 8 років тому

    At 20:35, Isn't 0.179 > 0.046 ?. Then Radio ads should lead to an increase in more sales than Tv right?
    Please clarify :)

    • @dataschool
      @dataschool  8 років тому

      +C. Drithin It's correct to say that a unit increase in radio spending is associated with a greater increase in sales than a unit increase in TV spending.

  • @siddhidhavale7329
    @siddhidhavale7329 4 роки тому +2

    Hi, the file URL isn' valid. Can you please share it?

  • @ebenezerpopoola7860
    @ebenezerpopoola7860 8 років тому

    Wow! this is very clear. You are the best.

    • @dataschool
      @dataschool  8 років тому

      Thanks very much for your kind comment!

  • @21121990jay
    @21121990jay 7 років тому

    Very helpful video !!! thanks for sharing your knowledge.
    looking forward for more !!

    • @dataschool
      @dataschool  7 років тому

      You're very welcome! Glad to hear it was helpful to you!

  • @philippe6724
    @philippe6724 4 роки тому

    Impressive teacher!

  • @danielkazmi
    @danielkazmi 6 років тому +1

    Absolutely amazing material, thank you Kevin!
    I just wanted to know how would you deal with non-numerical features (i.e Gender, Occupation, Education, etc.) when constructing your ML model? Would you assign them numerical values? If possible, I'd like some guidance or a push in the right direction.
    Again you explain this material much better than most channels do, please keep up the phenomenal work!

    • @dataschool
      @dataschool  6 років тому +1

      Thanks very much for your kind words!
      This might be helpful to you: ua-cam.com/video/0s_1IsROgDc/v-deo.html

  • @elilavi7514
    @elilavi7514 9 років тому

    Thanks for good video !
    Will be great if you can in a future video take any data set from some kaggle competition any try to work with ,
    feature engineering is an interesting issue too.
    Two technical notes :
    - for people who works with proxy , to install seaborn with anaconda have to define http/https proxy first , so on anaconda prompt execute following command : "set http_proxy=X.X.X.X:port_number"
    - for Python 3 users zip command looks like :
    "list(zip(feature_cols,linreg.coef_))"

    • @dataschool
      @dataschool  9 років тому

      Eli Lavi Sounds good... thanks for the notes!

  • @elivazquez7582
    @elivazquez7582 7 років тому

    Great videos - all of them! Thanks for doing this.

    • @dataschool
      @dataschool  7 років тому

      Thanks for your kind comment!

  • @ameer9900
    @ameer9900 7 років тому

    wonderful videos for machine learning beginners.

    • @dataschool
      @dataschool  7 років тому

      Thanks! Glad it was helpful to you.

  • @nackyding
    @nackyding 7 років тому

    Thanks. Awesome tutorials. I'm learning a lot. Thank you again.

  • @zymx2007
    @zymx2007 4 роки тому

    Hi Kevin, I'm new to both Python and machine learning. Your tutorials are great learning materials. I understanding this is a 5-year old presentation and I'm wondering if you would still answer a question I have related to this tutorial. Specifically, when I was trying to get the pairplots you demonstrated, I got the following error: KeyError: "['Sales'] not in index" and I got three blank boxes. What was wrong? Many Thanks for your help. FYI, I also tried to find answers by Googling online and haven't been able to find any answers that work.

  • @LonglongFeng
    @LonglongFeng 7 років тому

    at 29:28, how could you indicate 'sales ranged from 5 to 25' based on the RMSE?

    • @wcschuknecht
      @wcschuknecht 7 років тому

      It's not perfect, but if you check out minutes four and five you'll see via the .head() and .tail() methods 10 records out of the overall 200 with sales in the 5 - 25 range. Aside from that I'm not sure it's explicitly explained aside from the reference you called out.

    • @dataschool
      @dataschool  7 років тому

      I was not stating this based on the RMSE. Rather, I was stating this because I knew this about the dataset from examining it. Sorry that was not clear!

  • @gopalgoel2289
    @gopalgoel2289 7 років тому

    Thank you sooo much these are the best tutorial series :)

    • @dataschool
      @dataschool  7 років тому

      Thanks for your kind comment!

  • @ankitbiradar8599
    @ankitbiradar8599 9 років тому +4

    Could you teach how to program Neural Networks and SVM using sckit-learn ?

    • @dataschool
      @dataschool  9 років тому +1

      +ankit biradar Thanks for the suggestion! I'll consider it for a future video.

  • @hsin-yuku4086
    @hsin-yuku4086 4 роки тому

    Thank you for the awesome videos, clear and to the point. However, I have a question regarding the retraining for the feature selection part (starting 30:31) : Won't it introduce data snooping bias when retraining to pick for different features?

  • @mirzaburgic
    @mirzaburgic 3 роки тому

    Great content, you have an inspiring way of presenting, keep it up!
    I have one question though, why is the TV coefficient smaller than the Radio coefficient, even though from the plots and best fit line it looks like the sales go up faster with more TV ad spending?

  • @shawnz9833
    @shawnz9833 8 років тому

    Very Very nice explanation. Thank you Kevin

  • @pradhyumnchoudhary7383
    @pradhyumnchoudhary7383 4 роки тому

    I am getting a parser error for reading the csv file from the website. (3:00)

    • @utkarshkunwar4319
      @utkarshkunwar4319 4 роки тому

      Here it is..
      github.com/justmarkham/scikit-learn-videos/tree/master/data