Python Data Science Tutorial: Analyzing the 2019 Stack Overflow Developer Survey

Поділитися
Вставка
  • Опубліковано 30 чер 2024
  • In this video, we will be learning how to use analyze survey data in Python.
    This video is sponsored by Brilliant. Go to brilliant.org/cms to sign up for free. Be one of the first 200 people to sign up with this link and get 20% off your premium subscription.
    In this Python Programming video, we will be learning how to download and analyze real-world data from the 2019 Stack Overflow Developer Survey. This is terrific practice for anyone getting into the data science field. We will learn different ways to analyze this data and also some best practices. Let's get started...
    The code from this video can be found at:
    bit.ly/SO-Survey-2019
    CSV Tutorial - • Python Tutorial: CSV M...
    Jupyter Notebooks Tutorial - • Jupyter Notebook Tutor...
    ✅ Support My Channel Through Patreon:
    / coreyms
    ✅ Become a Channel Member:
    / @coreyms
    ✅ One-Time Contribution Through PayPal:
    goo.gl/649HFY
    ✅ Cryptocurrency Donations:
    Bitcoin Wallet - 3MPH8oY2EAgbLVy7RBMinwcBntggi7qeG3
    Ethereum Wallet - 0x151649418616068fB46C3598083817101d3bCD33
    Litecoin Wallet - MPvEBY5fxGkmPQgocfJbxP6EmTo5UUXMot
    ✅ Corey's Public Amazon Wishlist
    a.co/inIyro1
    ✅ Equipment I Use and Books I Recommend:
    www.amazon.com/shop/coreyschafer
    ▶️ You Can Find Me On:
    My Website - coreyms.com/
    My Second Channel - / coreymschafer
    Facebook - / coreymschafer
    Twitter - / coreymschafer
    Instagram - / coreymschafer
    #Python

КОМЕНТАРІ • 145

  • @coreyms
    @coreyms  4 роки тому +55

    I hope everyone finds this useful. Going out and practicing with real-world data like this is one of the best (if not the best) way to practice your skills. If any of you do your own analysis that's interesting then be sure to share it with us in the comment section below!
    PS I forgot to mention that as usual, the code for this video can be found in the description section.

    • @shpazhist
      @shpazhist 4 роки тому +4

      Corey, can you please do a video where you explain how to get to that level of intermediate-advanced developer where one can play with code really-really easily? In essence, say if one wants to learn about new library or framework or a new challenge - - can you please explain your process how do you go about actually learning that field rather than doing videos on standalone topics as IMO that would really help a lot of people to sorta move to the next level a lot. Thx

    • @bodekolton5961
      @bodekolton5961 3 роки тому

      I know it's quite off topic but do anybody know a good website to stream new tv shows online?

    • @troylorenzo6430
      @troylorenzo6430 3 роки тому

      @Bode Kolton Lately I have been using Flixzone. Just google for it :)

    • @maddoxronin9643
      @maddoxronin9643 3 роки тому

      @Troy Lorenzo yea, have been watching on Flixzone for since april myself =)

    • @bodekolton5961
      @bodekolton5961 3 роки тому

      @Troy Lorenzo thank you, I went there and it seems like a nice service :D I really appreciate it!

  • @mrityunjayahiremath3951
    @mrityunjayahiremath3951 4 роки тому +70

    Python + Data Science + Corey = ❤️

  • @Ghasakable
    @Ghasakable 4 роки тому +19

    Man, you are awesome, you are the reason that I open UA-cam, please if you come to Japan one day, I will do my best to show you good places here in Tokyo.

    • @coreyms
      @coreyms  4 роки тому +4

      That'd be awesome!

  • @AvantjerTube
    @AvantjerTube 4 роки тому +5

    Thanks Corey!
    I’ve been following your excellent tutorials for quite awhile.
    Just wanted to say that I’m excited and looking forward to your Pandas course(s).

  • @mylesmontclair
    @mylesmontclair 4 роки тому +1

    Great to have you back producing great content for us. Thanks, Corey!

  • @anaesthesiadolorosa7100
    @anaesthesiadolorosa7100 4 роки тому +1

    It's awesome that you've started making data science videos. Please keep them coming! ;)

  • @Sauhros
    @Sauhros 4 роки тому

    Just what I needed to start off my winter break. I can't wait for your pandas series! Keep up the great work Corey.

  • @GuilhermePalazzoRodrigues
    @GuilhermePalazzoRodrigues 4 роки тому +9

    That was incredible, really easy to follow and replicate, thank you very much.

  • @sanukurien2752
    @sanukurien2752 4 роки тому +1

    @Corey, I love watching your videos when I have my morning coffee. It's elegant and super informative. Thank you!

  • @afternoontiger9319
    @afternoontiger9319 4 роки тому +14

    I'm definitely interested in a future tutorial of using pandas with this data! Otherwise a fantastic job

  • @BoukariYameogo
    @BoukariYameogo 4 роки тому +2

    This is what I was waiting for! Thank you!

  • @dadoll1660
    @dadoll1660 4 роки тому +1

    Wow, this is Amazing! Please make more videos for this DS Tutorial series. Thank you Corey...

  • @kizzavincent
    @kizzavincent 4 роки тому +1

    Very nice to have you back. Many thanks.

  • @mancalos
    @mancalos 3 роки тому

    Finally a project not using either the Titanic or Iris datasets. Corey, you are a gem. Thanks for all the effort you put into this and every other tutorial.

  • @DistortedV12
    @DistortedV12 4 роки тому +1

    Wow Corey you do it all; thanks for the helpful tutorials!

  • @user-gx9hk8gt3k
    @user-gx9hk8gt3k 4 роки тому +1

    Thank you for Python Data Science Tutorial! Keep it up, please!

  • @rangavembar
    @rangavembar 4 роки тому +1

    Brilliant as always!! Looking forward to the Pandas series!!

  • @Lfppfs
    @Lfppfs 3 роки тому

    Awesome tutorial, thanks Corey!

  • @ajb4964
    @ajb4964 4 роки тому +7

    Wow, please do more like this.

  • @Vasko10100
    @Vasko10100 4 роки тому +3

    Finally, i understand analyzing Data in Python! I went from Coursera via EDX, UDEMY etc. But this.... this is a pure gold sir! Please, do some course or learning path for Data Science with Machine Learning in Python in depth( let it be if needed a paid learning path). You are one of the best! If you somehow manage to come in Macedonia, tell me, i want to bow before you!

    • @coreyms
      @coreyms  4 роки тому +3

      Thanks! Im still working on a Pandas series. 4 videos are complete but I'm working to finish the rest soon. Glad to have you as a subscriber and member. You all make it possible for me to release these for free. Thanks again!
      media.giphy.com/media/WvUyMX1gB1gnC/giphy.gif

  • @yosefco3
    @yosefco3 4 роки тому +2

    Hi great Corey ! Thank you very much on your channel , your flask tutorial is the best i ever seen ! Please could you do a flask-api tutorial too ?

  • @HunniiChii
    @HunniiChii 4 роки тому +1

    Hope to see more of these videos from you in the future!

  • @jayjay6804
    @jayjay6804 4 роки тому +1

    As hobbiest my first inclination would be to extract the data using pivot tables in libreoffice. Thank you for demonstrating the use of python for this purpose.

  • @philinnc
    @philinnc 4 роки тому

    Looking forward to seeing a future video where pandas is used to analyze this same data. Really appreciated your stepping through the thought process, something I haven't seen in many other data science tutorials. Approaches to data cleaning is another interesting topic that would be nice to see covered in a future video. Recently there were these FEC reports... :-)

  • @_boris
    @_boris 4 роки тому +1

    Your videos are so brilliant as I want to keep them to myself only... 💎
    ...however, way too good not to share them with others. Shared with my classmates - hopefully, they find it useful :)

  • @ramil17998
    @ramil17998 4 роки тому +1

    Thanks a lot for the video! I love it cause its as real as it can get!

  • @nikhillingam4630
    @nikhillingam4630 4 роки тому +1

    absolutely brilliant work well done

  • @KevinOMalleyisonlysmallreally
    @KevinOMalleyisonlysmallreally 4 роки тому

    You produce some of the best tutorials on python. Itd really help if you did one on advancing jupyter notebook use, and seeing up sublime to work with pandas.

  • @borgestheborg
    @borgestheborg 4 роки тому +2

    Oh baby! Never clicked on a video so fast in my life! :D

  • @akhilrapalli4118
    @akhilrapalli4118 4 роки тому +1

    wow what a video from an expert on python like u verymuch corey

  • @yomajo
    @yomajo 4 роки тому +1

    It seems like Counter() could be a topic of its own. I need to dig more into that. Awesome code-along experience!

  • @gaunterodimm3569
    @gaunterodimm3569 4 роки тому +1

    Great video! This video helped me soooooooo much!

  • @amnah.-.4708
    @amnah.-.4708 4 роки тому

    Hi Corey, thank you for the content you put out, it is beyond exceptional. Also, it would be great if you could make an in-depth course on SpaCy, and its application in Data Science.

  • @michaelworkspace
    @michaelworkspace 4 роки тому +25

    In case anybody ran into an error like "UnicodeDecodeError: 'charmap' codec can't decode byte 0x81" while reading the csv file like I had, the fix is to add a encoding='utf-8' parameter to the with open() function. e.g. with open('survey_results_public.csv', encoding='utf-8') as f:

    • @Mayur7Garg
      @Mayur7Garg 4 роки тому +4

      Thx man! You are a savior.

    • @shivstn98
      @shivstn98 4 роки тому +1

      It can be also fixed by using raw string I.e. use r before first quote
      Eg. r"hello world"

    • @jemand4036
      @jemand4036 4 роки тому +1

      thank u so much, i had no idea what was going on!

    • @irfandy4hmad
      @irfandy4hmad 4 роки тому +1

      Thanks bro. U saved our time

    • @michaelworkspace
      @michaelworkspace 4 роки тому

      Irfandy Ahmad No problem bro. Have a good one!

  • @rangabharath4253
    @rangabharath4253 4 роки тому +1

    Awesome as always

  • @bishantadhikari3440
    @bishantadhikari3440 4 роки тому

    Your videos are great and is probably the most easily understandable in all of youtube.
    Also, can you make videos about implementing AI in python cause there are not any good ones that meet your standard

  • @SagarSapkota-eo4ei
    @SagarSapkota-eo4ei 4 роки тому +1

    Hi Corey great and amazing content ..thanks a lot buddy

  • @gtosXD
    @gtosXD 4 роки тому +1

    Thank you!!!! Really great video!

  • @muralitechtimes6487
    @muralitechtimes6487 Рік тому

    You are true MASTER of coding. We are now a days not finding your other videos. Do it for us which re very help us to grow.

  • @Strider1066
    @Strider1066 3 роки тому

    I liked the incremental approach of coding by examples .

  • @Jo-id9zm
    @Jo-id9zm 4 роки тому +6

    You're doing God's work, Corey!

  • @syedabdulsalam4659
    @syedabdulsalam4659 4 роки тому +2

    Finally you are back :p

  • @arnopisspot5115
    @arnopisspot5115 4 роки тому

    Looking forward the panda series boss. Thank you for all your hard work

  • @MAli-os2yj
    @MAli-os2yj 4 роки тому +3

    Hi Corey! Love your videos man! I have finished your Flask series and currently doing the Django series. I have a few suggestions for future videos around the Blog Website that involve some ML and Data Science:
    1. Introduce 'like and comment' functionality to the application which allows a user to comment and like posts by other users.
    2. Then, integrate a recommendation system engine to the Blog application, the system will recommend content to the individual user based on his own posts and the posts that he has liked.
    3. Also, a user can look into the profile of other users and see a word cloud of all posts that the other has submitted, this way they can get a rough idea of what the other uses posts.
    Keep Doing the good work man!

  • @someonesomebody716
    @someonesomebody716 4 роки тому +1

    Data Science + Corey Sir = YAY!!!

  • @alexanderten5497
    @alexanderten5497 4 роки тому +1

    Notification from Corey and Iam here to like the video

  • @hserdarkuyuk
    @hserdarkuyuk 4 роки тому +1

    Please do similar live coding, these are great to improve our perspective

  • @jackleone4150
    @jackleone4150 4 роки тому +2

    If anyone else is facing the UnicodeDecodeError try adding this second argument to the open function =>
    with open('data/survey_results_public.csv', encoding="utf8")

    • @sylvanmasha
      @sylvanmasha 4 роки тому

      wow, thank you so so much. i was facing the same problem

  • @BrendanMetcalfe
    @BrendanMetcalfe 4 роки тому +1

    nice vid! thanks

  • @anandinamdar4054
    @anandinamdar4054 4 роки тому +1

    Legend 🙌🙌

  • @sandeepgaur2407
    @sandeepgaur2407 4 роки тому

    Awesome tutorial like always. Please make video on asyncio in python.. please please.

  • @aydinjalilov2328
    @aydinjalilov2328 4 роки тому

    it blew my mind.

  • @rosyndapee8129
    @rosyndapee8129 4 роки тому

    Nice! What about a future video with some visualization of geospatial data? I.e. using some GPS tracks converted into CSV file with all points lat and long. Would you suggest to use postrgres or postgis to then make queries to the geodata?

  • @736939
    @736939 4 роки тому

    Thank you Corey! Continue please: ScikitLearn, TensorFlow

  • @daltonmarhufu9549
    @daltonmarhufu9549 4 роки тому

    Great vid! Really helpful, could you possibly do a Pandas version of this?

  • @rolandgerard6064
    @rolandgerard6064 4 роки тому +1

    man, you are good 👍

  • @nowyouknow2249
    @nowyouknow2249 4 роки тому +1

    Nailed it

  • @md.akib5124
    @md.akib5124 4 роки тому +1

    heeiiii corey. awesome to see another amazing tut from you

  • @barungh
    @barungh 4 роки тому +1

    Badass !👍👌

  • @theDrewDag
    @theDrewDag 4 роки тому

    Please make this the first of a Data Science playlist. PLEASE.

  • @nitishchhabra_me
    @nitishchhabra_me 4 роки тому

    Hey @corey please make videos for Airflow DAG's also. They are basically python codes so it would fit nicely within your python series

  • @javierrivasseau8956
    @javierrivasseau8956 Рік тому

    Hi Corey, how are you? i have been watching your series and videos, and think they are the very best out there, i really like to learn with you, I would like to ask you some things if you could solve them, when you make the matplotlib series, you separate the convertedcomp for all the devs and for the python dev and javascript dev, can you upload a video to solve it? I tried anyway and could not. ?

  • @Medhusalem
    @Medhusalem 4 роки тому

    Hey Corey, I have seen the previous videos on DataFrames with Pandas where you used Jupyter as well. You mentioned in this video that it is nice that you can run code brackets or just certain lines of code.
    My question would be then, what is the difference between Jupyter and Spyder? Since Spyder can also execute certain code lines, without loading everything in again and has a super awesome functionality where you have a variable manager in which you can view every variable you declared.

  • @user-bs3jr5ix5r
    @user-bs3jr5ix5r 4 роки тому

    Hello, Corey! Please append your matplotlib tutorial with a lesson about Legend Picking

  • @skandrskon7091
    @skandrskon7091 4 роки тому

    for line in csv_reader :
    counts[line['hobbyist'] ] += 1
    really great idea for increment on counts dict
    thanks

  • @kingslypaul2999
    @kingslypaul2999 4 роки тому

    Can u make a video on pyttxs3 ...or something better .....thanks for all your God skilled contents that makes ppl understand anything u teach

  • @DistortedV12
    @DistortedV12 4 роки тому +2

    From this, looks like you can't go wrong with javascript, python, SQL and a C type language. Wonder how long these will dominate.

  • @morganfree100
    @morganfree100 4 роки тому +1

    More please sir..(:-

  • @rahulsharma-st5yg
    @rahulsharma-st5yg 4 роки тому +1

    @corey: when to expect some data science real life projects?

  • @shivamkumraa
    @shivamkumraa 4 роки тому +8

    Who else liked before watching this. Just 1000% sure this guy can't make any shit.

    • @Jmzz542
      @Jmzz542 4 роки тому +1

      what makes you say that?

  • @opst1704
    @opst1704 4 роки тому

    If one day I decide to learn Python, then do I waste my time by learning C language now?
    I am learning C, primarily because I want to understand how memory works and DSA is easier to understand with C. Will this knowledge be helpful if I decide to learn Python? Seems like C and Python have a lot of differences, in Python there are so many things that I don't need to worry about.
    One more question, is it difficult to find out what is happening behind scenes in Python, for example: implementation of input() function?
    Great videos, the main reason why I am considering to learn Python.

  • @mitos95176
    @mitos95176 4 роки тому +1

    Now I want to perform this task using pandas.

  • @strange_man
    @strange_man 4 роки тому +1

    The video that I need :D

  • @sprk2009
    @sprk2009 4 роки тому

    Your videos are awesome. I am new to Python and preparing for Python certification. Can you please let me know which one of the following programs are good to proceed? Thanks in advance.
    1. PCAP: Programming Essentials in Python from Netacad
    2. Microsoft 98-381 Introduction to Programming Using Python

  • @iangitonga2811
    @iangitonga2811 4 роки тому +2

    Great content.

  • @aryanvikash1967
    @aryanvikash1967 4 роки тому

    Corey please create a tutorial on python coroutine.. async await its so confusing .. please

  • @graybits4111
    @graybits4111 4 роки тому

    thanks for this great video
    the total % for each dev type should be 100%, which is not the case

    • @coreyms
      @coreyms  4 роки тому

      Not necessarily. They can choose multiple languages. So for example, for front-end developers, most said they know JavaScript, but they also knew other languages as well. So each language is a percentage of how many developers said they know a particular language. Hope that clears it up

    • @graybits4111
      @graybits4111 4 роки тому

      @@coreyms it is clear now, thanks

  • @arkhamian7586
    @arkhamian7586 4 роки тому +1

    Don't stop doing this.

  • @ebrahimm.roshdy4487
    @ebrahimm.roshdy4487 4 роки тому

    Please do it using pandas since it's way easier and so common for data scientists

  • @chinmayabarik557
    @chinmayabarik557 4 роки тому

    Please do a pandas tutorial with this survey results sir..

  • @natesh1
    @natesh1 4 роки тому

    Please do a pandas tutorial series with this survey results too.

    • @bluekeybo
      @bluekeybo 4 роки тому

      You didn't watch the video till the end did you

  • @pingojr4048
    @pingojr4048 4 роки тому

    Can you do something for home bill management?

  • @ashissahoo4158
    @ashissahoo4158 3 роки тому

    Is the pandas version of the last question has been uploaded?

  • @johnnya590
    @johnnya590 4 роки тому +1

    I don't know if you realize this, but at the start of EVERY video you say "Hey there how's it going everybody?" :)

  • @gnuter3000
    @gnuter3000 4 роки тому +1

    greatgreatgreat

  • @Medhusalem
    @Medhusalem 4 роки тому +1

    It didn't work somehow and I had to do a google search on my error: 'charmap' codec can't decode byte 0x81 in position 5739.
    So I had to change the with open part and specify the encoding part, after that everything worked fine.
    *with open('data/survey_results_public.csv', encoding='utf-8') as f:*

  • @amankothiyal7074
    @amankothiyal7074 3 роки тому

    Hey can you analyse it using numpy library

  • @boolloop
    @boolloop 4 роки тому

    I love your videos, but I wondered why you didn't use pandas for this?

    • @coreyms
      @coreyms  4 роки тому +3

      I mentioned the reason in the video. I wanted to show how to do this without Pandas first, and once I release my Pandas series then I will redo this video using Pandas as well. That way people who are unfamiliar with Pandas can do this as well

    • @boolloop
      @boolloop 4 роки тому

      @@coreyms neat, looking forward to it!

  • @pavanmandikal
    @pavanmandikal 4 роки тому +1

    Please do make a video on numpy also

  • @hemanth6951
    @hemanth6951 4 роки тому +1

    i saw you in another UA-cam channels

    • @coreyms
      @coreyms  4 роки тому +1

      Which one was that?

    • @hemanth6951
      @hemanth6951 4 роки тому +1

      @@coreyms some thing pyconf channel with some one interviewing with you last year

    • @coreyms
      @coreyms  4 роки тому +1

      @@hemanth6951 Oh yeah! I think that was Dan Bader. He's a great guy.

    • @hemanth6951
      @hemanth6951 4 роки тому +2

      @@coreyms yes yes yes finally I saw you and this second time I'm chatting with you I'm so happy and luky 😊😀😀

  • @yuliu1105
    @yuliu1105 4 роки тому

    Please update pandas version of the 2nd half of the problem, my forloops don't work, bummer. Thank you

  • @darrenlefcoe
    @darrenlefcoe 4 роки тому

    On inspection of the data. The csv.DictReader(f) throws 594 errors in the for loop >> line in csvReader:
    Not sure what causes the error, but this exists.
    Needs to be handled with
    try:
    except UnicodeDecodeError:
    who else sees this ?

  • @marinbeslo7841
    @marinbeslo7841 4 роки тому

    Is the video with pandas-way out? :)

  • @darrenlefcoe
    @darrenlefcoe 4 роки тому

    for line in csvReader: failed on line 16.
    UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 5739: character maps to
    problem with the data-set or other reason ?

    • @sylvanmasha
      @sylvanmasha 4 роки тому +1

      f anyone else is facing the UnicodeDecodeError try adding this second argument to the open function =>
      with open('data/survey_results_public.csv', encoding="utf8")
      this is a comment i found by Jack Leone. you're welcome

  • @shaguftakhan302
    @shaguftakhan302 3 роки тому

    I clicked on this video just to see what data scientists do.

  • @adildeshmukh4140
    @adildeshmukh4140 4 роки тому +1

    yes, please pandas

  • @christiaan3315
    @christiaan3315 3 роки тому

    Sorry. I am getting error messages... 'for line in csv.reader: row = next(self.reader) etc

  • @Ajithkumarsekar96
    @Ajithkumarsekar96 4 роки тому

    Almost everyone has used Javascript and html/css? From front-end developer to system-administrator

  • @samishabli5191
    @samishabli5191 4 роки тому

    i got this error: Yes
    Traceback (most recent call last):
    and it just shows me the first yes and no
    and in the end it says: return codecs.charmap_decode(input,self.errors,decoding_table)[0]
    UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 5739: character maps to

    • @atishayshukla1117
      @atishayshukla1117 4 роки тому +2

      Its an encoding issue with the file, open a file by specifying proper encoding something like: with open('data/survey_results_public.csv', encoding='utf8')

    • @ThePasupathi
      @ThePasupathi 4 роки тому

      @@atishayshukla1117 thank you..it works

  • @olicairns8971
    @olicairns8971 4 роки тому

    This is much easier if you use pandas

    • @coreyms
      @coreyms  4 роки тому

      Agreed. I mentioned this in the video, but I wanted to show how to do this without Pandas first, and once I release my Pandas series then I will redo this video using Pandas as well. That way people who are unfamiliar with Pandas can do this as well