This is really awesome. I just started as an absolute beginner of coding, only finished Dojo's tutorial for the absolute beginner, and I am able to catch up with most of what you taught so far (1:39:00)!! Thank you!!!
Very useful class for understanding some basic aspect of pandas that is often not explained in other tutorials, Long data, applying functions to dataframes and using accessors. Thank you!!
Great lecture ! Thanks I just have one doubt at 2:05:39 we use avg_2 function, but we did not need to vectorise it on the other hand avg_2_mod needed vectorisation. I can’t understand what is the difference between two functions… I.e. why does one need vectorisation and the other doesn’t for the same inputs ??
I have a query regarding running my python code in jupyter notebook. Sometime in the middle during running code, the cursor jumps to the next cell instead of running code. I have tried so many things e.g restart the notebook, rewrite code and so many but the same result. Can anybody help me regards this issue?
Excellent tutorial. Very detailed. I have one gripe though. And it's not Daniel. EVERYONE/EVERY tutorial does this. They name their dataframe df. That's like naming your spreadsheet "spreadsheet" or "ss". Or naming a variable by it's datatype. No one ever names age as "i" or "int". They call their variables by the real world things they are. And a dataframe is a variable. DataFrames should be named like we name spreadsheets (their tabs) or database tables.
hungarian notation has a place in python. including the type in the name tells you what type it should be, though you need typechecking and asserts to guarantee that.
At @2:10:49 you directly split the Series without using.. Str. Split('/'). How it split data frame Series. In previous example while splitting we use.. Str. Split. Pls explain
I think you're talking about the "function" example/question. so i'm not using str.split becuase that's how you use split in a pandas series. but we're writing a function that takes in a single string so we have direct access to the string methods becuase it's really regular "my_string".split("_") in base python. we then apply the function to our data.
Why does my output even include Nan values from the table shown in ua-cam.com/video/5rNu16O3YNE/v-deo.html like for the day "Thur" it shows Lunch and Dinner(this has Nan) but in video, there's only lunch
yeah, but you cant automate any reporting in excel, with pandas you need to write code only once for any report and next time onward you can reproduce it.
thanks for letting me know. I just realized the other day that I can get a little more screen real estate by hitting F11 so I'll be sure to do this in the future.
@@enthought hi! Where can i find the dataset and the codes to follow along? Couldnt find it in Daniel Chen's github page. Could u please send me the link or email me at tolekbaeva@bk.ru. Thank you!!
@@RAL2010 oh did not know that's what caused it. :\ I use my phone for my teaching notes. Since it's a live coding sessions it would be super disruptive to tab back and fourth on the screen... Might also be that the phone was probably pluged in and charging. Would the interference be just from the charger? or does putting it in airplane mode help?
1:03 Intro
5:37 Intro to Pandas
47:29 Tidying Dataset
1:39:17 Apply method on a DataFrame
2:45:05 Modelling and Data Preparation for ML
Daniel, best teacher in the world ! nothing is better than teaching with live examples it is very intuitive !
This is really awesome. I just started as an absolute beginner of coding, only finished Dojo's tutorial for the absolute beginner, and I am able to catch up with most of what you taught so far (1:39:00)!! Thank you!!!
This video explains Pandas so well. Great job Daniel, this is by far the best Pandas video on youtube.
This is hands down the best panda tutorial I've ever watched so far. Thank you, Daniel:)
His Book Pandas for everyone is the best out there. Really in-depth.
That was a long video but I managed to complete it. I liked the honesty of the guy and he was doing things on the go. Chapeau bas!
I've watched bunch of Pandas tutorial videos and this is definitely the best one so far.
Sweet! all these for freeeee.... I'm a true believer that information should be free and i say a BIG THANK YOU for this Daniel,
Watched the entire video - really helpful stuff as a pseudo beginner.
Best Pandas tutorial so far I can find. Thanks.
one of the best tutorials on pandas
Best pandas tutorial… glad I found this talk.
Very useful class for understanding some basic aspect of pandas that is often not explained in other tutorials, Long data, applying functions to dataframes and using accessors. Thank you!!
that mitigated my pandas headache! Thanks!
@1:15:40 it is:
ebola_long['cd_country'].str.split('_').str.get(0)
Great video for beginners. Thanks for sharing.
Nice tutorial indeed. Can you point out any such nice tutorial for beginners of ML?
Well explained ...Thank you Daniel.
Great lecture ! Thanks
I just have one doubt at 2:05:39 we use avg_2 function, but we did not need to vectorise it on the other hand avg_2_mod needed vectorisation. I can’t understand what is the difference between two functions… I.e. why does one need vectorisation and the other doesn’t for the same inputs ??
is there a single place where we can find these datasets, like a shared drive perhaps? Would be good to follow along with that as well.
Sir at 1:15:45 , we need to call two str to get the desired value,
Like, ebola_long['cd_country'].str.split('_').str.get(0)
true, I know it works by calling it twice but it does it make intuitive sense to call it twice.
Excellent, he forget to do it this time, but he did it in pyData conference in 2018
ua-cam.com/video/iYie42M1ZyU/v-deo.html
It was very helpful and informative. Thank you very much for posting this video!
Where can I access the datasets being used in this video?
where can i find the data that has been used in this video, please ?
@1:18 How would be I sure that the new columns corresponds to correct row?
I have a query regarding running my python code in jupyter notebook. Sometime in the middle during running code, the cursor jumps to the next cell instead of running code. I have tried so many things e.g restart the notebook, rewrite code and so many but the same result. Can anybody help me regards this issue?
Check if your kernel shut off due to inactivity or something
@@puar6124 checked it too... but still the same :(
This is awesome. Thank you.
Recommended for beginners
you dropped total_bill in X=tips_dummy no?
Excellent tutorial. Very detailed. I have one gripe though. And it's not Daniel. EVERYONE/EVERY tutorial does this. They name their dataframe df. That's like naming your spreadsheet "spreadsheet" or "ss". Or naming a variable by it's datatype. No one ever names age as "i" or "int". They call their variables by the real world things they are. And a dataframe is a variable. DataFrames should be named like we name spreadsheets (their tabs) or database tables.
hungarian notation has a place in python. including the type in the name tells you what type it should be, though you need typechecking and asserts to guarantee that.
At @2:10:49 you directly split the Series without using.. Str. Split('/'). How it split data frame Series. In previous example while splitting we use.. Str. Split. Pls explain
I think you're talking about the "function" example/question.
so i'm not using str.split becuase that's how you use split in a pandas series. but we're writing a function that takes in a single string so we have direct access to the string methods becuase it's really regular "my_string".split("_") in base python.
we then apply the function to our data.
Can i have the access to your notes u have? please
of if someone is having ?
You can get all the notes from Github
Why does my output even include Nan values from the table shown in ua-cam.com/video/5rNu16O3YNE/v-deo.html like for the day "Thur" it shows Lunch and Dinner(this has Nan) but in video, there's only lunch
Great tutorial ,great Daniel :) thanks
I need to remember the syntax, while at the same time excel show you average value ,jus drag to your data , the average showed
yeah, but you cant automate any reporting in excel, with pandas you need to write code only once for any report and next time onward you can reproduce it.
i need an extra tutorial for that
what os is that, is he on kde neon?
KDE for sure.
I run/ran arch (antergos) with KDE.
the window size for coding should be full windowed, can't see quite well
thanks for letting me know. I just realized the other day that I can get a little more screen real estate by hitting F11 so I'll be sure to do this in the future.
Is the Slack Channel still working? I can't find it.
Sorry, Robb. The SciPy 2019 Slack Channel is no longer active.
@@enthought hi! Where can i find the dataset and the codes to follow along? Couldnt find it in Daniel Chen's github page. Could u please send me the link or email me at tolekbaeva@bk.ru. Thank you!!
@@KukaKaz I just found it today by searching on GitHub using the description in the video
Does the audio artifact on this video ever stop? Driving me crazy.
it's his phone, he should have switched it off.
@@RAL2010 oh did not know that's what caused it. :\ I use my phone for my teaching notes. Since it's a live coding sessions it would be super disruptive to tab back and fourth on the screen...
Might also be that the phone was probably pluged in and charging. Would the interference be just from the charger? or does putting it in airplane mode help?
Thanks Daniel
He was really a nice guy
2:00:00 vectorize it is useful for me
excellent thanks you
great content, annoying crackling noise :(
excellent
million likes
Melt around 50:00
Melt 50:00
Pivot 1:20:00
Apply 1:39:00
24:35
you think. wowl
Pandas was never that easy elsewhere
in excel the pivot table stuff is much easier, (for me at least)
Again you make video. Put that Mobile phone away from your mic.
he has great content, but that phone interference is so distracting