Karina Data Scientist
Karina Data Scientist
  • 113
  • 105 393
Advanced Pandas Tutorial: 7 Data Analysis Techniques You Need to Know
Master Advanced Pandas Techniques for Data Analysis | Python Tutorial
In this hands-on tutorial, we'll cover essential advanced Pandas techniques including rolling averages, pivot tables, lambda functions, percent change, .query and cross-tabulation.
----------------------
Dataset used in this video - www.kaggle.com/datasets/yusufdelikkaya/online-sales-dataset
----------------------
🐼 Pandas Cheat Sheet - stan.store/KarinaDataScientist/p/complete-pandas-cheat-sheet
💻 Data Analysis with Python Masterclass - stan.store/KarinaDataScientist/p/start-programming-with-python
💌 Join my newsletter and get access to various freebies - stan.store/KarinaDataScientist
🧠 AI to help you analyse your data - powerdrill.ai/?via=karina-samsonova
----------------------
Timestamps:
----------------------
00:00 Intro
00:30 Dataset
00:42 Read the file
01:22 Convert object to datetime
02:00 Add new column
03:35 Add Weekday
04:58 Rolling Sum
08:23 Percent change
11:05 Advanced aggregation
14:49 lambda funciton
17:51 Advanced pivot
20:07 Cross tabulation
21:45 Product analysis
23:24 .query for filtering
24:20 Category performance
🎥 Other videos you might be interested in
----------------------
ua-cam.com/video/MZCUIcCSySY/v-deo.html
ua-cam.com/video/xLW796-J5fI/v-deo.html
----------------------
About me
----------------------
Hi, my name is Karina and I'm a finance person turned data person.
My mission is to transform intimidating tech into accessible tools. I aim to empower 1 million people to harness the power of AI, Python, SQL, and Excel to work smarter, not harder.
Contact
----------------------
UA-cam: youtube comments are by far the best way to get a response from me!
email for business inquiries only: karinadatascientist@gmail.com
----------------------
Social Media:
----------------------
TikTok: www.tiktok.com/@karinadatascientist
Instagram: karinadatascientist
Linkedin: www.linkedin.com/in/karina-samsonova/
Переглядів: 323

Відео

How I'd Learn Python for Data Analysis in 2025
Переглядів 1,3 тис.21 годину тому
Why Python is so popular among data analysts and data scientists. What modules / libraries you should learn 🐼 Pandas Cheat Sheet - stan.store/KarinaDataScientist/p/complete-pandas-cheat-sheet 💻 Data Analysis with Python Masterclass - stan.store/KarinaDataScientist/p/start-programming-with-python 💌 Join my newsletter and get access to various freebies - stan.store/KarinaDataScientist 🧠 AI to hel...
How To Write a Data Analyst / Data Scientist Resume
Переглядів 97014 днів тому
Are you struggling to land your dream job in data analytics or data science? In this video, I review real CVs and provide tips to help you create a standout resume that grabs recruiters' attention. Whether you're a beginner or an experienced professional, these insights will help you craft a resume tailored for top analytics roles. What You'll Learn: Common mistakes in data science CVs and how ...
Basic Guide to Pandas in 20 min. Python for Beginners
Переглядів 1,1 тис.21 день тому
This video is an introduction to Pandas. We will work with Customer dataset. We will discuss how to read file, view statistics, check for duplicates, how to group, pivot, sort data, create bins using .cut and how to create a simple bar chart. Dataset - www.kaggle.com/datasets/mjlyma/customer-purchase-dataset Pandas Documentation - pandas.pydata.org/docs/user_guide/index.html 🐼 Pandas Cheat Shee...
Credit Card Fraud Detection using ML (Python) | Random Forest
Переглядів 1,6 тис.Місяць тому
This video is a ML tutorial. We will work on credit card fraud detection model using random forest. Dataset used - www.kaggle.com/datasets/nelgiriyewithana/credit-card-fraud-detection-dataset-2023/data 💌 Join my newsletter and get access to various freebies - stan.store/KarinaDataScientist 🧠 AI to help you analyse your data - powerdrill.ai/?via=karina-samsonova Timestamps: 00:00 Intro 00:12 Lib...
Master SQL Window Functions in 15 Minutes | ROW_NUMBER, LAG/LEAD, RANK, FIRST_VALUE
Переглядів 406Місяць тому
Master SQL Window Functions in 15 Minutes | ROW_NUMBER, LAG/LEAD, RANK, FIRST_VALUE
Time series forecasting in ML (ARIMA, Holt-Winters)
Переглядів 1,4 тис.Місяць тому
Time series forecasting in ML (ARIMA, Holt-Winters)
SQL Tutorial (JOIN, GROUP BY, SUBQUERY, HAVING, IN)
Переглядів 511Місяць тому
SQL Tutorial (JOIN, GROUP BY, SUBQUERY, HAVING, IN)
SQL Tutorial (SELECT, WHERE, AND, OR, COUNT, SUM, AVG, ROUND)
Переглядів 733Місяць тому
SQL Tutorial (SELECT, WHERE, AND, OR, COUNT, SUM, AVG, ROUND)
What NOT to do in a Job Interview
Переглядів 4342 місяці тому
What NOT to do in a Job Interview
Exploratory Data Analysis in 20 Minutes
Переглядів 1,5 тис.2 місяці тому
Exploratory Data Analysis in 20 Minutes
Excel Lookups Made EASY! Master VLOOKUP, HLOOKUP, and XLOOKUP in 10 MIN
Переглядів 1 тис.2 місяці тому
Excel Lookups Made EASY! Master VLOOKUP, HLOOKUP, and XLOOKUP in 10 MIN
Build Image Recognition Model In Python in 20 min
Переглядів 1,2 тис.2 місяці тому
Build Image Recognition Model In Python in 20 min
Data Cleaning in Pandas in 20 minutes
Переглядів 4,2 тис.3 місяці тому
Data Cleaning in Pandas in 20 minutes
Surprising Ways to Build a PERFECT DATA Portfolio
Переглядів 2,3 тис.3 місяці тому
Surprising Ways to Build a PERFECT DATA Portfolio
Word Cloud in Python
Переглядів 7193 місяці тому
Word Cloud in Python
For Loops and While Loops in Python
Переглядів 3713 місяці тому
For Loops and While Loops in Python
Hugging Face Tutorial (2024) - Sentiment Analysis, Text Generation, LLM
Переглядів 1,6 тис.3 місяці тому
Hugging Face Tutorial (2024) - Sentiment Analysis, Text Generation, LLM
Data Analytics Portfolio Project - Netflix customer base (Python)
Переглядів 2,7 тис.4 місяці тому
Data Analytics Portfolio Project - Netflix customer base (Python)
Excel Functions EVERYONE needs (FILTER, SORT, DATEDIF, TEXTSPLIT, TEXJOIN, CHOOSECOLS)
Переглядів 2,5 тис.4 місяці тому
Excel Functions EVERYONE needs (FILTER, SORT, DATEDIF, TEXTSPLIT, TEXJOIN, CHOOSECOLS)
Data Science Portfolio Project - Churn prediction model
Переглядів 4,1 тис.4 місяці тому
Data Science Portfolio Project - Churn prediction model
How To Make Beautiful Line Charts Using Matplotlib in Python
Переглядів 6644 місяці тому
How To Make Beautiful Line Charts Using Matplotlib in Python
Why Polars is better than Pandas?
Переглядів 9984 місяці тому
Why Polars is better than Pandas?
How I'd become a Data Analyst if I had to start today
Переглядів 2,4 тис.5 місяців тому
How I'd become a Data Analyst if I had to start today
Data Analysis Project in Excel (Beginner to Intermediate)
Переглядів 3,5 тис.5 місяців тому
Data Analysis Project in Excel (Beginner to Intermediate)
Data Analytics Portfolio Project - Coffee Sales
Переглядів 6 тис.5 місяців тому
Data Analytics Portfolio Project - Coffee Sales
Data Science Project - RFM model
Переглядів 4,2 тис.5 місяців тому
Data Science Project - RFM model
T-test in MS Excel
Переглядів 2815 місяців тому
T-test in MS Excel
Data Science Portfolio Project - K-means
Переглядів 2,4 тис.6 місяців тому
Data Science Portfolio Project - K-means
Monte Carlo Simulation (Predict NVIDIA Share Price)
Переглядів 6896 місяців тому
Monte Carlo Simulation (Predict NVIDIA Share Price)

КОМЕНТАРІ

  • @olubunmifaneye8395
    @olubunmifaneye8395 12 годин тому

    Thank you

  • @saifullahhaidar391
    @saifullahhaidar391 15 годин тому

    Great sharing Karina. Karina can I add these in my portfolio as I am on the job hunt.

  • @mohammedrafi9662
    @mohammedrafi9662 17 годин тому

    Make video on how mba finance guy became data analyst

  • @hamidraza1584
    @hamidraza1584 20 годин тому

    Amazing video, video and your look is awesome , love from Lahore Pakistan

  • @ahmedammarali
    @ahmedammarali День тому

    Hello Karina, Have a good day, My name is Ahmed Hassan, I'm an ex-finance and now I am trying to get my way through the data field, I found your fascinating UA-cam channel while I was trying to learn from experts in this field and I found your videos so helpful, so I'm sending to you firstly to thank you very much for your wonderful efforts, secondly to ask you if you don't mind if I tagged you and insert the link of your UA-cam videos on my posts on linked in that explain the projects I made and who help me to learn me the essential skills to achieve it. once again thank you for your help to teach others. Have a good time.

    • @karinadatascientist
      @karinadatascientist 23 години тому

      Hi Ahmed. Of course tag me on Linkedin, that's very kind of you. Thank you

    • @ahmedammarali
      @ahmedammarali 22 години тому

      @karinadatascientist Hi Karina, Thank you, I'm very happy by your response.

  • @reanwithkimleng
    @reanwithkimleng 5 днів тому

    Hello sister , feature selection of three columns in 3D if more than three??❤

  • @thugthamizha11
    @thugthamizha11 6 днів тому

    Merci beaucoup madame. C'est vraiment utile

  • @AbyxsDragon
    @AbyxsDragon 7 днів тому

    Thank god there are subtitles Because i am definitely hearing that wrong 😅 Shame there are no other comments I wanted to know if other people heard it too

    • @ivanlim7583
      @ivanlim7583 7 днів тому

      😂

    • @karinadatascientist
      @karinadatascientist 6 днів тому

      yes, other people heard it too! On TikTok they didn't hold it back, which made this video go sort of viral, getting nearly 23,000 views 😂.

  • @okotpascal
    @okotpascal 8 днів тому

    I need help here on how you came up with the offset arrangement and criteria. I ask this because I am working on a different data set and I really need to include the legend/label correctly. thanks

    • @karinadatascientist
      @karinadatascientist 8 днів тому

      It is a manual process, I had to just try different options till it was placed where I wanted it to be. For example, with line labels, this is a code I used: labels = df_transformed.columns offsets = [0, 0, 0, 5, 0, 2] # Adjust positions to avoid overlap for i, (label, offset) in enumerate(zip(labels, offsets)): y_pos = df_transformed[label].dropna().iloc[-1] + offset ax.text(df_transformed.index[-1] + 2, y_pos, label, color=COLOR_SCALE[i], fontsize=10, va='center') I started with offset all zeroes, didn't like it, started adjusting one by one to see what works best. Same with x, y here: fig.text( 0.015, 0.90, # (x,y) coordinates "Share of boys or girls within the relevant age group who are enrolled in primary, secondary, and tertiary education.", color=GREY30, fontsize=12, )

  • @okotpascal
    @okotpascal 8 днів тому

    I'm happy you learnt python, I am learning a lot from you already. Will you consider creating a Web app using Taipy(maybe to track vehicles), I have already seen some good videos on YT but I also know I will grasp it better from you. Thank You for the great effort.

  • @karthiksundaram544
    @karthiksundaram544 8 днів тому

    Hi can u suggest good courses on data analyst 🎉

    • @karinadatascientist
      @karinadatascientist 8 днів тому

      edX courses are good and free. You have to try various one to see which resonates. With some you will not like instructor, with others - boring tasks. You need a combination of courses to find what you like :)

    • @karthiksundaram544
      @karthiksundaram544 7 днів тому

      hi can u suggest any good course on generative ai

    • @karthiksundaram544
      @karthiksundaram544 6 днів тому

      hi is coursera a good choice

  • @nindogaming4132
    @nindogaming4132 9 днів тому

    To add on, a decent course I find helpful was Jose Portilla's Machine learning course. It has a decent data section for pandas, numpy, and matplotlib

  • @CultureofSpeech
    @CultureofSpeech 9 днів тому

    Bravo 👏 Maestro 👏 Lit 🌠 Impressive 😍 Big respect and gratitude for your satisfactory Work 🚀🌱🌟

  • @bhushanlonkar5117
    @bhushanlonkar5117 9 днів тому

    Please Same videos on SQl, Excel and PowerBi.

  • @ZakirHussain-nd4fw
    @ZakirHussain-nd4fw 9 днів тому

    Are you Reading?

  • @nindogaming4132
    @nindogaming4132 10 днів тому

    Subscribed! Can you give recommendations on what you think would be great end to end (python, sql, tableau) resume ready projects for aspiring data analysts?

    • @karinadatascientist
      @karinadatascientist 10 днів тому

      Thank you. Any business related projects - customer related, product related, expenses. Marketing campaign analytics (google, facebook, tiktok), user customer analysis (to help create a customer avatar), sales analysis. That's why I believe that sports or health projects are great, especially for practicing, but shouldn't be included in the portfolio, if you want to work for e-commerce, for example.

    • @nindogaming4132
      @nindogaming4132 9 днів тому

      ​@karinadatascientist Thanks! An e-commerce sales dashboard was on my list, but I wanted to do a Covid-19 Vaccination Analysis and a healthcare readmission prediction using logistic regression. Should I drop these in favor of more business oriented projects? I know I'm asking a lot of questions haha, but any help would be greatly appreciated!

    • @karinadatascientist
      @karinadatascientist 9 днів тому

      @@nindogaming4132 I would avoid COVID19 data. It was very popular in 2019 and 2020 :) Choose more business oriented

    • @nindogaming4132
      @nindogaming4132 9 днів тому

      @@karinadatascientist Thank you!

  • @bahar.around.theworld
    @bahar.around.theworld 10 днів тому

    thanks for this great tutorial!

  • @xalxika
    @xalxika 10 днів тому

    Thank you! I'm a very new programmer and learning Python - haven't heard of Jupyter before so this turned me on to that. Cheers!

  • @hoperight5280
    @hoperight5280 10 днів тому

    Great tutorial, but just to mention that I generally use the MATCH function inside of the CHOOSECOLS; it will avoid me counting the number of columns that I want to return in my new table.

    • @karinadatascientist
      @karinadatascientist 10 днів тому

      Fair enough. If you know how to use index+match, you can achieve a lot in Excel!

  • @karthiksundaram544
    @karthiksundaram544 11 днів тому

    🎉

  • @luffysan2662
    @luffysan2662 11 днів тому

    Hey Karina 😁 guess who's here haha.

  • @balakrishna8167
    @balakrishna8167 12 днів тому

    How AI is impacting data science roles Karina? People are recommending to transition into mlops or data engineering

    • @karinadatascientist
      @karinadatascientist 12 днів тому

      I think transitioning into data engineering can be a good choice, if you are interested in that field. AI will not replace data science or data analytics, but it is another tool that will be a part of our daily workflow

  • @siddhilipare
    @siddhilipare 14 днів тому

    Thank you for making this video. The tips are very helpful. 😊

  • @emmanueledozie1357
    @emmanueledozie1357 14 днів тому

    I have really learnt a lot following you especially with thst your python tips you drop occasionally.

  • @emmanueledozie1357
    @emmanueledozie1357 14 днів тому

    I just found out this is what i have been looking for 🎉

  • @emmanueledozie1357
    @emmanueledozie1357 14 днів тому

    Thanks ❤

  • @BillyT83
    @BillyT83 15 днів тому

    Useful recommendations and tips! Thank You, Karina! 😀

  • @anuvabkumarrit4669
    @anuvabkumarrit4669 15 днів тому

    Yes interested in part 2, if possible

  • @SuccessVstheworld
    @SuccessVstheworld 15 днів тому

    Hwyyyy was hoping if you added a template for us to download and tweak

    • @karinadatascientist
      @karinadatascientist 15 днів тому

      The CV template I show is available as a part of my freebies package, that you get once you sign up to my newsletter stan.store/KarinaDataScientist/

  • @JT-lo9nv
    @JT-lo9nv 16 днів тому

    Thank you for all the content you have provided on your channel! As an aspiring data analyst your channel has been a great resource. I also just purchased your Python for data analysis course!

  • @Saicharan_beast
    @Saicharan_beast 16 днів тому

    superb teaching

  • @paulok8232
    @paulok8232 18 днів тому

    Thank for sharing! hugs from Brasil

  • @aloneshortofficial2770
    @aloneshortofficial2770 18 днів тому

    Thanks a lot for necessary information 😊😊

  • @prachigupta5800
    @prachigupta5800 19 днів тому

    Nice Explanation ❤

  • @moonilyas
    @moonilyas 19 днів тому

    Great Tutorial, hope more parts will be coming soon!

  • @RakshitDwivedi-pq8qc
    @RakshitDwivedi-pq8qc 21 день тому

    This is amazing. I am just starting into Python especially the main libraries for data science. Can you make a short video on Numpy and Matplotlib and Sklearn as well. These short but detailed videos are so entertaining and helps retain in memory for long.

  • @erickmuthama2106
    @erickmuthama2106 21 день тому

    Thank you for your teachings

  • @galukashadrack1304
    @galukashadrack1304 22 дні тому

    Thanks for shairing this

  • @galukashadrack1304
    @galukashadrack1304 22 дні тому

    Thank you very much for this

  • @shaikusman536
    @shaikusman536 22 дні тому

    Awsome Content karen..thanks a ton...Respect from Bangalore...

  • @okotpascal
    @okotpascal 23 дні тому

    I would love for you to include some chaining or usage of the ".pipe" attribute in the subsequent parts that you're yet to upload. Or maybe in a separate video not in this series if possible. thank you for the good work and free education

  • @hamidraza1584
    @hamidraza1584 23 дні тому

    You are amazing teacher, lots of love from Lahore

  • @martinotieno4524
    @martinotieno4524 23 дні тому

    Can you do something on git and git hub

  • @martinotieno4524
    @martinotieno4524 23 дні тому

    Short and precise