Handling Missing Value in Time Series Data using Python

  • Published 2 Nov 2024

COMMENTS • 46

  • @jaspreet0305
    @jaspreet0305 4 years ago +21

    Viewers who already have prior knowledge of TSA can jump directly to 23:46 to skip the fundamentals and get to the main content on imputation strategies.

  • @abdulhannan-go8hl
    @abdulhannan-go8hl 1 year ago +1

    Wow! Just awesome. I rarely comment on any YouTube video, but this series is a masterpiece, and I am looking forward to completing it. For anyone searching for a good time series playlist, this one is sufficient to learn from A to Z.

  • @rabbilbhuiyan5666
    @rabbilbhuiyan5666 3 years ago +1

    Excellent and clear video on handling missing values in time series data. Thanks a lot for sharing this knowledge!

  • @satyamrastogi5316
    @satyamrastogi5316 4 years ago +3

    Actively following your time series playlist! Please upload more videos and in-depth algorithms for time series forecasting. Thanks a lot, sir.

    • @AIEngineeringLife
      @AIEngineeringLife 4 years ago +2

      Thank you, Satyam. I will get to algorithms only next month. This month is more for the foundational concepts of time series, which are key before getting into algorithms.

  • @vipulgaurav3813
    @vipulgaurav3813 4 years ago +3

    Wonderful content, sir! Loved the hands-on approach; it taught me how to handle a problem, not just the theory. Thank you! I will be following the entire playlist and waiting for more.

  • @anujsaboo4938
    @anujsaboo4938 4 years ago +1

    Thank you for making such detailed videos on a topic where not enough content is available. This really helps.
    One note: the last imputation (previous year) has a mistake. The corrected code is below:
    aq_df_imp['TIME_PYEAR'] = aq_df_imp.apply(lambda x: aq_df.loc[x['year_month_day_hour'] + pd.offsets.DateOffset(years=-1)]['TEMP'] if pd.isna(x['TEMP']) else x['TEMP'],axis=1)

  • @sckeshari
    @sckeshari 4 years ago +1

    A very intuitive and clear video that is easy to understand.

  • @jaspreet0305
    @jaspreet0305 4 years ago +3

    Lots of details and concepts discussed. Thanks a lot for sharing.
    Sir, if you could share the GitHub repo for this code in the description box, it would be much easier for us to replicate the code and spend more time absorbing the concepts and less time retyping it.

    • @AIEngineeringLife
      @AIEngineeringLife 4 years ago +3

      Jaspreet - My git repo for this series is here - github.com/srivatsan88/End-to-End-Time-Series
      Maybe I need to go back and edit my video with individual links when I get time :)

    • @jaspreet0305
      @jaspreet0305 4 years ago +1

      @AIEngineeringLife Thanks for sharing the link. This repo will help a lot.
      To save you the time of editing descriptions, we viewers will also drop this repo link in the comment sections of the other videos in this playlist :)
      Keep sharing your knowledge and helping us.

  • @vaisuliafu3342
    @vaisuliafu3342 3 years ago +2

    Great video, thank you very much. I believe the 'TEMP_ROLLING' correction should be:
    aq_df_imp['TEMP_ROLLING'] = aq_df_imp['TEMP'].fillna(aq_df_imp['TEMP'].rolling(3, min_periods=1).mean())
    ...where the window size is 3 because it includes the NA value itself.
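
A small sketch of the rolling-mean fill suggested above, run on a made-up series (the values and the name `temp` are illustrative, not the video's `aq_df_imp` data). With a window of 3 ending at the missing row, the NaN itself is skipped, so the fill is the mean of whatever observations remain in the window:

```python
import numpy as np
import pandas as pd

# Hypothetical temperature readings with gaps.
temp = pd.Series([1.0, 3.0, np.nan, 5.0, np.nan, np.nan])

# Trailing window of 3 rows; min_periods=1 lets the mean be computed
# from whatever non-NaN values fall inside the window.
rolled = temp.rolling(3, min_periods=1).mean()

# Only the missing positions take the rolling value; observed values are kept.
imputed = temp.fillna(rolled)
print(imputed.tolist())
```

A run of three or more consecutive NaNs would leave the later ones unfilled with this window, because the whole window is then empty.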

  • @TarekFansa
    @TarekFansa 3 years ago +1

    Very good explanation, and a very nice YouTuber!
    That was helpful!
    I am a math tutor on YouTube and am learning Python.
    Much love from Germany!

  • @arunap4086
    @arunap4086 3 years ago +3

    Actually, instead of taking the previous year's temperature value we are taking the next year's value, because the code applies "- pd.offsets.DateOffset(years=-1)". It should be either "+ pd.offsets.DateOffset(years=-1)" or "- pd.offsets.DateOffset(years=1)".

    • @AIEngineeringLife
      @AIEngineeringLife 3 years ago

      Arun, yes, that was my mistake. I should have tested the code properly :). Apart from that small change, the other concepts are the same. I have posted the correct code in another comment as well.
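
The sign issue discussed in this thread can be checked on a single timestamp. This is only an illustrative sketch; the timestamp is taken from one of the rows mentioned in the comments, and the variable names are made up:

```python
import pandas as pd

ts = pd.Timestamp("2015-02-21 12:00:00")

# Adding a negative offset moves back one year.
prev_year = ts + pd.offsets.DateOffset(years=-1)

# Subtracting a positive offset is equivalent.
also_prev_year = ts - pd.offsets.DateOffset(years=1)

# Subtracting a negative offset moves forward one year instead.
next_year = ts - pd.offsets.DateOffset(years=-1)

print(prev_year, also_prev_year, next_year)
```

The plural `years` is the relative form; the singular `year` sets an absolute year rather than shifting by one.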

  • @minudixit9956
    @minudixit9956 3 years ago

    Thanks for showing the slider graph from Plotly, was very helpful....

  • @meanuj1
    @meanuj1 3 years ago +1

    Thank you very much for making the concepts lucid and clear. Could you please post the code repo as well?

    • @AIEngineeringLife
      @AIEngineeringLife 3 years ago +1

      Here is my repo - github.com/srivatsan88/End-to-End-Time-Series

  • @anshulsingh430
    @anshulsingh430 4 years ago +1

    Very good approach to time series data. I have a question: out of all the approaches, which one do you think fits best?

    • @AIEngineeringLife
      @AIEngineeringLife 4 years ago +1

      Anshul, it depends on the data and the business understanding. These are some common approaches, and there can be more depending on the need.

  • @thechaoticneuron
    @thechaoticneuron 4 years ago +1

    Hello Sir,
    Did you deliberately impute the missing 2015 temperature values with 2016 values?
    You gave an offset of "-1" years and subtracted it from the 2015 index, resulting in 2016 values.

    • @AIEngineeringLife
      @AIEngineeringLife 4 years ago +1

      Dammalapati, oh really? I think I mistyped it then. The offset should be 1. Thanks for pointing it out. I will check it shortly.

  • @jongcheulkim7284
    @jongcheulkim7284 2 years ago

    Thank you so much. This is very helpful.

  • @dipinpaul5894
    @dipinpaul5894 4 years ago +1

    Very informative 👍

  • @sangeethaezhumalai168
    @sangeethaezhumalai168 3 years ago

    For monthly sales data, if there are no sales in a particular month, how should this discontinuity be handled? Can we impute 0 there, since no sales happened in that period?
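
If a missing month really means that nothing was sold (an assumption that has to come from the business context, not from the data), one possible approach is to rebuild the full monthly index and fill the gaps with 0. The figures and names below are invented for illustration:

```python
import pandas as pd

# Hypothetical monthly sales with March 2023 absent from the data.
sales = pd.Series(
    [120.0, 80.0, 95.0],
    index=pd.to_datetime(["2023-01-01", "2023-02-01", "2023-04-01"]),
)

# Complete month-start range from the first to the last observed month.
full_range = pd.date_range(sales.index.min(), sales.index.max(), freq="MS")

# Months that were absent get an explicit 0 instead of NaN.
sales_filled = sales.reindex(full_range, fill_value=0.0)
print(sales_filled)
```

If the month is absent because the data was not recorded, rather than because nothing was sold, a zero would understate sales and one of the imputation methods from the video may be a better fit.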

  • @aislingkeating6465
    @aislingkeating6465 2 years ago

    Hi, how would I impute the previous year's data while matching on date as well as on the latitude and longitude point?
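
One way this could be sketched is a left merge keyed on the previous-year date plus the coordinates. The column names (`date`, `lat`, `lon`, `TEMP`) and the values are assumptions made for the example, and the coordinates are matched by exact equality:

```python
import numpy as np
import pandas as pd

# Hypothetical readings for two locations in two consecutive years.
df = pd.DataFrame({
    "date": pd.to_datetime(["2014-06-01", "2014-06-01", "2015-06-01", "2015-06-01"]),
    "lat": [28.6, 19.1, 28.6, 19.1],
    "lon": [77.2, 72.9, 77.2, 72.9],
    "TEMP": [35.0, 30.0, np.nan, 31.0],
})

# Lookup keys: same location, one year earlier.
keys = df[["date", "lat", "lon"]].copy()
keys["date"] = keys["date"] - pd.DateOffset(years=1)

# Left merge keeps one row per key in the original order (assuming each
# date/lat/lon combination is unique); unmatched keys simply yield NaN.
prev = keys.merge(df[["date", "lat", "lon", "TEMP"]],
                  on=["date", "lat", "lon"], how="left")

# Fill only the missing readings with the previous-year value.
df["TEMP_IMP"] = df["TEMP"].fillna(pd.Series(prev["TEMP"].to_numpy(), index=df.index))
print(df)
```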

  • @abinavprasad6231
    @abinavprasad6231 3 years ago

    Hey, for the last code block, which substitutes nulls with the previous year's values, I'm getting a timestamp error. I checked in a lot of places but couldn't find the cause. Can you help me with it?

  • @vidyakurada4728
    @vidyakurada4728 4 years ago +1

    Thanks! That was helpful.

  • @randyluong6275
    @randyluong6275 3 years ago +1

    Why are the 1st NaN (2015-02-21 12:00:00), 2nd NaN (2015-02-21 14:00:00), and 3rd NaN (2015-02-21 20:00:00) of TEMP_PREVY filled with [1.9, 4.9, 3] at 31:20, while the 2014 values are [3, 3.5, 1.2] respectively at 30:02?

    • @AIEngineeringLife
      @AIEngineeringLife 3 years ago +1

      Randy, sorry, that was my mistake: a bug in the code. The DateOffset should be added, not subtracted, in the function. Try the code below:
      aq_df_imp['TEMP_PREVY']=aq_df_imp.apply(lambda x: aq_df.loc[x['year_month_day_hour'] + pd.offsets.DateOffset(years=-1)]['TEMP'] if pd.isna(x['TEMP']) else x['TEMP'], axis=1)

  • @lohithmunakala
    @lohithmunakala 4 years ago +1

    Thank you so much 🙌

  • @oscarelysee5077
    @oscarelysee5077 3 years ago

    When I run this: aq_df_imp['TEMP_PREVY']=aq_df_imp.apply(lambda x: aq_df.loc[x['year_month_day_hour'] + pd.offsets.DateOffset(years=-1)]['TEMP'] if pd.isna(x['TEMP']) else x['TEMP'], axis=1)
    I get the error KeyError: Timestamp('2017-03-29 00:00:00')
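
The KeyError above is what `.loc` raises when the shifted timestamp is not present in the index, for example when a missing value has no counterpart one year away in the data. One possible way to avoid it, sketched here on toy data with an assumed unique DatetimeIndex (not the video's `aq_df`), is to look up all previous-year timestamps at once with `reindex`, which returns NaN for absent labels instead of raising:

```python
import numpy as np
import pandas as pd

# Toy data: three days in 2014 and the same three days in 2015.
idx = pd.to_datetime([
    "2014-01-01", "2014-01-02", "2014-01-03",
    "2015-01-01", "2015-01-02", "2015-01-03",
])
df = pd.DataFrame({"TEMP": [np.nan, 3.5, 1.2, 2.0, np.nan, np.nan]}, index=idx)

# Timestamp one year before each row.
prev_year_keys = df.index - pd.DateOffset(years=1)

# reindex yields NaN for keys that are not in the index (no KeyError).
prev_year_temp = df["TEMP"].reindex(prev_year_keys).to_numpy()

# Fill only the missing rows with the previous-year value.
df["TEMP_PREVY"] = df["TEMP"].fillna(pd.Series(prev_year_temp, index=df.index))
print(df)
```

Rows whose previous-year value is also unavailable (here 2014-01-01) stay NaN and would need a second imputation strategy.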

  • @khushisharma-tp7ff
    @khushisharma-tp7ff 10 months ago

    Where can I find the code?

  • @atharvnikhare7113
    @atharvnikhare7113 2 years ago

    Could you please send me this notebook? It would be really helpful.
    Thanks

  • @sudheeshe1384
    @sudheeshe1384 3 years ago +1

    Can you please share the code file?

    • @AIEngineeringLife
      @AIEngineeringLife 3 years ago

      Search GitHub for srivatsan88; the code is available there in the End-to-End-Time-Series repo. Let me know if you have trouble locating the file.

  • @nehamanpreet1044
    @nehamanpreet1044 2 years ago

    Can you share the code?

  • @chandanapriya1488
    @chandanapriya1488 3 years ago

    ---------------------------------------------------------------------------
    ModuleNotFoundError Traceback (most recent call last)
    in
    ----> 1 import plotly.express as px
    2
    3 fig = px.line(delhi,x='date',y = 'pm25',title='pm25 with Slider')
    4
    5 fig.update_xaxes(
    ModuleNotFoundError: No module named 'plotly'
    I'm getting this error even though I ran 'pip install plotly'.

    • @AIEngineeringLife
      @AIEngineeringLife 3 years ago

      Are you using Colab or a local notebook? If local, can you check whether it is pointing to the right Python environment?
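
A quick way to run the environment check suggested above from inside the notebook is sketched below. It only reports which interpreter the kernel uses and whether `plotly` is importable from it; the variable name `plotly_found` is illustrative:

```python
import importlib.util
import sys

# The interpreter the notebook kernel is actually running.
print("Interpreter:", sys.executable)

# find_spec returns None when the package is not visible to this interpreter.
plotly_found = importlib.util.find_spec("plotly") is not None
print("plotly importable:", plotly_found)

if not plotly_found:
    # Installing through this exact interpreter avoids the wrong-environment problem.
    print(f"Try: {sys.executable} -m pip install plotly")
```

If `pip` on the command line belongs to a different Python installation than the kernel, the package lands in the other environment, which would match the symptom described in the comment.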

  • @vashistnarayansingh5995
    @vashistnarayansingh5995 4 years ago +1

    Sound quality is not good

    • @AIEngineeringLife
      @AIEngineeringLife 4 years ago

      Sorry about that. Sometimes the recording works against me even though I try my best to fix it. Could you please watch this video without a headset so that the noise is less noticeable? I am trying to fix it permanently.

    • @vashistnarayansingh5995
      @vashistnarayansingh5995 4 years ago

      @AIEngineeringLife No worries, the audio quality of the other videos is good. Thanks for the effort you put in for us.