Monthly Sales Forecast with Seasonality and Trend - EXCEL regression with dummy variables

Поділитися
Вставка
  • Опубліковано 25 лип 2024
  • Welcome to our comprehensive tutorial on Monthly Sales Forecasting using Excel Regression with Dummy Variables, where we'll guide you through the intricacies of forecasting sales with both seasonality and trend. Whether you're a business analyst, data enthusiast, or just looking to enhance your Excel skills, this video is your gateway to mastering the art of accurate sales predictions.
    📊 In this step-by-step tutorial, you'll learn:
    1️⃣ The fundamentals of regression analysis in Excel, leveraging the powerful Data Analysis Tool Pack.
    2️⃣ How to effectively capture and incorporate seasonality into your sales forecasts using dummy variables.
    3️⃣ The importance of recognizing and accounting for trends in your data to make more accurate predictions.
    4️⃣ A deep dive into Winter's Smoothing Method, demystifying this widely-used forecasting technique.
    5️⃣ Practical tips and best practices to ensure your forecasts are reliable and actionable.
    Sales forecasting is a critical aspect of business planning, helping you make informed decisions on inventory management, resource allocation, and overall business strategy. Whether you're dealing with historical sales data, financial planning, or simply want to gain a deeper understanding of predictive analytics, this tutorial will provide you with the knowledge and skills you need.
    ===== CHAPTERS =====
    2:01 - Create the Dummy Variables
    3:31 - The Regression tool in Excel's Analysis Tool pack
    7:01 - Create the Regression Model
    10:00 - Visualize with Excel Chart
    #ExcelRegression #SalesForecasting #DataAnalysis #DummyVariables #WinterSmoothing #BusinessAnalytics #ExcelTutorial #DataScience #TrendAnalysis #SeasonalityForecasting #BusinessIntelligence
  • Наука та технологія

КОМЕНТАРІ • 90

  • @johnshepherd007
    @johnshepherd007 4 дні тому

    thank you very much, this really helped me a lot!

  • @EricD_192
    @EricD_192 Рік тому +4

    Great content explained in detail! Amazing!

  • @chilarmah
    @chilarmah Рік тому +1

    Great work! Solved a problem I have been working on for days.

  • @mariusalexandrugogorita
    @mariusalexandrugogorita 3 місяці тому

    Best video ever! Thank you very much!

  • @merjenorazmammedova6516
    @merjenorazmammedova6516 7 місяців тому

    Thank you for the video, it is really helpful!

    • @Data.Analytics.Central
      @Data.Analytics.Central  7 місяців тому +1

      Thank you Merjen for the appreciation. I am glad you have found it useful!

  • @Tiramisu2024
    @Tiramisu2024 3 місяці тому

    The best lecture, Thank you so much!

  • @JohnKamauNjenga
    @JohnKamauNjenga Рік тому +2

    Simple and elegantly presented. Was working on a forecast and other descriptions online were abhorrent to say least. The error range was HUGE, but thanks to you my standard error reduced to 2%. Asante Sana!!!🤗

  • @EkundayoOnifade
    @EkundayoOnifade 2 місяці тому

    Thanks for the video was very useful.

  • @ahuhokay
    @ahuhokay 4 місяці тому

    u are the best bro thank you for the help🙏

  • @hafizelbadawi5409
    @hafizelbadawi5409 6 місяців тому

    I would like to thank you so much

  • @sacca9156
    @sacca9156 6 місяців тому

    Hello Sir
    Where can I get the Excel worksheet to follow your presentation. Thanks

  • @adityavedam1174
    @adityavedam1174 27 днів тому +1

    Could you share the data file for Practice please ?

  • @davidjosevarelagarcia7011
    @davidjosevarelagarcia7011 8 місяців тому

    Great video, is very usefull, thanks. i have a question, why dont use december when you transpose the months?

    • @Data.Analytics.Central
      @Data.Analytics.Central  8 місяців тому

      Hi David, thanks for the appreciation!
      We do not use December because the 12-th category is captured by the intercept, and is specified when all the 11 dummy variables are all set to zero.
      This is the general rule: one variable less then the number of categories. If you have just 2 categories (yes and no) you put just one dummy (yes).

  • @sisayzewde1728
    @sisayzewde1728 6 місяців тому +1

    I think residual should be zero or close to zero! right? but in your case it is too much; so, can we say your forecast is good?

  • @MichaGouszka
    @MichaGouszka Рік тому

    Thank you for great content!
    What if we would like to add another variable - i.e., a change in product prices (let's assume cyclical price increases, as well as occasional promotions, for example, for a month of time)? How would your model then need to be modified?

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      Hi Michal, thank you for the question!
      If you have a one-off event, such as a price promotion, that you want to include in your Holt-Winters exponential smoothing model, there are a few different approaches you can take:
      1. Include the event as a predictor in the model: If the event has a clear effect on the time series being forecasted, you can add it as an additional predictor in the model. For example, if you are forecasting sales and you have data on the price of the product, you can add a binary predictor to the model to indicate whether the promotion is occurring in a given month.
      2. Adjust the trend component to account for the event: If the event has a more complex effect on the time series, you may need to adjust the trend component of the model to take it into account. For example, if the promotion is expected to have a significant impact on sales, you could adjust the trend component to reflect this.
      3. Incorporate the event into the seasonality component: If the event has a seasonal effect on the time series, you can incorporate it into the seasonality component of the model. For example, if the promotion is expected to have a particularly strong impact on sales in a particular month, you could adjust the seasonality component for that month to reflect this.
      I hope this helps!

    • @HessaM-bc6tn
      @HessaM-bc6tn Рік тому

      This’s really useful thank you! Could you please do a tutorial on that?

  • @caseykirkup
    @caseykirkup 2 місяці тому

    Amazing

  • @emilytran84
    @emilytran84 2 місяці тому

    Thank you for your great video! Do you think this work correctly with 12 months data historical and some month value = 0 as well ? I look forward to hearing your advices.
    Then how can we apply for multi sku products.

    • @emilytran84
      @emilytran84 2 місяці тому

      Hello, I look forward to hearing from you :)

  • @user-cb1sc7se8g
    @user-cb1sc7se8g 2 місяці тому

    If I have to do the same thing but on a day level
    How can I achieved that?

  • @Hanspeterbretti
    @Hanspeterbretti Рік тому

    Is is usable for other figures like ebt, ooe, etc?

  • @roshandhumal1193
    @roshandhumal1193 6 місяців тому +1

    Sir, could you please explain us why we have to lock 🔒 intercept, please explain and please explain me about p value.

    • @Data.Analytics.Central
      @Data.Analytics.Central  5 місяців тому

      Hello, we lock the Intercept because in the equation the intercept is only one, while the coefficients are multiple. Please read here about the p value:
      www.investopedia.com/terms/p/p-value.asp

  • @marcelporcescu4675
    @marcelporcescu4675 22 дні тому

    But how can we do daily. I tried but it show only 16 variable maximum, but we have 30 days

  • @aapriyanka8323
    @aapriyanka8323 8 місяців тому

    Excellent sir. Can we use the same process for 5 year forecast. Please do reply sir

  • @sisayzewde1728
    @sisayzewde1728 6 місяців тому

    and also, why you used three years data? what will be wrong if I use two- or four-years data?

  • @kiamarieamedao1398
    @kiamarieamedao1398 8 місяців тому

    Hello! Don't know what I did wrong but when I tried to use the Regression tool in Excel's pack, it said "The number of rows and columns in X range cannot be the same."

    • @Data.Analytics.Central
      @Data.Analytics.Central  8 місяців тому

      Hello, these can be the causes:
      Case 2: The number of rows is less than the number of x-columns
      It is not statistically valid for the number of rows to be less than the number of x (variable) columns. The number of rows of data must be larger than the number of columns of data (x-columns plus y-columns).
      Case 3: You specify a zero constant
      Do not specify a zero constant (b=0) in the function.

  • @sayednab
    @sayednab Рік тому +1

    would you mind explain, why to exclude the last month on your dummy variable?

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      Hi Nab, thanks for the question.
      That is because the 12-th category is captured by the intercept, and is specified when all the 11 dummy variables are all set to zero.
      This is the general rule: one variable less then the number of categories. If you have just 2 categories (yes and no) you put just one dummy (yes).

    • @sayednab
      @sayednab Рік тому +1

      @@Data.Analytics.Central I see. It makes sense. thanks for the explanation sir. could we use this model for stock price as well?

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      @@sayednab It is available for any pattern that has seasonality.

  • @sushmabasnet4593
    @sushmabasnet4593 Рік тому

    What about daily forecast, how do we create matrix?

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      Hi Sushma,
      You must create separate sets of dummy variables for different seasonality:
      - To capture day of the week seasonality, create 6 dummy variables.
      - To capture day of the month seasonality, create 30 dummy variables
      - To capture month of the year, create 11 dummy variables.

  • @rahuldhali7681
    @rahuldhali7681 Рік тому

    Hey, what if we have to forecast yearly sales. How many years would we need to enter as the dummy variable ?

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      Hi Rahul, the number of dummy variables is equal with the number of seasons minus 1. So if your seasonality is 5 years, you use 4 dummy variables.

    • @nurul.alifiaa
      @nurul.alifiaa 11 місяців тому

      @@Data.Analytics.Central Hello, i want to ask if i'm going to forecast annual prices for 6 years (2023-2029) so the row for the dummy variable should be (2023-2028), right? pls kindly enlighten me, thank you very much!

    • @Data.Analytics.Central
      @Data.Analytics.Central  11 місяців тому

      @@nurul.alifiaa Hi, Nurul. The number of dummy variables are related to the number of seasons that you have in your data. So, if you have 4 seasons, you use 3 dummies. And then you can extrapolate on how many years you want. In my model I have forecasted only one extra year vs. the actual data, but we can drag down the formula for another 6 years to forecast.

    • @nurul.alifiaa
      @nurul.alifiaa 11 місяців тому

      @@Data.Analytics.Central aaaa so if i have datas from jan 2013-juli 2023, i can use jan-nov (11) dummies? i mean i can use the same formulas like yours (?) anyway thank you so much for your answer!!

    • @Data.Analytics.Central
      @Data.Analytics.Central  11 місяців тому

      @@nurul.alifiaa First you have to understand how many seasons you have. If it is 12 months, you will have 11 dummies, but but you need histroric data for at least 12 months, so you can train the model (it is not enough just jan23-july23

  • @KhinMohMohSoe
    @KhinMohMohSoe 10 місяців тому

    why time period t is used ?.

  • @ericamartin98
    @ericamartin98 Рік тому

    Hello. When using this method, Excel is showing the value function but I don't know what I did wrong. Any idea?

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      Hello. It maybe from the relative/fixed cells references in the formula Mmult(). Please check what column/rows must be fixed and what relative. I hope this helps.

    • @tanvisharma924
      @tanvisharma924 Рік тому

      SAME , DID U FIND THE ANSWER?

    • @stefan-kk8ln
      @stefan-kk8ln 9 місяців тому

      Yes, I spent 15 minutes confused about why I got it as well, and it's because you didn't fill out the binary numbers on the forecast sections, but only on the input before forecast that is

  • @Mavis_1991
    @Mavis_1991 8 місяців тому

    When I used the Regression function, the numbers in the P-Value is #NUM! Any idea why?

    • @Data.Analytics.Central
      @Data.Analytics.Central  8 місяців тому

      Hello, please check these:
      support.microsoft.com/en-us/office/how-to-correct-a-num-error-f5193bfc-4400-43f4-88c4-8e1dcca0428b
      and these:
      The "NUM!" error in regression analysis usually occurs when there is a problem with the data in the cells that are being used for the analysis. This error message indicates that Excel is unable to perform the calculation because it is encountering a value that is not a number.
      There are a few reasons why you might be seeing this error message in your regression analysis. One possibility is that there are blank cells or cells with text in the range of data that you are using for the analysis. Another possibility is that there are cells with errors, such as #DIV/0! or #VALUE!, in the range of data.
      To fix this issue, you can try the following steps:
      Check the range of data that you are using for the analysis and make sure that there are no blank cells or cells with text in the range.
      Check for any cells with errors in the range of data and correct them if necessary.
      If you have deleted some fields of data, make sure that you have not accidentally deleted any cells that are being used in the analysis.
      Check the formatting of the cells in the range of data and make sure that they are all formatted as numbers.
      If none of the above steps work, try copying the data to a new worksheet and performing the analysis again.

    • @Mavis_1991
      @Mavis_1991 8 місяців тому

      @@Data.Analytics.Central I still get #NUM! on my P-values.

  • @roshandhumal1193
    @roshandhumal1193 6 місяців тому

    Sir what If we have Jan to March instead of Jan to Dev
    Because of I have value from Jan to March and when I am doing the method the out is not showing right could you please give me a hint

    • @Data.Analytics.Central
      @Data.Analytics.Central  3 місяці тому

      Hello, I hope you follow the rule: one variable less then the number of categories, so in your case of 3 months you use 2 variables

  • @nikomou3426
    @nikomou3426 8 місяців тому

    so "t" is for trend and "jan" thru "nov" is for seasonality. am i correct

  • @ramchandranemade5676
    @ramchandranemade5676 Рік тому

    How we can analyse the forecast with second order linear function with seasonality.

    • @ramchandranemade5676
      @ramchandranemade5676 Рік тому

      Possible?

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      Hello, can you give more context to your question?

    • @ramchandranemade5676
      @ramchandranemade5676 Рік тому

      I have a data which follows the quadratic graph. 2nd order linear equation fit the data. If I apply this trend+seasonality method starting forecast (from period 1 to 6) getting negative.

    • @ramchandranemade5676
      @ramchandranemade5676 Рік тому

      Above method follow single order trend. is there same method which consider the period square term also.(2nd order)

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      @@ramchandranemade5676 To clarify, the dummy variable method is typically used for linear regression models to represent categorical variables as binary variables (0 or 1). However, it is not directly applicable to quadratic models.
      In a quadratic regression model, you can still include categorical variables by converting them into numeric form (e.g., using integer encoding) and incorporating them as independent variables with quadratic terms if necessary. For example, if you have a categorical variable with three levels (A, B, C), you can use two numeric variables (e.g., 0 and 1) to represent this variable and then include both linear and quadratic terms for these numeric variables in the model.
      So, while the dummy variable method is not directly used in quadratic regression models, you can still include categorical variables in a modified numeric format and consider their linear or quadratic effects in the model.

  • @Seftehandle
    @Seftehandle 7 місяців тому

    P value error #num and some very low coefficients for significance f 0.8465 f 0.56 p greater than 0.05. please do similar videos on different. Stat methods

  • @tanvisharma924
    @tanvisharma924 Рік тому

    why dummy variable was used

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому

      Dummy variables are useful because they allow us to include categorical variables in our analysis, which would otherwise be difficult to include due to their non-numeric nature.

  • @hipernet
    @hipernet 11 місяців тому

    why don't you use December?

    • @Data.Analytics.Central
      @Data.Analytics.Central  11 місяців тому

      Hello, thanks for the question.
      That is because the 12-th category is captured by the intercept, and is specified when all the 11 dummy variables are all set to zero.
      This is the general rule: one variable less then the number of categories. If you have just 2 categories (yes and no) you put just one dummy (yes).

    • @hipernet
      @hipernet 11 місяців тому

      thanks@@Data.Analytics.Central .
      All my p values are bigger than 0.05, even my f and r2 are good. Then I cannot use this way, right?
      finally im using =forecast.ets()

  • @vijaymane34
    @vijaymane34 4 місяці тому

    Disaponted. You didn't tell how to get trend (series) out from the time series.

    • @vijaymane34
      @vijaymane34 4 місяці тому

      Intercept + time trend * period number (1 to n)

  • @JayJay-fz7sw
    @JayJay-fz7sw 8 місяців тому

    Its giving a biased forecast line

    • @Data.Analytics.Central
      @Data.Analytics.Central  8 місяців тому

      Hi, I hope your data has a linear relationship between the independent variables and the dependent variable.

  • @catalin.ardeleanu
    @catalin.ardeleanu Рік тому +1

    interesting example. Science based :) What do you think about latest forecast functions included in the "pack" =FORECAST.ETS.SEASONALITY()?

    • @Data.Analytics.Central
      @Data.Analytics.Central  Рік тому +1

      Dear Catalin, Thank you for the appreciation!
      We can arrive at the same results as the ones I did in the video by simply using the Excel function FORECAST.ETS().
      For example the forecasted value of 01-Jan-18 (380.432$ in cell D38) can be calculated like this: =FORECAST.ETS(A38,$C$2:$C$37,$A$2:$A$37,1,1,1) and the result would be 378.675$, very close to the manual method.
      The related function FORECAST.ETS.SEASONALITY() is just telling you what is the seasonality that the previous function FORECAST.ETS() was based on. If we aply it to our case:
      =FORECAST.ETS.SEASONALITY(C2:C37,A2:A37) we get the result 12, just what we have expected (12 months).
      I hope it helps!