Statistics 101: Multiple Linear Regression, Two Categorical Variables

Поділитися
Вставка
  • Опубліковано 11 січ 2025

КОМЕНТАРІ • 114

  • @jonathansiegel2818
    @jonathansiegel2818 3 роки тому +1

    Brandon Folt is absolutely the best... the very best... teacher in fundamental statistics. What a gift you are, Brandon, to the world!

    • @BrandonFoltz
      @BrandonFoltz  3 роки тому

      Very kind of you. I appreciate your kind words. However learners like you are the true gift. Pay it forward when you can.

  • @americanbluediamonds
    @americanbluediamonds 5 років тому +10

    I am also a KW real estate agent, your example is so excited to me and I can use to explain to my clients why certain area the price increases subtantially.

  • @knightsky8378
    @knightsky8378 8 років тому +13

    I am in the MS program in Applied Economics. Your videos has helped me so much, especially in further understanding econometrics. This is some really great stuff right here. Thanks again!

  • @akanlunadewuyi2588
    @akanlunadewuyi2588 7 років тому +8

    Good morning sir, I finish college this year read statistics as option. I still do not understand statistics until i found your videos. It's really great. Thanks for good work and your passion to help. God bless you sir.

  • @beckyhuber8137
    @beckyhuber8137 6 років тому

    I am a graduate student in a stat class. I have followed Brandon Folz video as my resources. Thank you.

  • @juanaa.2111
    @juanaa.2111 6 років тому +7

    Thank you for sharing your knowledge. I am using the IBM SPSS, but the values and meanings are the same. Thank you for including the interpretation of the outputs, that really brings it all together.

  • @JoshFlorii
    @JoshFlorii 7 років тому

    SUCH a good video. Spent like 5 minutes googling this subject only to turn to youtube, and you have once again not disappointed.

  • @andresparradrumaths3651
    @andresparradrumaths3651 7 років тому

    great! i'm a teacher from Colombia and your videos are very useful for my classes...thank you!

  • @Nico-jc7zr
    @Nico-jc7zr 3 роки тому

    Thanks for the video. I have a masters in economics and I have to get into regressions again for work. This is a nice reminder of the basic concepts. Nicely illustrated too.

  • @zhehabeshascience3066
    @zhehabeshascience3066 3 роки тому

    you are the best teacher i understod for statistics via youtube

  • @cathycirina-chiu6972
    @cathycirina-chiu6972 3 роки тому

    i so appreciate your encouraging intro and wonderful teaching style

  • @BrandonFoltz
    @BrandonFoltz  10 років тому +6

    *NEW* video is up! Part B will be up later this evening (US EST). Thank you all so much!

    • @SuzanneAmsalem
      @SuzanneAmsalem 10 років тому

      Thank you:))

    • @pascaleyram
      @pascaleyram 10 років тому

      Could you make a short video on logistic regression analysis?

    • @BrandonFoltz
      @BrandonFoltz  10 років тому +2

      I may have the first Logistic video up tonight :)

    • @pascaleyram
      @pascaleyram 10 років тому

      ***** Thanks a lot

    • @madhurjyadeka5569
      @madhurjyadeka5569 4 роки тому

      Hello Sir, I'm estimating apartment prices based on 5 factors and in your earlier videos where you took 3 independent variables there were
      7 regression analysis.
      So in my model that I'm about to build where there are 5 variables...
      Do I need to take the variables and analyse them :
      1 at a time
      2 at a time
      3 at a time
      And so on.. untill I find the best result ?

  • @vanessachen2330
    @vanessachen2330 5 років тому

    Watching your videos help me to improved my grade in Stat class, thanks for your passion of helping and providing such great series on Statistics.

  • @getitdone913
    @getitdone913 2 роки тому

    I know this is an old video, but thank you so much for uploading it. It's so helpful!

  • @lisameretekristensen3181
    @lisameretekristensen3181 4 роки тому

    Dear Brandon, I want to thank you for providing such excellent explanations and examples of statistical scenarios and how to go about analysing and prepping analysis. The latter is especially tough to find in textbooks in my experience. Thank you.

  • @LoizidesGeorge
    @LoizidesGeorge 5 років тому

    Brandon
    Thanks a lot for all your the videos in the statistics series.
    You really saved me hours and days!
    Whenever you are in Cyprus I will devote 1-2-3 days to you + a an old house in the mountains of Marathasa, returning your courtesy to share them!
    Just let me know 1-2 hours before you arrive at the airport!
    Regards
    Γ
    [ Loizides George ]

  • @ActionSportsExtreme
    @ActionSportsExtreme 8 років тому +4

    I'm really happy I stumbled upon you!

  • @mkme2358
    @mkme2358 3 роки тому

    Great videos very detailed! For better learning... I would suggest tests questions and answers.

  • @rajkumar-xr5im
    @rajkumar-xr5im 9 років тому

    Great ..Good service to society ..Please keep doing..

  • @rasyimahramli5b125
    @rasyimahramli5b125 4 роки тому

    Such a great video with the clear explanation. Thank you for your good work.

  • @wendypaul3910
    @wendypaul3910 3 роки тому

    Still saving lives in 2021, thank you Sir for blessing us with your knowledge and time☺

    • @BrandonFoltz
      @BrandonFoltz  3 роки тому +1

      So nice of you. Not sure about saving lives but I want you to have the chance to make the one you want.

    • @wendypaul3910
      @wendypaul3910 3 роки тому

      @@BrandonFoltz literally true, passed my quantative statistics exam, I do not need to resit, could not do this without you, thank you is not enough. Please be encouraged and please continue to produce these amazing videos. 👏🏾👏🏾❤️

  • @shreyanshishukla6309
    @shreyanshishukla6309 7 місяців тому

    I found very difficult to understand statistical use in research. your video makes it easy to understand the basic concepts. Pease Keep Uploading videos

  • @mecha_studio_official
    @mecha_studio_official 7 років тому +4

    Hi Brandon! Thanks for the excellent videos. You are a great teacher! Just a suggestion. It would be good if you can provide the datasets for us to do our own practice using statistical softwares adopted by our college.

  • @fabio7621
    @fabio7621 10 років тому +3

    Great video Brandon! Please, keep it up, you are helping so many people, more than you can imagine! :-)

  • @mostafalotfi1818
    @mostafalotfi1818 4 роки тому

    Great tutorials. Enjoyed learning from your videos a lot.

  • @mesfin_bikilo
    @mesfin_bikilo 8 років тому

    Thank you very much i found the tutorial very important and i like the way you explain, easly understood for bigginers like me . Thank you keep it up sharing is caring !!!

  • @sofieseymour481
    @sofieseymour481 4 роки тому

    It looks like in the surface plot at 13:20, the directions are labeled incorrectly. Shouldn't it be east, north, south, west? If not, could you explain, because this doesn't seem to match the data shown in the scatterplot at 9:33.

  • @darshitparkhiya1223
    @darshitparkhiya1223 6 років тому +4

    Thank you sir for giving such a great videos, can you please provide 100 row data which you have used in Video

  • @florramirez2740
    @florramirez2740 Рік тому

    Your videos are amazing, Im love in it. Im actually seeing since I have a job to do, however, I would like to know the machinary work in order to reproduce the graphics and others stuffs. I will appreciate so much if you can teach us. Very cool videos!!

  • @bunyaweepunch2785
    @bunyaweepunch2785 4 роки тому

    Thanks for your video. They help me a lot!

  • @TheCablebill
    @TheCablebill 9 років тому

    Thanks for the videos. I am finding them helpful.
    Around the 9:30 mark of this one, the scatter plot is showing regression lines for sqft~price for each of the four categories. The visual patterns clearly indicate that a generalized function to predict price from ft^2 and region is not represented well by the simple linear template in use.
    Specifically, their is a potential missing multiplier term. the region effect could be expressed as a modifier of the slope of the regression line for sqft~price in addition to affecting y-intercept. In other words, some region factor could also be included with the existing coefficient for the sqft term. I believe this would generate a more effective prediction model for this circumstance.
    So I suspect that I'm not the first to note this possibility, and my question is: What am I talking about? Is there an established technique for the type of curve-fitting I describe, and what is it called?
    Thanks again for a great lecture series.

    • @BrandonFoltz
      @BrandonFoltz  9 років тому +1

      TheCablebill An interaction between region and square-footage could be analyzed using other methods such as ANOVA / ANCOVA where one or more of the independent variables are nominal. There are also some coding techniques used to examine interactions. That is a bit beyond my goal for the video but a good point!

  • @kayyalo9621
    @kayyalo9621 9 років тому +1

    Your videos are very much helpful

  • @simratahluwalia965
    @simratahluwalia965 7 років тому

    Great work Brandon .. keep it up....

  • @jessecuster7246
    @jessecuster7246 9 років тому

    Just awesome. Thank you for you work.

  • @ProfessorJoaoArantes
    @ProfessorJoaoArantes 3 роки тому +1

    Dear Brandom, could you please make the dataset available for download? Thank you!

  • @anooppaul1
    @anooppaul1 8 років тому

    Thank you for Videos. It is very helpful to me

  • @VSP4591
    @VSP4591 2 роки тому

    Well done. Thank you.

  • @kobic8
    @kobic8 3 роки тому

    again, so HAPPY I came across your page here, I do have a question though, what if the dependent variable is categorical, and has more than one option e.g. direction (N /E / W / S) ?

  • @nkvd1000
    @nkvd1000 9 років тому +6

    Dear Mr Foltz
    I would be grateful if you could post a link to the dataset that you used in the example.
    Many Thanks

    • @BrandonFoltz
      @BrandonFoltz  9 років тому +2

      +nkvd1000 Hope to do that soon on my blog.

    • @helen4805
      @helen4805 5 років тому

      Hi Brandon, did you ever post this dataset? I would love to use it in my program. i am following along programming everything in Matlab.

    • @adityaravi1876
      @adityaravi1876 5 років тому

      @@BrandonFoltz , can you please share the link of the data sheet... it will help us to practice in Minitab.

  • @1985dv
    @1985dv 6 років тому +4

    with example of the region with 4 variables how do you know which one to use. In this case you omitted East, how did you decide on that? confused on what to use and what to take out

    • @mahendrabhattarai5903
      @mahendrabhattarai5903 5 років тому

      Same confusion here. Did you find the reason now?

    • @RodrigoTechador
      @RodrigoTechador 4 роки тому

      It's completely arbitrary. You can choose whichever category you want to be your reference category. The results will be the same.

  • @nehabhatt1285
    @nehabhatt1285 4 роки тому

    Great Video.

  • @sheebee8398
    @sheebee8398 7 років тому

    Outstanding! Thank you!

  • @sonallagad284
    @sonallagad284 2 роки тому

    How do you plot the scatter plot of the exemplary schools where the plot shows the red and blue colors for the diff values.

  • @kasunpathirana9410
    @kasunpathirana9410 4 роки тому

    good explanation

  • @riddhirekhawat
    @riddhirekhawat 8 років тому

    It would be really helpful if you upload some videos on sample survey. SRS, stratified, double sampling and all such.

  • @lettersforkumar
    @lettersforkumar 5 років тому

    how does the scatter plot look like if there are more than 2 continious predictor variables? in your example if we want to add age of house as predictor variable, where doest it lie on the plot?

  • @ayiteajavon1894
    @ayiteajavon1894 10 років тому

    Very helpful. Thank you!

  • @vanishingtears
    @vanishingtears 9 років тому

    Dear Brandon Foltz
    I have a data set of sales, advertisement then dummy variables (years, months and quarters)..How to find out which month was the most and least successful? what is annual growth, quarterly growth?
    Please provide help as to what approach we should use when we have such timeseries element incorporated in the cross section data in the form of time as dummy variables? How would our interpretation of the model will differ?

  • @divneetbagga8258
    @divneetbagga8258 7 років тому +1

    Hi Brandon!!
    I wish to run the problem given in the video by myself so that i can tally the results.
    For the same, I would require the entire data
    In the video there are only 15 entries given.
    Thanks!

  • @leahhazanovich2556
    @leahhazanovich2556 3 роки тому

    Thanks! This is super cleae!!

  • @LukasStammler
    @LukasStammler 9 років тому

    thanks a lot for these superb lectures. I do all your examples in R and now I ask, if it is possible to get the home price dataset for the lecture Multiple Regression Part 5.

  • @janaria1985
    @janaria1985 8 років тому +3

    okay I understand the concept of dummy variable but with example of the region with 4 variables how do you know which one to use. In this case you omitted East, how did you decide on that? confused on what to use and what to take out

    • @brunofischer808
      @brunofischer808 6 років тому +1

      Hello. You actually do not care. Pick the combination you prefer. When you getto interpret the results, you will get to the same conclusion, independently from the set of dummies you selected.

  • @moom-sey
    @moom-sey 9 років тому

    Thank you so much!!! your video help me so much :)

  • @danyouse409
    @danyouse409 7 років тому +1

    Did you post a link to the dataset? If so, I cannot find it. Thank you!

  • @iamshauno
    @iamshauno 7 років тому

    Hi! Is it possible to do regression using an independent variable with 7 units and a dependent variable with 20 units? Or should both variables have the same number of units?

  • @ThePmac14
    @ThePmac14 4 роки тому

    Thanks Brandon

  • @YogeshprabhuJ
    @YogeshprabhuJ 10 років тому

    Awesome... You should do some on logistic regression too..

    • @BrandonFoltz
      @BrandonFoltz  10 років тому +1

      Thanks! Logistic regression is my next topic actually 😃

  • @tanmaybhayani
    @tanmaybhayani 4 роки тому +1

    But what to do if we dont know the number of categories, and the number of categories is not fixed. eg:-model of a car, to predict car prices.

  • @vinopavankumarathasan6736
    @vinopavankumarathasan6736 7 років тому

    Hi I want to do a statistical analysis with two independent variables (IV) and both are categorical and dependent variable is interval. I have chosen the multiple regression. Guide me whether my choice is right...

  • @郭巧生-w4o
    @郭巧生-w4o 5 років тому +1

    what if the hause is at southwest or in the middle of the town?

    • @MostHitMan
      @MostHitMan 5 років тому

      郭巧生 make more variables SW NW ES etc

  • @sullainvictus
    @sullainvictus 9 років тому

    Great videos, Brandon! They are very informative and easy to understand.
    I have a question regarding dummy variables. The method you outline in this video and the previous one (Part 4) seem to deal with changing not the slope of the line, but the intercept. What if you have a situation where a categorical variable changes not just the position of the line (the intercept) but also the slope of the line. For example, what if the relationship between sqft and price were somehow a negative relationship in exemplary school districts? Would this method capture that effect?

  • @khaledabdu2
    @khaledabdu2 8 років тому

    can you do a video on confounding and interaction for medical examples?

  • @srijaP1
    @srijaP1 5 років тому

    Hi, I have around 42 independent variable (genotypes )and 8 dependent variable (cognition score) with age and gender as co variate. However my dependent variable are positively correlated so I have done PCA and have 2 component now. What kind of statistical analysis I should do?

  • @yvonnessy
    @yvonnessy 9 років тому +1

    Thanks for the videos! I'm learning a lot.
    I have a question here.. Can I do a linear regression if the dependent variable is a categorical variable? If yes, how can it be done??

    • @BrandonFoltz
      @BrandonFoltz  9 років тому +2

      Yvonne Szeto If the dependent variable is categorical regular multiple regression cannot be used. It will require logistic regression (I have a video series on that as well). There is binary, ordinal, and multinomial logistic regression depending on the structure of the dependent variable. My videos are about binary logistic regression. Hope that helps!

    • @yvonnessy
      @yvonnessy 9 років тому

      ***** Thanks for the quick reply! I will check those videos out. :)

    • @Lbanin
      @Lbanin 7 років тому

      Hi Brandon, would you paste here the title of the video you refer above as I'm currently running an analysis over categorical dependent and independent variables (all variables are categorical). Thank you so much!

  • @bensonmoima6872
    @bensonmoima6872 2 роки тому

    Dear Sir, thanks for the great work. Doing my bachelor thesis in Germany and this video has really come in handy. One problem though, I have failed to plot this data exactly like you did on an excel scatter plot. Sgft on x-axis & prices on y-axis but I have failed to implant the categorical variables (yes/no) . How did you do that sir? Did you use excel for it as well?

  • @hangnisa2176
    @hangnisa2176 6 років тому

    Could I know how to calculate b2 coefficient in multiple regression?

  • @dinethprabash1001
    @dinethprabash1001 7 років тому

    Thanks, if you can number your videos (index number) it would be more helpful. After downloading, its hard to figure out which video comes first.

  • @sibghaafzal247
    @sibghaafzal247 5 років тому +1

    Hi! Thank you for this helpful video! I am a psychology stats student and have a question with regards to the way that variable are coded, as I am a little confused. Am I right in thinking that if you code something as 1, then in the regression equation, it will mean that the outcome is always higher for that variable in comparison to the variable coded as 0, which essentially means we can influence our findings based on how the variables are coded? I hope this can be clarified, as I feel like this is not necessarily the case but I am not sure why - thank you in advance!!

    • @nabajyotidey5613
      @nabajyotidey5613 Рік тому

      So basically ordinal data instead of nominal data.Your observation is good.

  • @burhankl2331
    @burhankl2331 9 років тому

    ***** Hi, one quick question , is it possible to perform regression analysis on ONLY categorical variables? in other words-can one perform a regression analysis if all the independent variables are categorical?

    • @xuchuan6401
      @xuchuan6401 2 місяці тому

      Yes, and this will be equivalent to ANOVA

  • @vardeh1
    @vardeh1 10 років тому

    Thanks Brandon. In the equation, how do we calculate the constant?

    • @BrandonFoltz
      @BrandonFoltz  10 років тому +1

      vardeh1 No problem! I go over the calculation and interpretation in Part B which I should have uploaded tomorrow evening/night. So check back then. Thanks for watching! :)

  • @varundeshpande3674
    @varundeshpande3674 4 роки тому

    Sir, as we haven't added any variable for East region how will we account for in any house is situated in the East?

    • @xuchuan6401
      @xuchuan6401 2 місяці тому

      Every other region will be compared to East. East is the baseline

    • @varundeshpande3674
      @varundeshpande3674 2 місяці тому

      @xuchuan6401 helped 👌

  • @Big-guy1981
    @Big-guy1981 5 років тому

    Hi. Hi can we apply these great videos to predicting the outcome of a sports event, say a baseball game?

    • @BrandonFoltz
      @BrandonFoltz  5 років тому +1

      Hi! For a win/loss prediction you could utilize Logistic Regression since your outcome is binary. The reality is most betting organizations already do this. Once they figure out the probability they then adjust the payout odds. So I always just recommend going with the conventional wisdom unless you know something everyone else does not. :)

  • @yegonb
    @yegonb 4 роки тому

    I have enjoyed your lessons, but I could not reach you on tweeter under your handle. I am doing a research and I would like to get your perspective on a few things.

  • @gauravms6681
    @gauravms6681 6 років тому

    sir can u list the books which helped you in these videos(machine learning and statistics) please it would be very helpful

    • @cococnk388
      @cococnk388 2 роки тому

      Statistics for business and economics by David Anderson , Business Analytics by Jeffrey D.Camm , The Hundred-Page Machine Learning Book by Andriy Burkov
      Brandon shared this books in his recent live on youtube.
      Hope it helps.

  • @utopiasolutions8797
    @utopiasolutions8797 5 років тому

    In your next video the equations all have the same slope. How is it possible?

  • @polomarco1256
    @polomarco1256 4 роки тому

    How to know the minimum amount of sample from huge population i.e. a nation?

  • @tiannadermody4761
    @tiannadermody4761 2 роки тому

    Good explanations but it would be good if you could provide the code for these scatterplot outputs

    • @BrandonFoltz
      @BrandonFoltz  2 роки тому

      Hello! Thanks for watching. The scatter plots were actually done in Minitab or JMP (It's been a while sorry) so there is no code to share. They are both traditional stats software packages.

  • @8625gaurav
    @8625gaurav 9 років тому

    Thanks a lot...

  • @adazeeviohana3495
    @adazeeviohana3495 9 років тому

    thanks for the clear videos! but what about the subtitles, there are tons of mistakes, seems that whoever did them did not really listen to what was said and made no effort to do a good job... sometimes actually really funny !

    • @RodrigoTechador
      @RodrigoTechador 4 роки тому

      The subtitles are automatically generated by Google's voice recognition technology.

  • @mercygeorge2961
    @mercygeorge2961 6 років тому

    why n-1?

  • @davids8347
    @davids8347 2 роки тому

    I fail to understand how a university class that you are paying hundreds of dollars to be in can take 1.5 hours to complicate and make confusing a topic that a free UA-cam video can take 15 minutes to clearly explain... 🤦‍♂

  • @x_kingsas-_-
    @x_kingsas-_- 8 років тому

    The South region has the steepest form not the west

  • @janaria1985
    @janaria1985 8 років тому

    sorry I get it now