Project 11. Medical Insurance Cost Prediction using Machine Learning with Python | ML Projects

Поділитися
Вставка
  • Опубліковано 30 січ 2025

КОМЕНТАРІ • 117

  • @aakash2915
    @aakash2915 2 роки тому +5

    Thank you so so much bro. I have completed my semester project yesterday. You are the best educational UA-cam channel ever watched 💗🥺

  • @Shashankramteke-b3t
    @Shashankramteke-b3t 5 місяців тому +1

    Thank you sir ,
    To explain this project in proper way
    😀

  • @shanashaji882
    @shanashaji882 11 місяців тому +2

    why standardization is not done in this project?

  • @ashwinizende7923
    @ashwinizende7923 3 роки тому +4

    my prediction on test case goes wrong why?

  • @sanjushreepradeepkumar7907
    @sanjushreepradeepkumar7907 6 місяців тому

    Great explanation!

  • @Music.Air99
    @Music.Air99 Рік тому

    your explaination is best, thank u so much

  • @SenghengMeasheanh
    @SenghengMeasheanh 6 місяців тому +1

    Why don't you find any Outlier, and don't you find the correlation between the feature and the target ?

  • @ranjanpal7217
    @ranjanpal7217 2 роки тому

    Amazing explanation

  • @mohd.navaidansari895
    @mohd.navaidansari895 3 роки тому

    very good work, very well explained

  • @lisitashamatutu1140
    @lisitashamatutu1140 2 роки тому

    Great stuff

  • @sreyassistla9554
    @sreyassistla9554 2 роки тому +1

    what is the future scope of this or how is this project useful for the real life scenario

  • @Shreenavalikihori
    @Shreenavalikihori 6 місяців тому

    Can any tell the insurance cost in the data set is paid by insurance company or by patient

  • @aditkaushal9397
    @aditkaushal9397 7 місяців тому

    test accuracy for : simple linear regression : 0.76
    svm regression: -0.11
    random forest regression: 0.803
    so our choice right now can be to plot random forest regression model or use it for this use case I also processed the values of the X_test in standard scalar and label encoded all the categorical data. My prediction using the random forest regressor for data like : 'age':19,'sex':'male','bmi':27.9,'children':0,'smoker':'yes','region':'northeast' is coming out as 17207.38 which is close to the actual value

  • @santhoshr8943
    @santhoshr8943 2 роки тому +7

    Hi brother
    Nice work you are doing. I have a doubt at last after running the predictive system there is a huge variation in my prediction and there is a User Warning with x does not have valid feature name. Can I know your suggestion on this?

    • @nilankushdas5896
      @nilankushdas5896 6 місяців тому

      I had the same issue. The predictions are not accurate....the predicted value of the exact input data shown in the video is only accurate.

  • @LoneWolf-rj1px
    @LoneWolf-rj1px 2 роки тому +4

    Random Forest Regression gave me 83.47% test accuracy. BTW, please teach hyperparameter tuning and ensemble methods for each ML problem to help in improving the accuracy!

  • @syedmuzammil4999
    @syedmuzammil4999 20 днів тому

    which are all the tools used in this project please confirm

  • @Frank-d7b
    @Frank-d7b Рік тому

    Quick question, why can't we use label encoding instead of manually replacing values. I'm new to these

  • @rm12423
    @rm12423 3 роки тому +2

    Great work.
    Request you to make end to end project including front end development and database connectivity.

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      hi! this machine learning course is all about understanding Machine Learning concepts & implementing. after this, I'll make Deep Learning course. then I'll make deployment videos. that's what I have planned for now. thanks! but I'll definitely consider ur request.

    • @rajkumarray3224
      @rajkumarray3224 2 роки тому

      can you please help me telling how to deploy machine learing in backend and html,css in frontend ? Which tutorial you followed please sir ? Help

  • @sangeethacr4253
    @sangeethacr4253 Місяць тому

    How to create a website for this project sit

  • @girijavarma5271
    @girijavarma5271 3 роки тому

    @siddhardhan want to ask while building model with DTR, with feature_importance can I drop the region feature becz its playing the least role in the model

  • @priyanka-uk3hr
    @priyanka-uk3hr Рік тому

    smokers data is not converting into 1's and 0's after applying replace? can you tell the solution

  • @jeetpatel2336
    @jeetpatel2336 Рік тому

    Hi, I was watching project 12 which was of sales forecasting. In that whole video I expected that there will be a chart which will show how sales will be in future. It was not there the video ended at R2 SQAURE. Than I read through comments and found out someone else had some question and you directed him to this video. Now in this video predictive system is build and video ended there. Now will you direct me to another video? I am a fresher and want to do some complete project and your video seems nice but if I have to go from 1 video to another I get confused. In short I have no idea how to complete my project on sales forecasting of big mart. Any direction will be good. Thank you😄

  • @rahulgaud4340
    @rahulgaud4340 Рік тому

    Sir why i am getting my charges value prediction more and more than actual data

  • @meet6592
    @meet6592 3 роки тому

    Really appreciate!👏🏻

  • @growingfire
    @growingfire 8 місяців тому

    Thanks a lot !

  • @ibrahimsoliman1842
    @ibrahimsoliman1842 3 роки тому

    Great effort. Thanks

  • @aditishrivastava3976
    @aditishrivastava3976 7 місяців тому

    Sir my linear regression is not executing why sir

  • @poonamchoudhary3697
    @poonamchoudhary3697 3 роки тому +2

    Does preprocessing done here and missing value? Any more things we can include in this?

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      hi! sometimes we may have to handle the outliers.

  • @vinaynaik953
    @vinaynaik953 3 роки тому +2

    Thank you

  • @koradasaikiran4146
    @koradasaikiran4146 3 роки тому +2

    Hello Siddardhan bro , can this idea of predicting Medical Insurance cost would be convincing to the Insurance company as we are considering the age ? For instance I have a 18 year old student who had filled the inputs (for example) like his gender , bmi , smoker / non smoker.... If he got a certain value after giving his inputs , that value could be really reliable as he was a student still..

  • @abhinavgupta2004
    @abhinavgupta2004 Рік тому

    wonderful

  • @sumasony6205
    @sumasony6205 Рік тому

    Is this project is similar to customer lifetime value about insurance

  • @dinakar.t6320
    @dinakar.t6320 2 роки тому

    Thanks alot sir

  • @surajgangani2011
    @surajgangani2011 3 роки тому

    Well explained

  • @SarveshKumar-wu3hc
    @SarveshKumar-wu3hc 2 роки тому

    How to create te endpoint ?

  • @RAHULYADAV-nl4rn
    @RAHULYADAV-nl4rn 3 роки тому +2

    how we choose which model to use for prediction?

    • @Siddhardhan
      @Siddhardhan  3 роки тому +1

      Hi Rahul! It is a very broad topic. I'll make a separate video on model selection in "Model Training" module. You can go through the course curriculum. I have given for course curriculum file in the video description

    • @Siddhardhan
      @Siddhardhan  3 роки тому +3

      As of now, make note of this:
      Classification problems:
      1. Logistic regression
      2. Support vector machine
      3. Random forest, etc.
      Regression problems:
      1. Linear regression
      2. Svm for regression
      3. Xgb rrgressor, etc
      Image recognition:
      1. Convolutional neural networks
      Speech data :
      2. Recurrent neural networks

    • @RAHULYADAV-nl4rn
      @RAHULYADAV-nl4rn 3 роки тому

      @@Siddhardhan also if possible next time can you use flask for model deployment/model predictive system and i take data as input from the user then predict the value is right in predictive system
      like age=int(input("enter your age")).....for 5 features
      then inputdata=(age,bmi...,smoker)
      inputarray=np.array(inpiutdata)
      predict=regressor.predict(inputarray.reshape(1,-1))

    • @Siddhardhan
      @Siddhardhan  3 роки тому +1

      Hi Rahul! Making deployment videos at the moment, doesn't fit the machine learning course. The contents of this course will be centred around Machine Learning models, their math, data cleaning and other things. After this machine learning course, I'll make a deep learning course. Then we can go with deployment. That would be a better order for Learning.

    • @RAHULYADAV-nl4rn
      @RAHULYADAV-nl4rn 3 роки тому +1

      @@Siddhardhan oka and thanks really looking forward to this

  • @kumariaparna2877
    @kumariaparna2877 3 роки тому

    Hi , as there is scale variation in the fields.Shouldn' we go for Standard Scaler before fitting. Can you please advise.

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      hi! as there are several categorical columns in this dataset, I didn't use standard scaler. you can try it anyways and see whether it's useful in this case.

  • @sherifhussein4247
    @sherifhussein4247 2 роки тому

    hi, how can you calculate the P value?

  • @ComputerScienceSimplified
    @ComputerScienceSimplified 3 роки тому +2

    Awesome video, keep up the incredible work! :)

  • @nitishmc6929
    @nitishmc6929 3 роки тому

    In have a small dought in the section:-Building a predictive section...
    What is the need of reshaping it to (1,-1).... what does that mean actually.... kindly give an explanation for this.... thank you

  • @dubeyjikipaathshala2849
    @dubeyjikipaathshala2849 Рік тому

    bro can we work on jupyter notebooks ?? will it be simmilar to it

    • @Siddhardhan
      @Siddhardhan  Рік тому

      Yes, you can. But please practice with pycharm & vscode as well. Notebooks are good only for exploratory purpose.

  • @abdallahmohamed701
    @abdallahmohamed701 2 роки тому

    hi sir if I want using Genetic algorithm what can I do for it if you explain this algorithm before pleas send me video
    thanks a lot to you

  • @pratikshamusmade8730
    @pratikshamusmade8730 2 роки тому

    Hello sir in this project put in my resume but what will be the description put the resume please Sujjection

  • @sanjayp7027
    @sanjayp7027 3 роки тому

    Hey, Is there a reason why are you encoding regions as 0,1,2,3 ? If you encode like that it means one region is greater than other. Shouldn't it be one hot encoding ?

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      4 classes does not cause any problem. you can definitely try one hot encoding as well. there's nothing wrong with it.

  • @reyanshgrovervlogs542
    @reyanshgrovervlogs542 3 роки тому

    I need this project. How i will get?

  • @saadhashmi9232
    @saadhashmi9232 2 роки тому

    Sir why u are not scaling the data?

  • @AspirantSarvesh
    @AspirantSarvesh 5 днів тому

    30:55

  • @rohidaskhandge2548
    @rohidaskhandge2548 2 роки тому

    The mean absolute error is around 2000 .
    How can it be improved

  • @ahmedesam6856
    @ahmedesam6856 3 роки тому

    ربنا يسترك يا جدع روح

  • @varshakamble7669
    @varshakamble7669 3 роки тому

    Help me with this error
    Value error: could not convert string to float: 'northeast'

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      hi! check your label Encoding part. there may be a minor mistake.

  • @sanchayanpal3185
    @sanchayanpal3185 3 роки тому

    Hi sir I want to learn in private from u what is the procedure to apply

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      Hi! I am not giving one-to-one training as of now.

  • @shamirrrr
    @shamirrrr 3 роки тому

    May I put this project in my resume as a fresher if I also convert this project into the flask and deploy it on Heroku?

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      you can practice this code well & put it in your resume. but I am not sure whether you can deploy it in some application. the dataset might be copyrighted.

    • @shamirrrr
      @shamirrrr 3 роки тому

      @@Siddhardhan sahi sahi, thanks

  • @abuoday4112
    @abuoday4112 Рік тому

    لماذا استخدمنا خوارزمية linerTegession

  • @amitkumar5602
    @amitkumar5602 2 роки тому

    Display screen looks blurr.. Not properly visible.

  • @shashankpandey1966
    @shashankpandey1966 3 роки тому

    can i put this on my resume ? Is this project count ?

  • @AspirantSarvesh
    @AspirantSarvesh 5 днів тому

    9:00

  • @vaishnaviyada
    @vaishnaviyada 3 роки тому

    can I know the accuracy of this prediction?

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      we don't calculate accuracy for Regression problems. there are other values like r squared value, mean absolute error, etc. please refer the video.

    • @vaishnaviyada
      @vaishnaviyada 3 роки тому +2

      @@Siddhardhan actually if I tried for other inputs which are there in dataset I'm not getting at least near values like I'm getting double of actual

    • @Siddhardhan
      @Siddhardhan  3 роки тому +2

      Try optimising the model

    • @dimplek7813
      @dimplek7813 3 роки тому

      @@vaishnaviyada even I'm facing the same problem

    • @dimplek7813
      @dimplek7813 3 роки тому

      @@Siddhardhan how am I supposed to do that?

  • @aaditya1267
    @aaditya1267 Рік тому

    bro some of the results are varying with a very big diffrence

  • @panjabirang4814
    @panjabirang4814 3 роки тому

    Allam.o.Alaikum sir

  • @kadhijakubra3805
    @kadhijakubra3805 2 роки тому

    Hello sir!can you send the ppt

  • @gibsonl2996
    @gibsonl2996 3 роки тому

    NameError: name 'regressor' is not defined ??

  • @LongSensei
    @LongSensei 3 роки тому

    Great work! I have a question, why do we always use this dataset? Do insurance companies still use this data today? Can we add features?

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      hi! it's not like that. this dataset is for demonstration. companies will have their own data.

    • @LongSensei
      @LongSensei 3 роки тому

      @@Siddhardhan Yep! Thanks very much! can you suggest me some more features to improve the data?

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      Like?

  • @jonnythebaddy
    @jonnythebaddy 3 роки тому

    while encoding the categorical variables, I'm getting this error:
    data.replace({'sex':{'male':0, 'female':1}}, inplace=True) --> CODE
    AttributeError: 'numpy.ndarray' object has no attribute 'replace' --> ERROR
    What should I do ?

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      try restarting the runtime and run the cells again. check the code given in description

  • @UtkarshGupta-fy5tv
    @UtkarshGupta-fy5tv 2 місяці тому

    the model is totally wromg if taken any other input raather then you used your model gives vague solution

  • @panjabirang4814
    @panjabirang4814 3 роки тому

    I need your in python plz help me

    • @Siddhardhan
      @Siddhardhan  3 роки тому

      write a mail to datascience2323@gmail.com

  • @datascience8012
    @datascience8012 9 місяців тому

    Lekin accuracy tih 77% hai

  • @mrs.shekarvennavaram4921
    @mrs.shekarvennavaram4921 2 роки тому

    i dont understand any of this :DDDDDDDDDDDDDDDDDDDDDDDDDDD

  • @салаватишбулатов
    @салаватишбулатов 3 роки тому +2

    crossplot "actual vs. prediction" shows that they are not fit very well. The plotting script:
    plt.scatter(Y_train, training_data_prediction)
    plt.xlabel("Actual")
    plt.ylabel("Predicted")
    plt.show()