Hi everyone! I know we've all wished for a roadmap to success-I know I have. That’s why I’m excited to share a Free Portfolio Builder designed just for data analytics professionals! 🚀 It includes + Customizable Templates: Tailor your portfolio with sleek, professional templates that highlight your skills and experience in data analytics. + Auto-Populating Project Bank: Easily showcase your projects; the builder automatically populates your portfolio with data from your project bank. + Career-Ready Layout: Designed with recruiters in mind, the layout ensures your portfolio is both visually appealing and easy to navigate. Click the link to download now and start building a portfolio that sets you apart! Download Portfolio builder: imvuladesigns.gumroad.com/l/etqal
A nice first effort. A couple of question (1) What did you do about the 100,000 rows/rides that were missing start/end station name? (2) What did you do with the rides that had trip durations of less than or equal to zero i.e.
Thank you! I decided due to incomplete data with regards to the exact stations i didn't look to analyze/ filter through stations. I think i touch on the lack of granular data or whole data so the analysis is general in nature because of this. If the data were more complete i would look further into it.
@@Sorted.Data0 Im working on my capstone now and I agree that the station names were not significant to the findings for the question you were asked (How do casual and annual bike-riders use cyclistic bikes differently from one another?), when you have duration, time of day, and day of the week.
Great Analysis, comprenhensive, thanks for sharing. Also I'm working 2nd June 2024 in the business case and this is really impired me. Thanks a lot. I just have a question? is good to tell during speach, What tool did you used to make this Analysis? thanks...
Hey Tendo, I have a quick question about the data used in the capstone. There were instances with the data where ride_duration lasted some ridiculous amounts of hours (in the hundreds in fact). Did you include these instances when you cleaned your data or did you remove them?
I understood those to be people who took the bike for a longer than usual commute. It was unclear of the exact intent. i thought best to leave it as the question that needed answering would need to cater to the majority of rider behavior's. I might be wrong though as I understand the project better. LOL Stay tuned for my final outlook
@@emmanuelcerros1108 Well I am currently on the project, and because of the data size, I opted to use R Studio for my analysis. It easily takes care of the data size.
Can anyone help me with this project ? I'm not able to figure out how to move ahead after finding values such as mean, mode, average, average ride length for users by day of week etc for each sheets
hey mate, i just had a doubt regarding the data viz, im having problem for visualizing the Y axis. can you please tell me what code/function did you run to visualize the Y axis., thank you!!
I think I had the same problem, I was trying to use geom_bar, but need to be used geom_col instead. I know already pass 2 months from your question and probably you already found the solution.
Hi Sidarth, Did you mean you were trying to use the aesthetic function? In order to plot X and Y axis in R typically you use the aes() function. I hope this has cleared up the confusion
Well done, awesome sharing/communicating the findings. I am busy finalizing my project. It seems that you didn't care about the unusual data points lying at extreme of the distribution and this raises a great concern. Would believe in riding duration over 24 hours in such service? Line charts and scatter plot are the most recommended to vizualise variations across time rather than bar charts. Last but not the least it would have been interesting to look at the riding time between the two groups.
And also, this was amazing. But I have one question; what was the reason that the days of the week were in no specific order? Was that a learning curve oops or did you have an observation that this enhanced? Thank you again for this!
That was my mistake! There is a function in order to order the days of the week appropriately. It is more for cosmetic purpose but i felt for the sake of the project i'd be a little lazy.
I instead used SQL functions within Rstudio to help group the data appropriately. Its easier to do that than having to move the altered tables from MySQL to Rstudio.
What I did was use Group_by : There are 3 different types of bike categories if you take a look at the dataset as it is. There is Classic, electric, and other which may be pedal assisted. Hope this helps
Here is a video reply to everyone asking about formatting time and Date. Use the 'Lubridate' function in R-it's super easy! Follow this link to watch the short with the code breakdown. Don't forget to like, comment and share it with anyone who might need it! : ua-cam.com/users/shortszqOGBVVs_7w?si=y9WGGCMAnDT_EYwf
Hi everyone! I know we've all wished for a roadmap to success-I know I have. That’s why I’m excited to share a Free Portfolio Builder designed just for data analytics professionals! 🚀
It includes
+ Customizable Templates: Tailor your portfolio with sleek, professional templates that highlight your skills and experience in data analytics.
+ Auto-Populating Project Bank: Easily showcase your projects; the builder automatically populates your portfolio with data from your project bank.
+ Career-Ready Layout: Designed with recruiters in mind, the layout ensures your portfolio is both visually appealing and easy to navigate.
Click the link to download now and start building a portfolio that sets you apart!
Download Portfolio builder: imvuladesigns.gumroad.com/l/etqal
well. Tendo. It is a good project to demonstrate your learnings and skill from the training course. Well done and keep it up mate
Thank you for your presentation you have helped me find structure for my own project.
How would I implement and upload something like this I've done power point projects is this different
thank you for sharing . i found this helpful. watching from nigeria
My people, love from England
wonderful job! I am also working on the Cyclistic project.
A nice first effort. A couple of question (1) What did you do about the 100,000 rows/rides that were missing start/end station name? (2) What did you do with the rides that had trip durations of less than or equal to zero i.e.
Thank you! I decided due to incomplete data with regards to the exact stations i didn't look to analyze/ filter through stations. I think i touch on the lack of granular data or whole data so the analysis is general in nature because of this. If the data were more complete i would look further into it.
@@Sorted.Data0 Im working on my capstone now and I agree that the station names were not significant to the findings for the question you were asked (How do casual and annual bike-riders use cyclistic bikes differently from one another?), when you have duration, time of day, and day of the week.
Still had lat and long to create map on tableau
Great Analysis, comprenhensive, thanks for sharing. Also I'm working 2nd June 2024 in the business case and this is really impired me. Thanks a lot. I just have a question? is good to tell during speach, What tool did you used to make this Analysis? thanks...
Thank you Tendo, this was really insightful
How would I implement and upload something like this I've done power point projects is this different
Great Analysis, Tendo
Thank you Appreciated! I hope yours worked well!
Hey Tendo, I have a quick question about the data used in the capstone. There were instances with the data where ride_duration lasted some ridiculous amounts of hours (in the hundreds in fact). Did you include these instances when you cleaned your data or did you remove them?
I understood those to be people who took the bike for a longer than usual commute. It was unclear of the exact intent. i thought best to leave it as the question that needed answering would need to cater to the majority of rider behavior's. I might be wrong though as I understand the project better. LOL Stay tuned for my final outlook
Thanks so much. It very helpful. Working on the project but confused😂. Now am cleared. Permit me to ask further question.
Great job Tendo!
This was a very informative and educational video Tendo.
how did you import the data to excel/google spreadsheets/big query? it's too big
hello, I am currently working on this project and I ran into the same problem....I'm hoping you eventually figured it out and you can be of help
I got around this problem by using google storage. However you do have to pay for its use.
@@emmanuelcerros1108 Well I am currently on the project, and because of the data size, I opted to use R Studio for my analysis. It easily takes care of the data size.
I would suggest using kaggle as they have the complete dataset already loaded for you to use.
Can anyone help me with this project ? I'm not able to figure out how to move ahead after finding values such as mean, mode, average, average ride length for users by day of week etc for each sheets
thank you for sharing .
My pleasure
@@Sorted.Data0 💯💯👏🏻
hey mate, i just had a doubt regarding the data viz, im having problem for visualizing the Y axis. can you please tell me what code/function did you run to visualize the Y axis., thank you!!
I think I had the same problem, I was trying to use geom_bar, but need to be used geom_col instead. I know already pass 2 months from your question and probably you already found the solution.
Hi Sidarth, Did you mean you were trying to use the aesthetic function? In order to plot X and Y axis in R typically you use the aes() function. I hope this has cleared up the confusion
Well done, awesome sharing/communicating the findings.
I am busy finalizing my project.
It seems that you didn't care about the unusual data points lying at extreme of the distribution and this raises a great concern. Would believe in riding duration over 24 hours in such service?
Line charts and scatter plot are the most recommended to vizualise variations across time rather than bar charts.
Last but not the least it would have been interesting to look at the riding time between the two groups.
How would I implement and upload something like this I've done power point projects is this different
Thank you for the presentation. It was helpful. I'm on mine now.
I assume you used one data source out of the many provided? Thanks
If you head to kaggle.com the complete datasets for the project should be available
Hey man thank you for the presentation!
Just one question, did you land a analytic job?
I have! I am on my second stint at the moment! Always learning!
Please, can I know what tool you used for cleaning, analysis and visualization?
I used Kaggle as the code manager and various functions using R. alternatively you can also use Rstudio as it is not internet based
Hey Tendo. I'm going this project as well. mind if I pick your brain on some of your code? I'm looking at your kaggle notebook.
Hi David! Let me know what you need!
And also, this was amazing. But I have one question; what was the reason that the days of the week were in no specific order? Was that a learning curve oops or did you have an observation that this enhanced? Thank you again for this!
That was my mistake! There is a function in order to order the days of the week appropriately. It is more for cosmetic purpose but i felt for the sake of the project i'd be a little lazy.
hi there.. did you do your analysis work with SQL in a database or in Rstudio? or both?
I instead used SQL functions within Rstudio to help group the data appropriately. Its easier to do that than having to move the altered tables from MySQL to Rstudio.
I have a question?
How did you find bike type please let me know!
What I did was use Group_by : There are 3 different types of bike categories if you take a look at the dataset as it is. There is Classic, electric, and other which may be pedal assisted. Hope this helps
what tool did you use to make this presentation?
I used google slides to make the presentation deck! Keep it simple!
Here is a video reply to everyone asking about formatting time and Date. Use the 'Lubridate' function in R-it's super easy! Follow this link to watch the short with the code breakdown. Don't forget to like, comment and share it with anyone who might need it! : ua-cam.com/users/shortszqOGBVVs_7w?si=y9WGGCMAnDT_EYwf
great job!
Thank you! Cheers!
Where can i find the dataset of cyclistic?
The datasets should all be available on kaggle.com
Great Job