Cyclistic Capstone project - Google data analytics

Поділитися
Вставка
  • Опубліковано 20 жов 2024

КОМЕНТАРІ • 57

  • @Sorted.Data0
    @Sorted.Data0  2 місяці тому +1

    Hi everyone! I know we've all wished for a roadmap to success-I know I have. That’s why I’m excited to share a Free Portfolio Builder designed just for data analytics professionals! 🚀
    It includes
    + Customizable Templates: Tailor your portfolio with sleek, professional templates that highlight your skills and experience in data analytics.
    + Auto-Populating Project Bank: Easily showcase your projects; the builder automatically populates your portfolio with data from your project bank.
    + Career-Ready Layout: Designed with recruiters in mind, the layout ensures your portfolio is both visually appealing and easy to navigate.
    Click the link to download now and start building a portfolio that sets you apart!
    Download Portfolio builder: imvuladesigns.gumroad.com/l/etqal

  • @bennywong9857
    @bennywong9857 2 роки тому +17

    well. Tendo. It is a good project to demonstrate your learnings and skill from the training course. Well done and keep it up mate

  • @arilouis2493
    @arilouis2493 Рік тому +4

    Thank you for your presentation you have helped me find structure for my own project.

    • @3ma_lengend
      @3ma_lengend 5 місяців тому

      How would I implement and upload something like this I've done power point projects is this different

  • @odusanyaomotoyosi7663
    @odusanyaomotoyosi7663 2 роки тому +5

    thank you for sharing . i found this helpful. watching from nigeria

    • @pakicassava
      @pakicassava 2 роки тому +1

      My people, love from England

  • @grif5332
    @grif5332 2 роки тому +2

    wonderful job! I am also working on the Cyclistic project.

  • @CaribouDataScience
    @CaribouDataScience 2 роки тому +7

    A nice first effort. A couple of question (1) What did you do about the 100,000 rows/rides that were missing start/end station name? (2) What did you do with the rides that had trip durations of less than or equal to zero i.e.

    • @Sorted.Data0
      @Sorted.Data0  2 роки тому +3

      Thank you! I decided due to incomplete data with regards to the exact stations i didn't look to analyze/ filter through stations. I think i touch on the lack of granular data or whole data so the analysis is general in nature because of this. If the data were more complete i would look further into it.

    • @Fadein1980
      @Fadein1980 Рік тому +1

      @@Sorted.Data0 Im working on my capstone now and I agree that the station names were not significant to the findings for the question you were asked (How do casual and annual bike-riders use cyclistic bikes differently from one another?), when you have duration, time of day, and day of the week.

    • @sumstance
      @sumstance Рік тому

      Still had lat and long to create map on tableau

  • @armandogutierrez2880
    @armandogutierrez2880 4 місяці тому

    Great Analysis, comprenhensive, thanks for sharing. Also I'm working 2nd June 2024 in the business case and this is really impired me. Thanks a lot. I just have a question? is good to tell during speach, What tool did you used to make this Analysis? thanks...

  • @estheroffem
    @estheroffem Рік тому

    Thank you Tendo, this was really insightful

  • @3ma_lengend
    @3ma_lengend 5 місяців тому

    How would I implement and upload something like this I've done power point projects is this different

  • @creativeluf
    @creativeluf Рік тому +1

    Great Analysis, Tendo

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      Thank you Appreciated! I hope yours worked well!

  • @Fadein1980
    @Fadein1980 Рік тому +11

    Hey Tendo, I have a quick question about the data used in the capstone. There were instances with the data where ride_duration lasted some ridiculous amounts of hours (in the hundreds in fact). Did you include these instances when you cleaned your data or did you remove them?

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      I understood those to be people who took the bike for a longer than usual commute. It was unclear of the exact intent. i thought best to leave it as the question that needed answering would need to cater to the majority of rider behavior's. I might be wrong though as I understand the project better. LOL Stay tuned for my final outlook

  • @nwaolisahenrietta2939
    @nwaolisahenrietta2939 2 роки тому +3

    Thanks so much. It very helpful. Working on the project but confused😂. Now am cleared. Permit me to ask further question.

  • @Philippodcast
    @Philippodcast Рік тому +1

    Great job Tendo!

  • @user-bw9vo5if6l
    @user-bw9vo5if6l 2 роки тому

    This was a very informative and educational video Tendo.

  • @djnadine99
    @djnadine99 2 роки тому +3

    how did you import the data to excel/google spreadsheets/big query? it's too big

    • @angelobello9671
      @angelobello9671 Рік тому +1

      hello, I am currently working on this project and I ran into the same problem....I'm hoping you eventually figured it out and you can be of help

    • @emmanuelcerros1108
      @emmanuelcerros1108 Рік тому

      I got around this problem by using google storage. However you do have to pay for its use.

    • @edmondnathan7611
      @edmondnathan7611 Рік тому +2

      @@emmanuelcerros1108 Well I am currently on the project, and because of the data size, I opted to use R Studio for my analysis. It easily takes care of the data size.

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      I would suggest using kaggle as they have the complete dataset already loaded for you to use.

  • @ashishpottekat3625
    @ashishpottekat3625 10 місяців тому

    Can anyone help me with this project ? I'm not able to figure out how to move ahead after finding values such as mean, mode, average, average ride length for users by day of week etc for each sheets

  • @brendamg7298
    @brendamg7298 2 роки тому +2

    thank you for sharing .

  • @sidharthplays8359
    @sidharthplays8359 2 роки тому +2

    hey mate, i just had a doubt regarding the data viz, im having problem for visualizing the Y axis. can you please tell me what code/function did you run to visualize the Y axis., thank you!!

    • @hinnecco1
      @hinnecco1 2 роки тому

      I think I had the same problem, I was trying to use geom_bar, but need to be used geom_col instead. I know already pass 2 months from your question and probably you already found the solution.

    • @Sorted.Data0
      @Sorted.Data0  2 роки тому +1

      Hi Sidarth, Did you mean you were trying to use the aesthetic function? In order to plot X and Y axis in R typically you use the aes() function. I hope this has cleared up the confusion

  • @afonsoosorio2099
    @afonsoosorio2099 Рік тому

    Well done, awesome sharing/communicating the findings.
    I am busy finalizing my project.
    It seems that you didn't care about the unusual data points lying at extreme of the distribution and this raises a great concern. Would believe in riding duration over 24 hours in such service?
    Line charts and scatter plot are the most recommended to vizualise variations across time rather than bar charts.
    Last but not the least it would have been interesting to look at the riding time between the two groups.

    • @3ma_lengend
      @3ma_lengend 5 місяців тому

      How would I implement and upload something like this I've done power point projects is this different

  • @emmanuelnnacho636
    @emmanuelnnacho636 11 місяців тому

    Thank you for the presentation. It was helpful. I'm on mine now.
    I assume you used one data source out of the many provided? Thanks

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      If you head to kaggle.com the complete datasets for the project should be available

  • @bubs4552
    @bubs4552 Рік тому +1

    Hey man thank you for the presentation!
    Just one question, did you land a analytic job?

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      I have! I am on my second stint at the moment! Always learning!

  • @sholaajisegiri6754
    @sholaajisegiri6754 Рік тому

    Please, can I know what tool you used for cleaning, analysis and visualization?

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      I used Kaggle as the code manager and various functions using R. alternatively you can also use Rstudio as it is not internet based

  • @Itsdlu
    @Itsdlu 2 роки тому +1

    Hey Tendo. I'm going this project as well. mind if I pick your brain on some of your code? I'm looking at your kaggle notebook.

    • @Sorted.Data0
      @Sorted.Data0  2 роки тому

      Hi David! Let me know what you need!

  • @melissahirst3078
    @melissahirst3078 2 роки тому +3

    And also, this was amazing. But I have one question; what was the reason that the days of the week were in no specific order? Was that a learning curve oops or did you have an observation that this enhanced? Thank you again for this!

    • @Sorted.Data0
      @Sorted.Data0  2 роки тому

      That was my mistake! There is a function in order to order the days of the week appropriately. It is more for cosmetic purpose but i felt for the sake of the project i'd be a little lazy.

  • @patrickfullsail2011
    @patrickfullsail2011 Рік тому

    hi there.. did you do your analysis work with SQL in a database or in Rstudio? or both?

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      I instead used SQL functions within Rstudio to help group the data appropriately. Its easier to do that than having to move the altered tables from MySQL to Rstudio.

  • @fawadh
    @fawadh Рік тому

    I have a question?
    How did you find bike type please let me know!

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      What I did was use Group_by : There are 3 different types of bike categories if you take a look at the dataset as it is. There is Classic, electric, and other which may be pedal assisted. Hope this helps

  • @SurajKumar-wz5tv
    @SurajKumar-wz5tv 2 роки тому

    what tool did you use to make this presentation?

    • @Sorted.Data0
      @Sorted.Data0  2 роки тому +1

      I used google slides to make the presentation deck! Keep it simple!

  • @Sorted.Data0
    @Sorted.Data0  2 місяці тому

    Here is a video reply to everyone asking about formatting time and Date. Use the 'Lubridate' function in R-it's super easy! Follow this link to watch the short with the code breakdown. Don't forget to like, comment and share it with anyone who might need it! : ua-cam.com/users/shortszqOGBVVs_7w?si=y9WGGCMAnDT_EYwf

  • @matthewbyrne7590
    @matthewbyrne7590 Рік тому

    great job!

  • @Gautam_45
    @Gautam_45 Рік тому

    Where can i find the dataset of cyclistic?

    • @Sorted.Data0
      @Sorted.Data0  13 днів тому

      The datasets should all be available on kaggle.com

  • @harypriyatna6983
    @harypriyatna6983 Рік тому

    Great Job