Explore demographic employment data with k-means

  • Published 23 Jan 2025

COMMENTS • 33

  • @jamescrumpler3438 3 years ago +4

    Excellent timing on the video! I started a Data n Drinks club, and we are trying to use kmeans clustering to fix Ohio's district map. We hope that by clustering based on a set of criteria we can address some of the election problems in the state :)

  • @Adam_0464 3 years ago +1

    Thank you! This is an excellent tutorial on k-means. Many functions are explored very elegantly.

  • @HaiLeQuang 3 years ago +1

    This channel is a gem!

  • @Adam_0464 3 years ago

    You nicely enrich the unsupervised learning literature on YouTube. Keep going!

  • @ehoumanevans4783 3 years ago +1

    Julia, thank you for your presentation. How do you manage to automatically adjust the position of your script lines, like at 6:39 and 10:00 in the presentation?

    • @JuliaSilge 3 years ago +2

      I am using the keyboard to reindent lines, which is Cmd + I on a Mac, so first I select my whole script (Cmd + A) and then I reindent. You can find keyboard shortcuts here: support.rstudio.com/hc/en-us/articles/200711853-Keyboard-Shortcuts

  • @tighthead03 3 years ago

    Great video, thanks for sharing

  • @carolinapelegrincuartero9287 3 years ago

    love your videos Julia!!! thx a lot!!

  • @parthanaboinapraveen1835 3 years ago

    Can you please explain a use case for k-means clustering in the security domain?

  • @yy-ho6op 3 years ago

    Thanks for the tutorial, the tidy() function is very handy.

  • @alehkcee645 3 years ago

    Thank you for the great video! Can you please explain how users can then find the centroid for each cluster? Or isolate which cluster is the largest?

    • @JuliaSilge 3 years ago +2

      The `tidy()` method gives you the centers of the clusters, as well as the size (i.e. how many members there are) for each cluster.
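A minimal sketch of what this reply describes, assuming the broom package is installed. The built-in `iris` measurements are stand-in data here, not the employment data from the video, and `centers = 3` is an arbitrary choice for illustration.

```r
library(broom)

set.seed(123)  # k-means starts from random centers, so fix the seed
# Stand-in data: the four numeric columns of iris (not the video's data)
fit <- kmeans(iris[, 1:4], centers = 3)

# One row per cluster: the center coordinates plus size, withinss, and cluster
centers <- tidy(fit)
centers

# Isolate the largest cluster using the size column
largest <- centers[which.max(centers$size), ]
largest
```

The `size` column is what answers the "which cluster is the largest" part of the question; the remaining columns named after the input variables are the centroid coordinates.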

  • @FieldsDynamic 3 years ago

    Thanks, Julia. Awesome videos.

  • @masheroz 3 years ago

    At 8:55 you made some proportions by dividing by the total. How could you append new columns instead of overwriting the originals?

    • @thomaswood1994 3 years ago +1

      You put the function in a named list, and it will append the function name to the original column name. You can adjust how that works with the .names argument.
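A minimal sketch of this suggestion, assuming the proportions are computed with `across()` from dplyr inside `mutate()` (the reply does not name the function). The data frame and its column names are hypothetical.

```r
library(dplyr)

# Hypothetical counts; these column names are made up for illustration
employed <- tibble(
  occupation = c("a", "b"),
  men = c(30, 10),
  women = c(20, 40),
  total = c(50, 50)
)

# A named list keeps the originals and adds "<column>_<function name>" columns
employed %>%
  mutate(across(c(men, women), list(prop = ~ .x / total)))

# .names changes the pattern used for the new column names
with_props <- employed %>%
  mutate(across(c(men, women), list(prop = ~ .x / total),
                .names = "{.fn}_{.col}"))
with_props
```

With an unnamed function, `across()` overwrites the selected columns in place, which is the behavior the question asks how to avoid.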

  • @hesamseraj 3 years ago

    Thanks again for the great videos.

  • @erickknackstedt3131 3 years ago

    Amazing content as always. Thank you!

  • @khaledf3977 3 years ago +1

    Can you please explain what to_snake_case does? Thanks. Why not just convert to lower case?

    • @jalufero 3 years ago +1

      It converts strings into "snake case" format. en.wikipedia.org/wiki/Snake_case
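A small sketch of the difference from plain lower-casing, assuming `to_snake_case()` comes from the snakecase package (the thread does not say which package the video loads). The example strings are made up.

```r
library(snakecase)

labels <- c("Total Employed", "industryTotal")

# Snake case lowercases and also joins the words with underscores
snake <- to_snake_case(labels)
snake

# tolower() only changes the case; spaces and fused words stay as they are
lower <- tolower(labels)
lower
```

The underscore-separated result is a valid R name that can be typed without backticks, which lower-casing alone does not guarantee.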

  • @UCKszbcV 3 years ago

    Thanks @julia silge. What RStudio Theme are you using? :)

    • @JuliaSilge 3 years ago +1

      It's one of the ones available in the rsthemes package, I believe Oceanic Plus:
      www.garrickadenbuie.com/project/rsthemes/

    • @UCKszbcV 3 years ago

      @JuliaSilge Thanks! Are the different bracket colours according to their hierarchy part of the theme, or something you configure separately?

    • @JuliaSilge 3 years ago +1

      @UCKszbcV The rainbow parentheses came as an option in RStudio 1.4; you do need to turn the option on:
      blog.rstudio.com/2020/11/04/rstudio-1-4-preview-rainbow-parentheses/

    • @UCKszbcV 3 years ago

      @JuliaSilge Thanks for taking the time to answer, you are a ⭐!

  • @observerXIII 3 years ago +1

    your scripts are always so fancy.

  • @jayendrakhadikar1291 3 years ago

    Excellent 🙏

  • @Ilproff77 3 years ago

    Nice video. How about showing some best practices for using tidymodels with time series forecasting? You haven't touched on time series in any of your videos.

    • @JuliaSilge 3 years ago

      If you're interested in time series forecasting with tidymodels, you can check out modeltime from Matt Dancho and some of his videos:
      business-science.github.io/modeltime/

    • @Ilproff77 3 years ago

      @JuliaSilge I know Matt's work, but I just wonder whether there are some best practices for time series forecasting using tidymodels that do not involve additional packages. Thanks for the suggestion, though.

  • @chrismartin7395 3 years ago

    Thanks Julia - that was a really helpful video! Just wondering, are there any other algorithms you recommend as an alternative for cases (like this one) where you don't see clear separation between clusters?

    • @JuliaSilge 3 years ago +2

      I think I would go to something like PCA, UMAP, and friends.

  • @fealgu100 3 years ago

    Thank you!

  • @Mrsandis89 3 years ago

    Hi Julia, brilliant tutorial as always!!! I want to try this technique soon.
    Also, I was wondering: for STM, you can print out regression coefficients for your topics. Would you know whether the code can be adjusted so that it prints more than 11 lines of coefficients, please?
    I've had a question open on StackOverflow:
    stackoverflow.com/questions/66352787/can-you-print-more-than-11-covariates-for-summary-estimateeffect