Explore your data using R programming

Поділитися
Вставка
  • Опубліковано 26 чер 2024
  • When doing data analysis, you need to start with a good understanding of you data. To explore your data, R has some fantastic and easy to use functions. In this video I take you through the process of exploring dataset and understanding its various characteristics and dimensions using data that you can access on your computer. This is an R programming for beginners video. It is for people interested in data science, statistics, r programming, quantitative analysis and research in general. I use the tidyverse to do my data exploration. That includes ggplot, dplyr and other packages. I also work in R-Studio.
    If you're interested in doing a course in R - go to www.learnmore365.com

КОМЕНТАРІ • 133

  • @RProgramming101
    @RProgramming101  10 місяців тому +1

    Get my FREE cheat sheets for R programming and statistics (including transcripts of these lessons) here: www.learnmore365.com/courses/rprogramming-resource-library

  • @chriskerr6500
    @chriskerr6500 2 роки тому +46

    Just started Learning R Programming and your videos are some of the best and easiest to follow. Keep up the great work!

  • @TakeFlow1
    @TakeFlow1 2 роки тому +1

    Thank you for these tutorials! So far, your channel is the best one I've found to start learning R.
    Keep up the good work! We appreciate it!

  • @christianjimenez2383
    @christianjimenez2383 2 роки тому +7

    Please, please keep making these videos. It is crucial for us data analyst who are just getting started and would like to continue expanding our knowledge. I would even donate if you had a patreon

  • @darkcloud13
    @darkcloud13 4 місяці тому

    Honestly, I've never done anything in R before, just started randomly watching this video. I understood EVERYTHING! Just amazing, definitely subscribed.

  • @pipertripp
    @pipertripp 2 роки тому

    Great content, Greg. Thanks much for taking the time to put it all together.

  • @goodmanshawnhuang
    @goodmanshawnhuang 2 роки тому +1

    Great work! It re-ignited my passion for R. Thanks a lot!

  • @nkunam
    @nkunam 2 роки тому

    Thank you, Greg. You are the best!

  • @gianluca.pastorelli
    @gianluca.pastorelli 2 роки тому +20

    Your videos are simply great! I feel I've become a bit more proficient with R also thanks to you. I recently started to do some PCA on my datasets and I'm wondering if you could give an introduction to multivariate analysis with R in your next videos. Keep up the good work 👍

  • @josecastro4143
    @josecastro4143 2 роки тому +1

    Thank you for making these videos, it must take a lot of your time in preparation and editing. Cheers from Scotland.

  • @fliegendesklassenzimmer5459
    @fliegendesklassenzimmer5459 2 роки тому

    Great tutorials and huge thank you for your content! I always use a split screen with my data set on the left and your videos on the right. Please keep providing! Greetings from Berlin

  • @yidanjiang7599
    @yidanjiang7599 2 роки тому

    Your videos are amazing and so helpful for half beginner to look like a pro!

  • @mohamedfarag9584
    @mohamedfarag9584 2 роки тому +1

    Well done for the fantastic lectures that make life with R easier

  • @CindyLaquidara
    @CindyLaquidara 10 місяців тому

    I love these videos. They are so incredibly helpful. Thanks a million!

  • @DingDongGrahamBell
    @DingDongGrahamBell 6 місяців тому

    Thank YOU so much! I've learned a lot about R functions. I will be watching the rest of your video tutorial. Thank YOU, Professor!

  • @kunststoffjunkie
    @kunststoffjunkie Рік тому +1

    Wow thank you so much for this! You have taken away my fear of using this program, with this video I broke through a WALL that was in between me and my data analysis. Very clear way of explaining things as compared to other videos.

  • @Junecode
    @Junecode Місяць тому

    Great to see the level of detail to learn R here. You are so right about cut and paste 🙂

  • @rupeshingle2681
    @rupeshingle2681 2 роки тому +3

    the way of your teaching is "THE BEST "
    Thanks for giving the best information

  • @piyushchauhan1955
    @piyushchauhan1955 2 роки тому

    Learned different methods of performing same tasks. Great work ser!

  • @monicabn3067
    @monicabn3067 3 місяці тому

    Thank you for these kind of content, someone who is learning via internet… I salute you 😊

  • @douglaspage7311
    @douglaspage7311 2 роки тому

    Thank you!
    I needed to catch up to all things tidyverse and your videos have been an absolute life saver. It's like you know exactly how my brain works.
    Are you going to get to spatial data and the 'sf' package at some point?

  • @josephkimote661
    @josephkimote661 2 роки тому +1

    Good work right here. Consider doing another on data imputation especially using MICE, I know it'll be🔥🔥. Waiting, thanks in advance

  • @drsajidawan6044
    @drsajidawan6044 Рік тому

    Wow, but also feel bad that why I didn't discover your videos earlier. Great experience!

  • @melissahirst3078
    @melissahirst3078 Рік тому +2

    This is so helpful, thank you so much! I am on the capstone project for Coursera analytics 8 part course and I am SO LOST lol. I know exactly what I want to see, but throughout the course there was only 1 module on "R" and many bookmarks were noted. But the commands are not organized so it takes hours to see what code is necessary to clean something or narrow the data that you are looking for. I think that I chose a project that has an overwhelming amount of data... and it has my Kaggle freaking out lol... I WAS going to use RStudio but I was having trouble importing the datasets that I needed so I figured that if I could just code in Kaggle; and that was the platform that I was going to showcase the project on (AND that is where I found the datasets that I am using) I might as well just run the code there.. but it is taking months lol.

  • @petfield100
    @petfield100 9 місяців тому

    Your videos are simply great!

  • @ebogart42
    @ebogart42 Рік тому

    So incredibly helpful! THANK YOU

  • @ziphozihlentwatwa1546
    @ziphozihlentwatwa1546 2 роки тому +1

    Thank you for the great videos, so well explained, it's like you knew what I needed!

  • @seanmccandless7728
    @seanmccandless7728 Рік тому

    Great video for someone about to move into a position scoping data visualization using the R Suite of applications.

  • @leireagirre2340
    @leireagirre2340 4 місяці тому

    Best R teacher, thanks a lot, it was a great video!

  • @oddsratio4070
    @oddsratio4070 2 роки тому +1

    Hello, I have been using R for 10 years and the Tidyverse for about a year or two. This channel is fantastic. What software do you use to do the coloured, highlighting, annotations ? Thanks

  • @BillyCheong
    @BillyCheong 2 роки тому +1

    Glad that u are back! Thank you

  • @srishtitripathi8543
    @srishtitripathi8543 2 роки тому +1

    I am a beginner in R and I found your videos are the best. Very helpful, thank you so much.

  • @ClaudiaMartinez-jk4mj
    @ClaudiaMartinez-jk4mj 2 роки тому +2

    Omg love your videos just finished a r training and feel i have learnt so much from jus 3 or ur videos.. 💛💛💛💛💛

  • @quito96
    @quito96 2 роки тому +1

    Simply the Best -> I love it

  • @joshuakimeli4564
    @joshuakimeli4564 Рік тому +2

    This is nice, can you do CCA ANALYSIS

  • @romanvasiura6705
    @romanvasiura6705 Рік тому

    Thank you for this great lesson!

  • @Bulgarian83
    @Bulgarian83 Рік тому +1

    Thank you for this great tutorial!

  • @sohaibezzi2223
    @sohaibezzi2223 2 роки тому +1

    really very cool videos i hope you explain more about survival analysis and regression

  • @AlexKashie
    @AlexKashie 8 місяців тому

    Brilliant job... Thanks a lot

  • @lenasophieluise4719
    @lenasophieluise4719 11 місяців тому

    this was sooo helpful! thank you

  • @ponesojanda5334
    @ponesojanda5334 Рік тому +1

    Thank you this was so helpful

  • @MCshaneization
    @MCshaneization 2 роки тому

    Masterful instruction.

    • @RProgramming101
      @RProgramming101  2 роки тому

      thanks for the kind comment (much appreciated)

  • @user-pu9ll7vd5m
    @user-pu9ll7vd5m 2 місяці тому

    Excellente~! Thanks -

  • @giusyadragna8856
    @giusyadragna8856 2 роки тому

    Super helpful video, thanks!

  • @RealistWajahat
    @RealistWajahat Рік тому

    It was a great start.

  • @eyadha1
    @eyadha1 2 роки тому

    Thank you very much. Very helpful

  • @psalm2326
    @psalm2326 Рік тому +1

    Thank you sir, you explain it well ..appreaciate

  • @c.rafatulkabir6971
    @c.rafatulkabir6971 2 роки тому +4

    Greg you are an amazing teacher! can you make videos on PCA and ANOVA using R. Thanks!

  • @saniamartinkasyanju4671
    @saniamartinkasyanju4671 Рік тому

    thank you for making it so understandable....

    • @RProgramming101
      @RProgramming101  Рік тому

      So nice of you to say, Sania - thanks for the great feedback!!

  • @aseelaraji7045
    @aseelaraji7045 2 роки тому +2

    Thanks very much for the great teaching way you do. I would highly appreciate if you can do a video about cluster using R.

  • @qya.4594
    @qya.4594 2 роки тому +2

    Thanks a lot.

  • @setarehsohail5422
    @setarehsohail5422 2 роки тому +1

    many thanks!

  • @najme9315
    @najme9315 11 місяців тому

    Fantastic teaching!

    • @RProgramming101
      @RProgramming101  11 місяців тому

      I'm glad you enjoyed the video! Your positive feedback means a lot to me.

  • @livingroomc
    @livingroomc Рік тому

    Thank you!

  • @korman9872
    @korman9872 Рік тому

    vary Tx sir,

  • @curiousindividual634
    @curiousindividual634 Рік тому

    Oh wow! Nice😊

  • @dirichiumunna9365
    @dirichiumunna9365 Рік тому +1

    This is so useful!

  • @andrewjohnson4352
    @andrewjohnson4352 11 місяців тому

    Great video!

    • @RProgramming101
      @RProgramming101  11 місяців тому

      I'm glad you enjoyed the video! Your positive feedback means a lot to me.

  • @manishdas6525
    @manishdas6525 Рік тому

    It would be amazing if you do a full on tutorial on the R package called caret and machine learning. :)

  • @edwardwaji14
    @edwardwaji14 Рік тому

    You make R interesting.

  • @pinyichang4680
    @pinyichang4680 Рік тому

    thanks to you!

  • @zoro_adi
    @zoro_adi 2 роки тому

    Thanks!!

  • @adeyemioshodi3846
    @adeyemioshodi3846 Рік тому

    Great Tutor.

  • @najeehatalat6206
    @najeehatalat6206 2 роки тому

    Very helpful for teaching students

  • @vincenzo4259
    @vincenzo4259 2 роки тому

    Thanks

  • @bryllcolewan7509
    @bryllcolewan7509 Рік тому

    Amazing :)

  • @richmondanaman3819
    @richmondanaman3819 2 роки тому +1

    I humbly request you to do a video on Bayesian mixing models using R statistics software.

  • @zgb3l
    @zgb3l 2 роки тому +1

    this is so helpful

  • @ToucHDowN514
    @ToucHDowN514 2 роки тому +2

    where have you been?? keep it up!!

    • @RProgramming101
      @RProgramming101  2 роки тому +5

      haha - day job has been busy - but want to do more of this (I love this stuff).

  • @Senapsdesign
    @Senapsdesign 22 дні тому

    Gender and sex is now updated in the dataset. (gender in this video was what sex is now; female/male)

    • @RProgramming101
      @RProgramming101  8 днів тому

      ah - interesting. Thanks for pointing that out.

  • @edongoogle8290
    @edongoogle8290 Рік тому +1

    I've checked but can't find it. Where is the playlist for explore-clean-manipulate you mention in these videos? There are four playlists but none on that cycle exactly :)

    • @easydatascience2508
      @easydatascience2508 Рік тому

      You can try mine too. The playlist for Python and R provide most of the fundamentals. Source files downloadable.

  • @burakdemir3257
    @burakdemir3257 Рік тому

    Hi, just a quick question when I use this line of codes sort(table(starwars$hair_color)) it is now showing the NA values , is it because of R does not know what the NA values are and can not sort it ? Do I think right ?

  • @DrPhilby
    @DrPhilby Рік тому +1

    Tidyverse

  • @akto4677
    @akto4677 2 роки тому

    Hi, I cannot find any library tidyverse on R studio these days ??

  • @kdjbelabed1818
    @kdjbelabed1818 2 роки тому

    How to solve Partial Differential Equations in R?

  • @chaytanyakumar8939
    @chaytanyakumar8939 Рік тому +1

    Where can I find the code?

  • @OnLyhereAlone
    @OnLyhereAlone 2 роки тому

    Please HELP! Why are the functions "arrange" and "filter" not working on my data even after I have made sure and confirmed that the columns are numeric types of data?

  • @BetsabeRosas-dl7mk
    @BetsabeRosas-dl7mk Рік тому +1

    Did anybody by any chance in the very beginning instead of having as the format 87 rows and 13 variables they have A tibble with 87 rows and 14 variables?? If so how could I change the variable count to match his? Any help is appreciated thank you!

    • @afonsoosorio2099
      @afonsoosorio2099 Рік тому

      I have just tried Dim(starwars) and the output was 87 14

  • @rebeccaclosure
    @rebeccaclosure Рік тому

    lifeline!

  • @kazimalithedon
    @kazimalithedon Рік тому +1

    > select(hair_color)
    Error in UseMethod("select") :
    no applicable method for 'select' applied to an object of class "character"
    This is the error system is throwing.

    • @kazimalithedon
      @kazimalithedon Рік тому

      I have installed the package dplyr, still the problem persists.
      I am still not able to use the code -
      > starwars %>%
      + select(hair_color) %>%
      + arrange(desc(n)) %>%
      + View()

  • @xiaonanz9411
    @xiaonanz9411 Рік тому

    Why can't I install tidyverse?

  • @emmylou3013
    @emmylou3013 5 місяців тому

    I think the recent update has changed this dataset starwars

  • @jean-yvesberisse4512
    @jean-yvesberisse4512 2 роки тому

    savage vdo. thankssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssss!

  • @peterschmitt9802
    @peterschmitt9802 2 роки тому +1

    my version of the starwars data renames gender as sex, and gender becomes "masculine",and "feminine"

  • @tchistermorrelebissa8628
    @tchistermorrelebissa8628 Рік тому

    Excuse me, Sir, why not You make it/them more simplified? For example, gender, height and weight or fresh and dry weight (plant) to draw histogramme

  • @jamesleleji6984
    @jamesleleji6984 2 роки тому

    Can you do a difference in differences analysis in R. Thanks

  • @johannepersolanglois6441
    @johannepersolanglois6441 Рік тому

    The StarWars dataset is no more available.

  • @user-cg9cf4mk6e
    @user-cg9cf4mk6e 10 місяців тому

    daughterset

  • @DrPhilby
    @DrPhilby Рік тому +1

    Starwars

  • @JacquesHenriqueDias
    @JacquesHenriqueDias 2 роки тому

    Amazing! Thx a lot, it really helped me to get my first steps in this new language, if you dont mind, I typed all the codes you showed and put some explanations on portuguese for those who might find it useful...
    ```{r}
    # Explorar
    # Limpar
    # Manipular
    # Descrever
    # Visualizar
    # Analisar
    install.packages("tidyverse")
    library(tidyverse)
    # Para ver quais sao as bases de dados que estao dentro do tidyverse
    data()
    # Qunado colocamos uma interrogacao antes da base de dados, o sistema retorna algumas informacoes sobre importantes sobre a base
    ?starwars
    # O comando abaixo retorna o tamanho da base de dados, ou seja, quantas observacoes e quantas variaveis. Esta e a dimensao da base de dados, linhas e colunas respectivamente.
    dim(starwars)
    # A funcao abaixo mostra a estrutura dos dados. Nela nos vemos o nome das variaveis, o tipo dos dados e o valor de algumas primeiras observacoes no formato de lista, pode ficar um pouco confuso...
    str(starwars)
    # O comando abaixo, glimpse, faz a mesma coisa, no entanto e um comando especifico do tidyverse. Ele e menos poluido visualmente
    glimpse(starwars)
    # O comando View, com V maiusculo, abaixo os traz um visao muito mais moderna da base de dados, como se tivessemos vendo no excel mesmo. No entanto esta visualizacao e feita em uma nova aba.
    View(starwars)
    # Usado para visualizar as primeiras observacoes da base, como padrao 6.
    head(starwars)
    # Mesma coisa do comando head, so que para as ultimas observacoes
    tail(starwars)
    # Se voce quiser ver especificamente uma unica variavel
    starwars$name
    # O comando attach faz com que o R entenda que estamos trabalhando somente com essa base, e, entao nao precisamos mais indicar que se trata da base de dados starwars por exemplo
    attach(starwars)
    detach(starwars)
    # O comando names mostra os nomes das variaveis
    names(starwars)
    # O comando length traz pra gente o numero de colunas dessa base
    length(starwars)
    # O comando class retorna o tipo de dados de que se trata a variavel em questao
    class(hair_color)
    # Quando colocamos length no nome de uma variavel o que obtemos e o numero de observacoes dessa variavel
    length(hair_color)
    # O comando unique nos traz os valores unicos presentes nessa variavel
    unique(hair_color)
    # Traz a frequencia com que a variavel assumiu determinado valor
    table(hair_color)
    # Sort ira sortir os valores abaixo do comando table do menor para o maior
    sort(table(hair_color))
    sort(table(hair_color), decreasing = TRUE) # Do maior para o menor
    # Abre uma nova aba onde podemos ver o resultado do comando sort dentro do layout do View
    View(sort(table(hair_color), decreasing = TRUE))
    # O comando abaixo apresenta um grafico de barras sobre as observacoes sortidas em ordem decrescente
    barplot(sort(table(hair_color), decreasing = TRUE))
    # O comando %>% significa para o R executar a partir de %>% ate antes do proximo %>%
    starwars %>%
    select(hair_color) %>% # Apresenta a variavel com o valor de suas observacoes
    count(hair_color) %>% # Conta quantas vezes a variavel assumiu determinado valor
    arrange(desc(n)) %>% # Mostra os primeiros valores num layout tipo excel
    # Vamos agora ver como isolar os valores NA, aqueles que nao foram coletados, para olhar mais de perto
    # Seleciona todas as linhas e colunas da base
    starwars[ , ]
    # Responde se cada observacao e ou nao e um NA
    is.na(hair_color)
    # Seleciona somenta as que estao com NA com todas colunas
    View(starwars[is.na(hair_color) , ])
    # O comando abaixo serve para vermos o tipo da variavel height, lembre-se tem que ter rodado o comando attach antes para funcionar
    class(height)
    length(height) # numero de observacoes para cada varaivel
    summary(height) # resumo estatistico
    boxplot(height) # diagrama de caixa
    hist(height) # histograma
    ```

  • @monzurmorshed.
    @monzurmorshed. 2 роки тому +1

    Thank you!