R vs Python | Which is Better for Data Analysis?

  • Published 18 Jan 2025

COMMENTS • 408

  • @ALZulas
    @ALZulas 3 years ago +516

    I used to give a lecture on when to use R and when to use Python. I gave it for many years, but every year both languages would grow closer and closer together. I eventually stopped giving the lecture, because they're so much the same that it doesn't benefit students to talk about it anymore. The only thing that's different anymore, in my opinion, is how your brain thinks about problems. If you think about and solve problems from a programmer's mindset, Python will be easier for your brain to wrap around. If you come from SAS, MPlus, or SPSS, R might be easier for your brain to wrap around. Much like picking skis or snowboards, try them both and go with the one that feels right for the way you work.

    • @jhernandez9617
      @jhernandez9617 3 years ago +3

      No. F*ck R. It needs to die and become a bad memory for the human species.

    • @BboyKeny
      @BboyKeny 3 years ago +2

      @@jhernandez9617 Why?

    • @myworldAI
      @myworldAI 3 years ago

      @@jhernandez9617 haha ,don't F**K R, R👍👍👍👍💖💖❤

    • @dir2310
      @dir2310 3 years ago +22

      This comment saved me 11 mins, thanks!

    • @kevinboone5015
      @kevinboone5015 3 years ago +2

      I know Im asking randomly but does anyone know a trick to get back into an instagram account..?
      I somehow lost my account password. I appreciate any tips you can give me.

  • @bendirval3612
    @bendirval3612 2 years ago +67

    Python is unquestionably more straightforward as a language in general. However, it's fundamentally a general-purpose scalar language, not a vector-data language like R or a matrix language like matlab. That fact makes the type of data manipulation and analysis that is meat and potatoes in R less convenient in Python. "Hello world" is easier in Python, but real data analysis is easier in R. I use Python for general programming, but it's just not worth the trouble to force Python to pretend to be R for data analysis, econometrics, or statistics. Python is way, way behind in all forms of data analysis. For example, Python is only now considering basic ideas like "missing" values being different from "not a number" values, which the creators of R thought of and planned for from day 1. If I had to pick one language as "better," I might choose Python, but it's not better for data analysis, which is what's being discussed here.
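The "missing" versus "not a number" distinction raised above can be sketched in plain Python. This is only an illustration under assumptions: it uses `None` as a stand-in for a missing observation and `float("nan")` for an undefined numeric result, and the `readings` list and `classify` helper are hypothetical names, not anything shown in the video. (pandas has its own `pd.NA` marker for missing values, which is not used here.)

```python
import math

nan = float("nan")   # "not a number": the result of an undefined float operation
missing = None       # stand-in for "no value was recorded" (an assumption of this sketch)

# Hypothetical measurements containing both kinds of gap.
readings = [1.5, nan, missing, 4.5]

def classify(value):
    """Label a reading as 'missing', 'nan', or 'ok'."""
    if value is None:
        return "missing"
    if isinstance(value, float) and math.isnan(value):
        return "nan"
    return "ok"

labels = [classify(v) for v in readings]

# Average only the usable readings.
valid = [v for v in readings if classify(v) == "ok"]
valid_mean = sum(valid) / len(valid)

print(labels)
print(valid_mean)
```

Keeping the two cases separate lets an analysis report how many values were never collected versus how many came out of a failed computation.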

  • @bernardogrivon9139
    @bernardogrivon9139 3 years ago +130

    As a noob with only excel background, I got into R much more easily. One huge advantage of R imo is RStudio. Such a great tool to work with. Also in R the documentation is helpful, even the error messages are useful. I'm starting with python, but for me it's not as sticky and intuitive. I find Spyder as an IDE ok, but imo it's way behind RStudio.

    • @jerrisg9457
      @jerrisg9457 3 years ago +2

      Agreed on RStudio being a huge advantage.

    • @vivekjoshi3769
      @vivekjoshi3769 3 years ago

      Agree on RStudio. it is really helpful!

    • @hak41711
      @hak41711 3 years ago +2

      Try using vs code for python

    • @CuteBoyLesh
      @CuteBoyLesh 2 years ago +1

      Jupyter Notebook or Atom are great

    • @CKinUSAmerica
      @CKinUSAmerica 2 years ago

      I also pretty much only have an Excel background. But I picked up Python more easily… it's really hard to understand the R language, though I have to learn it anyway.

  • @simonparker4992
    @simonparker4992 2 years ago +60

    Two minutes in, you're peddling the standard nonsense that R is a statistical package. I've been using R for twelve years and pretty much never for statistics. Text processing, data cleaning, report writing (markdown) and GIS, GIS, GIS. R is really good for mapping and geospatial data processing (not just spatial statistics).

    • @Hugo-fi9xr
      @Hugo-fi9xr 2 years ago +10

      The strength of R is statistics, you can't deny that. Sure, you can do other things with this language, but its strong points are making plots, modifying data frames and statistical tests. Sorry for my English btw

    • @alienboogieman
      @alienboogieman 2 years ago

      2 mins in and you already b**ching. Geez that flacid ego needs be toned down son.

    • @simonparker4992
      @simonparker4992 2 years ago +8

      @@alienboogieman I was being polite. It's a crap video at best. It's disingenuous and dishonest at worst.

    • @alienboogieman
      @alienboogieman 2 years ago

      @@simonparker4992 is that what your wife said to you before she left your ass? If so, good because you assume you know best when you do not.

    • @عبدالمجيدالعتيبي-ز6ع
      @عبدالمجيدالعتيبي-ز6ع 1 year ago

      Are there similarities between C# and the R language? I'm using C# now and I have good experience with it.

  • @lauragonzalez5584
    @lauragonzalez5584 3 years ago +41

    I've waited long for this video! Right now I'm learning Python and in my company, they use both depending on if we are using classical statistical models or ML. However, I'm also an economist who would like to get more involved in academia and I think R is more used there than Python. Both are excellent choices tbh

    • @adrielbezerra7887
      @adrielbezerra7887 2 years ago

      @Harry S It depends on where you are and the country's laws. Here in Brazil there's no law which regulates the data analyst profession in the private market (aka companies). But in the public sector it is required to have a university degree as a statistician, IT specialist, software engineer, etc.

    • @skcocemag
      @skcocemag 2 years ago +2

      Controversial take: I would suggest python for economists. Reason being, if you're an economist, you are likely to use (or a coauthor is likely to use) Stata. The newer versions of Stata talk to Python really really well. I can run Stata from within Jupyter or Spyder or run Python from within Stata VERY easily, and that includes, for example, running a Python command from Stata USING my live Stata dataset! In other words, you can open stata, play with some data, then run a python command on that data, then run a stata command on that data, etc.
      Will Python be able to do something newer, for example techniques that have come out in the wake of Goodman-Bacon 2018? Probably not. But just use the instructions to turn your section of your .do file into python code and run what you need to there, then switch back. Easy peasy.

    • @hivan04
      @hivan04 11 months ago

      Thank you for this comment, I'm currently a second year Econ student and this helped a lot!

  • @davidreynolds3895
    @davidreynolds3895 3 years ago +32

    The syntax example for R is way more complicated than it needs to be. You technically don’t even need to load any packages to read in a CSV and calculate the mean.

    • @simonparker4992
      @simonparker4992 2 years ago +13

      x

    • @dominicj7977
      @dominicj7977 9 months ago +1

      As a non-programmer who uses programming for work, I find R's syntax to be more intuitive. Programmers may think in a different way.

  • @keith819
    @keith819 3 years ago +26

    For a new programmer, I'd say learn Python.
    It's much, much easier to get a job with Python: you're in the general software engineer camp vs being locked into data scientist roles.

  • @jamesstonehouse3448
    @jamesstonehouse3448 3 years ago +18

    I came to R from using C, visual engineering environment (an instrument control language used in metrology), SAS & SQL. Nowadays I make my living with R, automating reporting, text mining, and developing data manipulation tools for an intelligence team. It has to be said that in my industry, I haven't yet come across a Python user. It might just be that the big players in town all have either an R or a SAS background.

  • @jerchiury
    @jerchiury 3 years ago +16

    I don't want to worry too much about data types when doing my analysis. The fact that base R supports operations on matrices and data frames makes it much easier to use. For example, R knows to subtract two series (columns or vectors, whichever) term by term. It's pretty messy in Python, where you get lists, Series, arrays and such, all with different methods for that one exact operation.
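The point about term-by-term subtraction can be illustrated with built-in Python lists, which do not define `-` at all. A minimal sketch, assuming two made-up lists `x` and `y`; NumPy arrays and pandas Series do subtract elementwise, but this block sticks to the standard library.

```python
# Two hypothetical columns of equal length.
x = [5.0, 7.0, 9.0]
y = [1.0, 2.0, 3.0]

# Built-in lists do not support the minus operator.
try:
    x - y
    list_minus_supported = True
except TypeError:
    list_minus_supported = False

# The plain-Python way: pair the elements up and subtract each pair.
diff = [a - b for a, b in zip(x, y)]

print(list_minus_supported)
print(diff)
```

This is the kind of container-dependent behaviour the comment describes: the same operation is spelled differently depending on whether the data sits in a list, an array, or a Series.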

  • @solom465
    @solom465 3 years ago +112

    Personally I prefer R when doing hardcore data analysis. dplyr, ggplot2, and the rest of the tidyverse enable you to do more with dramatically less code compared to Python. For anything outside of hardcore data analysis I use Python.

    • @AlexTheAnalyst
      @AlexTheAnalyst 3 years ago +9

      I can definitely feel that

    • @winnumber101
      @winnumber101 3 years ago +1

      I’m with this

    • @squirrelpatrick3670
      @squirrelpatrick3670 3 years ago +1

      I really prefer pandas to dplyr, and R was my first language of the two. I did spend a while getting fluent with it.

    • @dominicj7977
      @dominicj7977 3 years ago +3

      ​@@squirrelpatrick3670 R's data table is one of the fastest in the whole programming language universe.
      I rarely use dplyr or pandas after I started using data table in R

    • @hammadsheikh6032
      @hammadsheikh6032 3 years ago +1

      Same here! R is king for hardcore analysis, but go beyond that, and Python leaves it in the dust. But R is my first love.

  • @tanvird3
    @tanvird3 3 years ago +41

    5:56 you can use colMeans(nba[sapply(nba, is.numeric)]) for calculating means of the numeric columns, you don't even have to import any libraries. I understand the python way is still cleaner, however, there are tons of situations where the other way around is true.
    7:09 library(tidyverse) and you get every functionality that python pandas can offer, you don't have to remember a lot of things for doing a simple task.
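For comparison with the `colMeans(nba[sapply(nba, is.numeric)])` one-liner above, here is a rough standard-library sketch of "mean of every numeric column". The `rows` data and the `numeric_column_means` helper are hypothetical, and the numbers are made up rather than taken from the video's NBA file. In pandas the usual call would be `DataFrame.mean(numeric_only=True)`, which is not used here so that the block runs without third-party packages.

```python
from statistics import fmean

# Hypothetical rows standing in for a small table; values are invented.
rows = [
    {"player": "A", "points": 10.0, "assists": 4.0},
    {"player": "B", "points": 20.0, "assists": 6.0},
    {"player": "C", "points": 30.0, "assists": 8.0},
]

def numeric_column_means(rows):
    """Return {column: mean} for columns whose values are all int or float."""
    means = {}
    for col in rows[0]:
        values = [row[col] for row in rows]
        # bool is a subclass of int, so exclude it explicitly.
        if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in values):
            means[col] = fmean(values)
    return means

col_means = numeric_column_means(rows)
print(col_means)
```

The text column is skipped and only the numeric columns are averaged, which is the same filtering step the R snippet performs with `is.numeric`.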

    • @rashawnhoward564
      @rashawnhoward564 3 years ago +15

      Even easier: summary(data) will give the mean, median, Q1, Q3, min, and max of the data with no loading of packages, and it's cleaner than Python.

    • @explicitlynotboundby
      @explicitlynotboundby 3 years ago +2

      @@rashawnhoward564 Exactly. Alex is bullshitting.

    • @BzRist
      @BzRist 2 years ago +1

      In R, you could also use library(tidytable) for the same functionality with great memory efficiency.

    • @frankaragona1463
      @frankaragona1463 2 years ago +1

      Was just going to say this. That was a pretty poor example.. There are so many more situations where R is cleaner and easier to use for wrangling data. I feel that pandas is disappointing whereas dplyr/tidyverse in general are better tools for data science

  • @jacobusstrydom7017
    @jacobusstrydom7017 3 years ago +37

    Very good, I mostly use R. But when you combine R with Python and SQL, you are an unstoppable data programming machine. So learn both, it's a lot of fun.

    • @DailyMental
      @DailyMental 3 years ago

      Hey sir, im currently learning from scratch, would you recommend me to learn R first and then move to python or what would be your approach?

    • @jacobusstrydom7017
      @jacobusstrydom7017 3 years ago +5

      @@DailyMental wow, good question. I learned SQL, then Python and R.
      My opinion is that R requires less code to do something than Python. Also the amazing R tidyverse package makes it so much easier to code and to work with data. Also R mostly just works, whereas with Python you do get a couple more issues with package versions.
      But saying that, SQL is also a great place to start because it's easy to read and understand the code.
      Keep in mind that a lot of companies store data in databases, so it's always a bonus if you can use SQL to extract data from the database and then analyze it in R or Python.
      This is just my opinion. Good luck

    • @DailyMental
      @DailyMental 3 years ago +1

      @@jacobusstrydom7017 Thank you for the advice! I was thinking this as well, SQL is my first step and then R since im from a business background and its probably better to have a solid foundation before learning more complex syntax.

  • @arcaneknight9799
    @arcaneknight9799 3 years ago +96

    Power BI vs. Tableau

    • @deniskk2
      @deniskk2 3 years ago +6

      power bi..... no mistake there. download it & see 30 minutes demo

    • @arcaneknight9799
      @arcaneknight9799 3 years ago +4

      @@deniskk2 I've used Power BI and love it. I don't have much experience with Tableau, so I'm wondering his justification for Tableau.

    • @AlexTheAnalyst
      @AlexTheAnalyst 3 years ago +28

      Coming soon!

    • @diazjubairy1729
      @diazjubairy1729 3 years ago +7

      Tableau is expensive but has great features and UX; Power BI is cheaper, even free, but its UX is not so great. Both apps will do your data viz job eventually.

    • @osito9810
      @osito9810 3 years ago +3

      guysss, Google Data Studio is better than Power BI and Tableau

  • @Vivian-ve1qt
    @Vivian-ve1qt 3 years ago +4

    When I subscribed to this channel two weeks ago I did it because I wanted to be ready for my data analyst interview. I passed it very well and I think this channel helped at least when it came to learning more about the job and the differences between a data analyst and a data scientist. I will start on the first day of March and I am looking forward to it. I am studying for a master in Big Data at the same time and I am learning R there, whereas I need to learn Python for work. R doesn't look difficult to me but Python kinda looks more familiar for me and those with a background in other general purpose programming languages. I agree about the huge amount of libraries in R and I think that it is really great for visualization. However, since Python is becoming the most popular programming language I would already prefer it for that reason alone not counting anything else.

  • @mohamedjelassi9672
    @mohamedjelassi9672 3 years ago +10

    When we speak about analysis, we speak about mathematics and more precisely statistics... in my point of view, R has more mathematical libraries than python ... and please keep python for web development and other stuffs

  • @datamics
    @datamics 3 years ago +8

    It truly depends: once on personal preference, and also on what your work, that is your company, requires you to use. I prefer Python, and I think Python will grow to offer the same amount of features (if not more) as R in the future.

  • @jaqo92
    @jaqo92 3 years ago +14

    I learned both. My conclusion is that Python is better, but I love R Markdown and ggplot more than Jupyter and matplotlib + seaborn.

  • @adrielbezerra7887
    @adrielbezerra7887 3 years ago +11

    Honestly, I like both. Since I came from SPSS and statistics background, R suits me better. But when I need to analyze missing values or do some graphics, Python helps me a lot more.

    • @ankicanozinic6551
      @ankicanozinic6551 2 years ago +1

      What would you recommend to a beginner in statistics - SPSS or R?

    • @adrielbezerra7887
      @adrielbezerra7887 2 years ago +2

      @@ankicanozinic6551 If you have never touched a database or programming before, SPSS may be easier to learn, since it resembles Excel and the like a little bit. Also, you can click the buttons SPSS offers and the software gives you the programming script that your clicks generate. Disadvantage: SPSS requires a paid license. But it has a trial version to test.
      If you're committed (and have enough time to study) to learning programming basics alongside statistics, R is the way to go.
      The answer is: it depends. I always tell my students to go step by step. Well, hopefully it will be useful to you.

    • @ankicanozinic6551
      @ankicanozinic6551 2 years ago +1

      @@adrielbezerra7887 thank you for a thorough explanation

    • @tufazzulmiah6299
      @tufazzulmiah6299 10 months ago

      I am a student with a statistics background. If I learn MS Excel, R and Power BI, is that enough for smooth data analysis as a data analyst?

  • @chancellorpalpatine3352
    @chancellorpalpatine3352 3 years ago +10

    I think R is better, as pandas is much slower and less easy to use than dplyr. Data prep takes up most of my time, so this is huge. R and Python are roughly equivalent to me for machine learning. A lot of these ML packages are just R and Python wrappers around C code. Maybe if you work for a FANG company and do a lot of pure deep learning, Python may be better, but I think that situation is rather rare.

  • @prishitachoudhary8920
    @prishitachoudhary8920 3 years ago +2

    Thank you Alex for this video!

  • @davidyolchuyev2905
    @davidyolchuyev2905 3 years ago +11

    For finding the mean of a column in R, you use the mean() function. I don't know why you have shown pipes in the R section of the syntax example.

    • @olivierspanhove8808
      @olivierspanhove8808 3 years ago +2

      I know right, it's even one of the functions in the base package !

  • @arielleung3917
    @arielleung3917 3 years ago +5

    R and Python require totally different mindsets. Picking the better one is like asking "which is a better career, statistician or engineer?". With my mathematical background I find R code much more straightforward. When I started to use Python, no single piece of code was unreadable, but the entire workflow was unfamiliar: how tasks are broken down, why a copy is made here, and so on. It also took me quite some time to be convinced that Python does not have a library for the Spatio-Temporal Autoregression model (for a few hours I thought I just hadn't searched the right way), since R offers abundant solutions for spatio-temporal data. Eventually I realized that modelling is never at the top of engineers' priority list, and mathematicians/statisticians can focus on the intellectual work only because engineers have got their hands dirty. Also, Python makes it easier to collaborate with other platforms. If I were to communicate with laymen rather than other professionals in my company, deploying a dashboard or web app would be the best explanation. Again, it didn't occur to me because this hadn't been my top priority, so I preferred R. Now the job has changed and I'm using Python more often.

    • @LHommeEnVert
      @LHommeEnVert 1 year ago

      If there is no library for spatio-temporal autoregression, you can build it from scratch with Python, as an object-oriented programming language, which is not possible in R

    • @ikari0133
      @ikari0133 4 months ago

      @LHommeEnVert then how do you think they built the library in the first place??

  • @aldorodriguez7310
    @aldorodriguez7310 1 year ago +7

    It would be nice to have a video with examples or real world scenarios for both cases.

  • @nasser.h9473
    @nasser.h9473 3 years ago +10

    I don’t think it’s a big deal, it’s a preferential thing! I’m currently learning how to use Python

  • @ethaniel7551
    @ethaniel7551 3 years ago +22

    I guess you could say that the messages Alex received in regard to his “controversial” post included some R-Rated content ;)

  • @CunningCaracal
    @CunningCaracal 3 years ago +24

    From my understanding if you're familiar with SAS, R would probably be easier to pick up vs Python. I personally started with C#/C++ so python was easier for me to pick up. Also perfect troll post on LinkedIn, just say something controversial and walk away lol 👍.

  • @QuantumImperfections
    @QuantumImperfections 3 years ago +28

    I've personally enjoyed my SQL, Excel, R, PowerBI group I've got set-up. The only thing I really planned for was learning PowerBI but the rest came about oddly naturally. Great video by the way!

    • @AlexTheAnalyst
      @AlexTheAnalyst 3 years ago +4

      That's a good toolbelt right there!

    • @praveen26699
      @praveen26699 2 years ago

      I too learned Power bi...Is it easy to learn R?

    • @QuantumImperfections
      @QuantumImperfections 2 years ago +1

      @@praveen26699 I found it particularly easy. I'd picked up most of it on my own after reading "Learn R" by Aphalo. I'd taken some DataCamp courses and other paid courses by people like Matt Dancho that provided spot-on business applications for it. I also read "Advanced R" by Wickham, and with all of that R is my main power tool in the tool chest.
      R is a lot like Excel and SQL, Python is a lot more like other programming languages. All of the above are interchangeable and as long as you can learn how to provide business value you are golden.

  • @BabyBugBug
    @BabyBugBug 3 years ago +4

    It helps to know many programming languages - that much I have learned so far

    • @BboyKeny
      @BboyKeny 3 years ago

      True, for me it broadens your way of thinking about programming in general.
      Since every language approaches the subject in a unique way with a unique motivation.
      It makes you very appreciative of the strengths and weaknesses of every language.

  • @kirillshmidt3776
    @kirillshmidt3776 2 years ago +4

    filtering with pandas
    df[df['column'] == x]
    vs R data.table
    df[column == x, ]
    Which is easier to read? It becomes even worse in pandas when you have a more complex condition.
    Not to mention multi-index in pandas. It is hell.
    And a direct comparison with reading a CSV in R, which you can also do in 2 lines:
    x

  • @Dylan-zm3ht
    @Dylan-zm3ht 3 years ago +18

    I prefer the R syntax. I find it easier to remember “weird” syntax.

  • @bgpolitan
    @bgpolitan 2 years ago +5

    I am an economist trying to dedicate to data analysis and I still didn't understand the pros and cons of both, so this video is exactly what I needed. Thank you! 😄

  • @gj3933
    @gj3933 3 years ago +2

    There must be a reason behind the growing number of R packages. My clients won't care about if I produce results in R or Python. If they ask me to build an app, then I'll reconsider.

  • @souhaibsebbane5623
    @souhaibsebbane5623 2 years ago +2

    Stopped watching at 5:55 because either Alex was biased or he has no idea whatsoever about R, since he did not use the mean( ) function which is even a base R function and you don't need to install and load any package to use it.

  • @nicov1003
    @nicov1003 2 years ago +5

    This guy is not using R correctly if that's how he's calculating the mean for a dataset lol

  • @hernanmorales6681
    @hernanmorales6681 3 years ago +1

    Amazing video!! Thanks!!

  • @sergiotorres513
    @sergiotorres513 3 years ago +16

    Great video. But I think you could make the syntax example with R much easier. If you want to know the mean of a variable or of data, you only type mean(data) or mean(data$variable); of course, it depends on whether this variable or data is numeric or not. Thanks for your video. Regards.

    • @AlokPratapSingh4001
      @AlokPratapSingh4001 3 years ago +1

      Also, mean for each column, just use the code
      sapply(dataframe, mean)

    • @dominicj7977
      @dominicj7977 3 years ago

      using apply functions or colMeans()

    • @hak41711
      @hak41711 3 years ago

      You could simply use the describe method for that using pandas

  • @jacqmarthy536
    @jacqmarthy536 3 years ago +4

    The YouTube algorithm brought me here lol. I think an example of data analysis using popular libraries in both, for comparison, would be good: the processing time, the amount of code written, etc. For me Python is easier, since in college we used C++ and Fortran for learning the basics of algorithms and numerical methods. The one time I had to use SPSS for statistics assignments I got really confused.

  • @trancehvn
    @trancehvn 3 years ago +2

    I feel this video is a little biased at 5:55, as I don't think anybody will write that code to just get the mean

  • @RobertR1611
    @RobertR1611 3 years ago +2

    Thanks, Alex. Great video. The right tool for the right job.

  • @annoyingprecision2487
    @annoyingprecision2487 3 years ago

    Liked and subscribed! Thank you for the valuable input.

  • @fernando_dominguez
    @fernando_dominguez 2 years ago

    Loved it! Thank you very much for your content, just started following you.
    My advice is just express your opinion like you did, makes content far more unique.
    Cheers!

  • @behrad9712
    @behrad9712 2 years ago +1

    thank you!🙏

  • @borisn.1346
    @borisn.1346 2 years ago +1

    Love your overall points Alex, but saying that R can't be integrated in web apps is plain wrong!

  • @rashawnhoward564
    @rashawnhoward564 3 years ago +4

    You definitely cherry-picked. To get the mean of a column you don't need to load packages; the base summary function will give that. summary(data) will give you the mean, median, Q1, Q3, min and max of every numeric column, not to mention the counts of qualitative columns.
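As a rough Python counterpart to the `summary(data)` output described above, the standard library's `statistics` module can produce the same six numbers for one column. This is a sketch under assumptions: `five_number_summary` is a hypothetical helper name, the input is made up, and the "inclusive" quantile method is one reasonable choice rather than a guaranteed match for any particular tool's default. pandas users would typically reach for `DataFrame.describe()` instead.

```python
from statistics import fmean, median, quantiles

def five_number_summary(values):
    """Return min, Q1, median, mean, Q3 and max for a list of numbers."""
    # n=4 yields the three quartile cut points; "inclusive" treats the data
    # as covering the whole range, so min and max are the outer bounds.
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return {
        "min": min(values),
        "q1": q1,
        "median": median(values),
        "mean": fmean(values),
        "q3": q3,
        "max": max(values),
    }

summary = five_number_summary([1, 2, 3, 4, 5])
print(summary)
```

Applying the helper to each numeric column in turn would give a per-column table similar in spirit to what the comment describes.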

  • @Salmanbro96
    @Salmanbro96 3 years ago +15

    SQL VS NOSQL

  • @andrijamilojevic1891
    @andrijamilojevic1891 9 months ago

    Thank you Alex!

  • @anthonysoltero88
    @anthonysoltero88 3 years ago +28

    Use the language your team uses.
    The guess work can be taken out based on the company you work for or the company you want to work for in the future.
    If they use R, use R. If they use Python, use Python. If it’s only up to you, flip a coin.

    • @ArnavRoyRA
      @ArnavRoyRA 3 years ago +1

      LoL Flip a Coin....... that's what I'm gonna do as a Student to start Learning XD

    • @jaqo92
      @jaqo92 3 years ago +1

      Pandas > tidyverse

    • @dominicj7977
      @dominicj7977 3 years ago

      @@jaqo92 R data table > pandas

  • @romanvasiura6705
    @romanvasiura6705 2 years ago +1

    Thank you!
    I'll try both.
    Approximately in a month I'll have a course in Coursera about R (from Google Data Analyst Certification),
    but after that I'm interested to try Python as well.

  • @YlmazDALKIRANscallion
    @YlmazDALKIRANscallion 2 years ago

    Thanks for clarification.

  • @down_with_data
    @down_with_data 3 years ago +4

    By the way R can be embedded in web application.

  • @mohammedbouchcoucha890
    @mohammedbouchcoucha890 3 years ago +5

    Miss you Alex!! i worked with both and i've one little thing to say, in R you can write mean(nba) or you can use summary(nba)

  • @uqwang
    @uqwang 3 years ago +5

    They’re just tools to get my work done. I use both on daily basis.

  • @richardaversa7128
    @richardaversa7128 1 year ago

    Excellent summary, great balance of conciseness and examples.
    "R is harder to learn, but has more features"... specifically for analytics, right? My understanding is Python has far more features in general. Never heard of someone building a mobile app in R.

  • @simonparker4992
    @simonparker4992 2 years ago +3

    The R code for your example is:
    x

  • @adrianchong5883
    @adrianchong5883 3 years ago +6

    Debates of Python VS R are pointless imo. They are good in their own ways.
    Most importantly, they are better than SAS.

    • @ponalvsiki2254
      @ponalvsiki2254 3 years ago

      Well, if you ask CS students, what programming language should be learnt first, 99% will tell you Python. CS students just love Python so much that they could have sexy dance with it if it were a girl.
      If anyone tells you to learn C first, you know you find your true lover.

  • @kaym7704
    @kaym7704 3 years ago +2

    I’m new to programming and I chose Python as a starting point... easier for a noob like me.

  • @aalb1970
    @aalb1970 11 months ago

    I mostly do data analysis on survey data and in my experience R is more robust in this regard. For instance, there are several packages that will create survey weights for you, but I have yet to find one Python package that actually works.
    I do agree that Python syntax is somewhat easier to pick up, but once you understand vectorized operations in R it becomes easier to use.

  • @ai.aspirations
    @ai.aspirations 3 years ago +1

    thanks much!

  • @danyos777
    @danyos777 2 years ago +1

    I would like to see a second part video comparison focused on comparing R and Python languages from a business standpoint rather than their more general-purpose, programming capabilities for building applications, and heavy used of sophisticated statistics that do not apply to the average business world. For instance, many of us in business are hoping to learn which language is better for business analysis which, after all, is the trend in using either of these languages.
    What we learn from the video is that R is being highlighted as useful in purely statistical analysis, while the comparison with Python does not provide any insights into Python's capabilities for statistical analysis. R is being highlighted as great for statistical analysis, however, advanced statistics is mostly used by the scientific and academic community mostly as well as sophisticated business environments whereby most of these advanced statistics are not needed in the general business world.
    I would like to see the view from a business analyst/business intelligence professional who has truly used both R and Python for exactly the same purpose, using these languages for business analysis. It would be great to move away from the general-purpose and application development and get more into the business uses for each language and on what statistical and data analysis truly serves the vast majority of business users, business intelligence and data analysts analyzing business related data.
    Looking forward to this second video. Thanks Alex!

    • @kirillshmidt3776
      @kirillshmidt3776 2 years ago

      For business analyst/business intelligence work both languages are equal, and it is more about preference than advantages. Usually there are no issues with performance, and you do not need sophisticated models and packages. You can build your own functions and tailor them to your field in both languages. Maybe visualisation is better with ggplot in R (it is more versatile). But if you want to build proper self-service BI, it is better to go with classical BI tools like Tableau, Power BI, etc. R and Python are for the search for deep insights by hardcore analysts, and BI tools are for managers.
      I am actually bilingual in R and Python and do both ways in my work.

  • @AaronMOliver
    @AaronMOliver 3 years ago +4

    Hard to go wrong either way. If your job leans more towards data engineering and ETL then probably Python is a good choice to start with first, IMO.
    Thanks Alex!

  • @hibou647
    @hibou647 3 years ago +2

    In my opinion the biggest advantages of R are its IDE Rstudio and the capacity to execute only the mouse selected portion of code (no, notebooks are not as convenient). Web deployment is possible through Shiny but it seems much more of a hassle than on Python.

    • @MrMafiale
      @MrMafiale 2 years ago

      Try rmarkdown, sweave or knitr for notebook IDE. They are even better than the python notebooks I worked with so far

  • @a.s.vanhoose1545
    @a.s.vanhoose1545 1 year ago

    Simple fact is that a good programmer in either language can handle 95% of data analytics tasks easily.

  • @RCasey100
    @RCasey100 3 years ago

    I know absolutely nothing about Python, but your example at ~6:30 is a major giveaway that you are not experienced enough with R programming to form a reliable comparison - the example could be done in base R with two lines of simple code. I've never seen such an overcomplicated way to find the mean as you described..

  • @AndyQuinteroM
    @AndyQuinteroM 3 years ago +3

    I like functional programming. I love R for data science. Anything else I’ll just write out some C or JS

  • @rishimetawala
    @rishimetawala 3 years ago +2

    i actually gave up on R as I moved to a more strategic role and away from hardcore data analysis, i found it harder and harder just to recollect syntax across different libraries. Plus I see Industry is tilting more and more towards Python and learning Python is kind of "future proofing" your time spent on it.

  • @kidhardt
    @kidhardt 3 роки тому

    There's something to be said for the experience of Python over R, not to mention the rise of machine learning for good prediction over standard statistics for inference.

  • @planetary-rendez-vous
    @planetary-rendez-vous 2 роки тому

    Just calling attention to librarian::shelf(tidyverse)
    You don't need to write 10 lines of library(dplyr); you can write all your package names in a single line of code, and it will automatically install them if needed and load them.

  • @vishnuvrv
    @vishnuvrv 3 роки тому +3

    Google chose Python for its ML/AI coding. So if you are looking at ML or AI, python is the way to go.

  • @KRMukundan
    @KRMukundan 4 місяці тому +2

    R is easier to learn than Python. I already know Python, and it was hard to learn. I am learning R now, and I must say it's easier.

    • @brianhart4108
      @brianhart4108 4 місяці тому +1

      Well, if you learned Python long ago and used it for data, then first, you now know what you are doing, and second, there's a 100% chance you picked up some C because of how Python works under the hood. So when you see R now, it all makes sense. For a beginner without much computer knowledge, it's different: when I learned R as my first language, it was a nightmare, and I don't remember anything fun about it, or anything besides homework.

  • @Ma-Attia
    @Ma-Attia 2 роки тому +1

    U R amazing man ❤️👏

  • @nkristianschmidt
    @nkristianschmidt Рік тому

    Both are needed once things get a little advanced

  • @Chris_87BC
    @Chris_87BC Рік тому

    I learned R and Python, and I can say R is much easier to learn, but Python is way more robust. I replaced some VBA code that creates MS Excel workbooks from a template, and in Python it took about 3 seconds to complete. Using R took about 45 seconds. After I saw the benefits and speed of Python, I put R aside and focused on Python.

  • @renanlolop
    @renanlolop 3 роки тому

    For me, it's clear. Python is better for most people, but if you have a strong math and stats background, R is probably the best.
    It's so much easier to collect data, clean it, and put everything to work in Python. But R is just THE WAY TO GO for statistical analysis. You get so much out of the box. So many statistics, it is amazing.
    TL;DR: learn both. R for statistical and ML modeling, Python for anything else.
    If learning both is not an option, probably go the Python route.

  • @jermainecameron6209
    @jermainecameron6209 3 роки тому +5

    Once again, thanks for sharing your thoughts with us Alex. Can always count on your fair unbiased opinions.

    • @simonparker4992
      @simonparker4992 2 роки тому +1

      It's crap. He has no idea what he's talking about.

  • @easydatascience2508
    @easydatascience2508 2 роки тому

    Python is a computer-scientist-designed language; R is a statistician-designed one. Python uses = for assignment, while R uses <- as its assignment symbol. Python's functions are more flexible than R's. Deep learning packages are written for Python, but not R. So R is a statistics-like language, and Python is a data science language.

  • @joonyoon7900
    @joonyoon7900 3 роки тому

    General rule of thumb for me.
    1. parsing? data wrangling? -> python
    2. statistics? single cell? -> R

  • @ahmedelsheshtawey438
    @ahmedelsheshtawey438 3 роки тому +5

    If I learn SQL for data analysis from your playlist, is that enough, or do I need to continue with another course?

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 роки тому +1

      For sure! I think it's a good place to start :D

  • @drmack911
    @drmack911 2 роки тому

    The genesis of R actually dates back to circa 1975 at Bell Labs, where it was named S. Python had its origin around 1989.

    • @-Jakob-
      @-Jakob- 7 місяців тому

      And Python was inspired by ABC, which was inspired by SETL.

  • @paultoronto42
    @paultoronto42 3 роки тому +1

    I think Python is better too, but I do like Hadley Wickham's TidyVerse for R.

  • @LucasPeitton
    @LucasPeitton Рік тому

    It's insane how many times you had to ask for forgiveness in advance to avoid potentially offending anyone. We are all different and have different opinions. Get over it, people! Very cool video, mate. As a statistician, I'm very much in love with R, but I'm trying to learn Python, as I am very aware of its coverage and power. Cheers

  • @pieterrossouw8596
    @pieterrossouw8596 3 роки тому +1

    Underrated skill that's complementary to these is Excel PowerQuery... Poor man's PowerBI and the only thing that makes Microsoft's Office suite irreplaceable by even the best of clones.

  • @bo3thman917
    @bo3thman917 3 роки тому +1

    All my programming knowledge is in C++; I even did the calculations for my M.Sc. in Statistics and Operations Research in C++. Now I'm not sure whether to begin with Python or R. Most of my work revolves around numerical analysis.

  • @andrewnguyen3312
    @andrewnguyen3312 11 місяців тому

    I use R, but want to learn python eventually! Thx for this video

  • @linnaco9361
    @linnaco9361 3 роки тому

    I use both as a digital analyst student. R for data cleaning, structure, and manipulation. Python for ML

  • @SL-zn9oh
    @SL-zn9oh 3 роки тому +5

    Hey Alex do you mind doing a video on the impact of automation on the future of the data analyst career? It would be really helpful to those who are on the fence about starting/changing their careers.

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 роки тому +1

      I definitely plan on making a video on AI and automation :)

    • @SL-zn9oh
      @SL-zn9oh 3 роки тому

      @@AlexTheAnalyst Thanks!

    • @idakwiatkowska2862
      @idakwiatkowska2862 3 роки тому

      Great topic! I'm also interested in your thoughts.

  • @nicoleluo6692
    @nicoleluo6692 Рік тому

    When I got to know Python, I immediately threw everything I knew about R into the rubbish bin. LOVE PYTHON!!!!

  • @Harry-mf6rq
    @Harry-mf6rq 2 роки тому +4

    I think the best thing about R is RMarkdown. Being able to hit one button, run my statistical analysis, and output a Word document with all the right numbers and figures in it is amazing for reproducible reporting.
    I'm switching to Python soon. Do you have any recommendations for similar functionality?

  • @scottterry2606
    @scottterry2606 2 роки тому +1

    Without viewing, the answer is ... "the one you know". Software is a tool. It is not the artist.

  • @NgaiChakLo
    @NgaiChakLo Рік тому +1

    R starts vector indexing from 1, not 0. It really did a great job of making programmers ANGRY

  • @TheFootballPlaya
    @TheFootballPlaya 3 роки тому +1

    question: if you had to choose one background to have to work as a data analyst, business or statistics, which one would you choose and why?

  • @novruzhasanov4641
    @novruzhasanov4641 3 роки тому +2

    Dear Alex, thanks for your review. However, no offense, but judging by your arguments, you know very little of R. R objects, and visuals in particular, can easily be embedded in web apps of any kind. Plotly also exists in R and can be combined with ggplot, which is a heck of a boost in flexibility. Next up, the mean of a column in R is as easy as mean(data$column); no need for pipes, dude. So, with all due respect, the fact that you have spent more time on Python does not mean that it's better, and vice versa of course. If R were as lame as you say, Python would not have adopted ggplot and, more remarkably, caret for ML, calling it pycaret (like, seriously).

    • @skillarchiedavis8430
      @skillarchiedavis8430 2 роки тому

      Hey bro. That example was so poor as well, because R can find the mean in one line too. I have used both. R is better for data analysis, but Python is better for data science because it's more versatile. I honestly wouldn't use Python for data analysis, because the majority of the time it requires too much code where R doesn't. That's my opinion tho
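For comparison with the one-line mean(data$column) mentioned in this thread, here is a minimal Python sketch that uses only the standard library. The CSV text, the column name "score", and the helper column_mean are made up for illustration; they do not come from the video.

```python
import csv
import io
from statistics import mean

# Hypothetical in-memory CSV standing in for a real file on disk.
RAW = """name,score
a,10
b,20
c,30
"""


def column_mean(csv_text, column):
    """Return the arithmetic mean of one numeric column of CSV text."""
    rows = csv.DictReader(io.StringIO(csv_text))
    return mean(float(row[column]) for row in rows)


result = column_mean(RAW, "score")
print(result)
```

With a data frame library the call is usually a one-liner in either language, so the length of this particular task says little about which language is better.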

  • @sekhem313
    @sekhem313 3 роки тому +5

    R requires running in a Linux environment to ensure stability & reliability- even for simple flat. Windows creates crap. The disaster will accelerate in the upcoming Windows 12- Microsoft developers can't explain what's the problem. The instability can't be replicated in python, sas, C, Java code. What BS.

  • @tritiyo_noyon
    @tritiyo_noyon 3 роки тому +2

    Do we have something like RStudio for Python?

  • @jakeduddy3257
    @jakeduddy3257 3 роки тому +2

    I like Python more as a language. R really suffers from weak data typing, and programming in it feels more clunky, but %>% is great, plus there's access to Shiny

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 роки тому

      Haha agreed

    • @jakeduddy3257
      @jakeduddy3257 3 роки тому

      Just saw pandas' DataFrame.pipe(); it might even be more concise than pipes in R. Weirdly, passing functions like this feels more R-like
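To make the piping idea in this thread concrete without depending on pandas, here is a rough standard-library sketch. The helper names pipe, drop_missing, and average are hypothetical; this is not how R's %>% or pandas' DataFrame.pipe is implemented, only the same left-to-right idea.

```python
from functools import reduce


def pipe(value, *steps):
    """Pass value through each function in order, left to right."""
    return reduce(lambda acc, step: step(acc), steps, value)


def drop_missing(values):
    """Remove None entries, loosely like dropping NA values."""
    return [v for v in values if v is not None]


def average(values):
    """Arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


# Reads in the order the steps happen: clean first, then summarise.
result = pipe([4, None, 8, 6], drop_missing, average)
print(result)
```

The appeal in both languages is the same: the code reads in the order the data flows, instead of as nested calls read inside-out.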

    • @jerchiury
      @jerchiury 3 роки тому

      I agree: if you are using the code in production, data typing is important. For analysis and reporting, weak data typing is actually time-saving.

    • @dominicj7977
      @dominicj7977 3 роки тому

      @@jerchiury what is data typing?

    • @Reivivus
      @Reivivus 3 роки тому

      @@dominicj7977, strong data typing means a variable has exactly one type (integer, numeric, data frame, matrix, string, file, etc.) and keeps it. Weak data typing means I can say x = 5 and later turn it into the string '5'. With weak data typing I can also define the variable x as NULL (x = NULL) and then do things with the variable once it is not NULL.
      X = 5 # x is a number
      X = append(X, 9) # now X is a list of 5,9
      In strong data typing languages such as C++ you say
      int X = 5
      X can never be anything other than integer.
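A small Python sketch of the behaviour described above, using only built-ins. Note that being able to rebind a name to a value of another type is usually called dynamic typing (as opposed to static typing in C++); Python is dynamically typed, yet it still refuses to mix types silently, as the last step shows. The variable names are illustrative only.

```python
# The same name can be rebound to values of different types at runtime.
x = 5
kinds = [type(x).__name__]

x = [x, 9]  # now a list holding 5 and 9
kinds.append(type(x).__name__)

x = "5"  # now a string
kinds.append(type(x).__name__)

# Python will not coerce str and int implicitly; this raises TypeError.
try:
    "5" + 5
    mixed_ok = True
except TypeError:
    mixed_ok = False

print(kinds, mixed_ok)
```

R behaves similarly for rebinding, which is why both languages feel quick for exploratory work and both benefit from explicit checks in production code.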

  • @vinn0101
    @vinn0101 3 роки тому

    If all you want to do is read a CSV file and see the mean, you could use RStudio and not program anything in either R or Python. Use the right tool for the right job.

  • @deniskk2
    @deniskk2 3 роки тому +3

    Alex, it's been 4 days since I got into the data analyst game. You are my go-to guy, and the story of how you started in this business is real. I am following your classes because they are simple and easy to follow, unlike other classes that are LOOOONG and complicated. Please let me know if you have Instagram; it's easy to communicate there. Please explain to your wife why you need an Instagram account.

    • @AlexTheAnalyst
      @AlexTheAnalyst  3 роки тому

      Haha I’ll look into it 😁 so awesome to hear you’ve enjoyed the channel 👍

    • @davidil2147
      @davidil2147 3 роки тому

      @@AlexTheAnalyst And that's why your wife is on the money about the professor thing xD

  • @junhotan2096
    @junhotan2096 3 роки тому +1

    Why not both

  • @jos3784
    @jos3784 3 роки тому +2

    Can you recommend a R course on Udemy and/or have a promo code for a R course?

    • @veerasekhar8551
      @veerasekhar8551 3 роки тому

      I wouldn't recommend any R course on Udemy

    • @veerasekhar8551
      @veerasekhar8551 3 роки тому

      go for O'REILLY books

    • @gauravmishra8561
      @gauravmishra8561 3 роки тому

      @@veerasekhar8551 Brother, are you from India? What are your thoughts on SPSS?

  • @Ottozone1
    @Ottozone1 3 роки тому +1

    The Python example seems to be a bit cherry picked to show that Python has one function to apply the function mean to all the columns.
    Also, I'd like to know why R is more difficult to maintain?
    I think the pros for R should include ability for Markdown, better visualisation libraries, and piping is intuitive.
    Python pro should include that it's an actual programming language.

    • @bendirval3612
      @bendirval3612 2 роки тому

      I class myself as an R guy, but I do have one problem with R as far as maintainability is concerned: I can't count on it always producing the same answers from the same code. Even on the same version of R, changes in its many packages tend to change my results all the time. And I get slightly different results on different computers. I have spent a lot of time in the last few years trying to overcome this problem but I'm considering sucking it up and porting a lot of my production code to python.

    • @mycrushisachicken
      @mycrushisachicken 2 роки тому

      @@bendirval3612 how tf does that happen?

    • @farnsworthsclasses3523
      @farnsworthsclasses3523 2 роки тому +3

      @@mycrushisachicken R and Python differ in their third-party package systems. Python has many packages, but you really only use 2 or 3 of them for data science. They are each large and aren't updated all that often; they seem to have lots of eyes on them and a reluctance to change rapidly. R has many thousands of packages, all of which do data science or statistics/econometrics. And they are all created, maintained, and updated by their original authors, so the updates go to CRAN (R's central repository of packages) right away, whenever the authors feel like it. You end up using lots more packages in R, and the packages are all written by different people, who may not be overly concerned about the effects of updates on end users. So it's a lot easier to get two systems out of sync in R than in Python. There may also be numerical reasons why you get different answers on one computer versus another in R. I have, many times, had the experience of optimizers getting a slightly different answer on Intel versus AMD in R, despite all my efforts to standardize. I'm not saying this can't happen in Python or MATLAB or whatever, but I haven't seen it as much.
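One general numerical reason for "slightly different answers" can be sketched in a few lines: floating-point addition is not associative, so any library or CPU path that adds the same numbers in a different order can change the last digits. This illustrates the general effect only; it is not a diagnosis of the specific Intel versus AMD case above.

```python
import math

# The same three numbers, grouped in two different ways.
left_to_right = (0.1 + 0.2) + 0.3
right_to_left = 0.1 + (0.2 + 0.3)

# The results differ in the last bits, so exact equality fails.
differs = left_to_right != right_to_left

# Comparing with a tolerance is the usual way to check such results.
close_enough = math.isclose(left_to_right, right_to_left)

print(left_to_right, right_to_left, differs, close_enough)
```

Iterative routines such as optimizers can amplify differences like this, which is one reason to pin package versions and compare results with a tolerance rather than exact equality, in R and Python alike.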

    • @mycrushisachicken
      @mycrushisachicken 2 роки тому

      @@farnsworthsclasses3523 damn thats crazy

  • @nifftbatuff676
    @nifftbatuff676 Рік тому

    I thought R had many issues. Then I tried Python and I found it has the same issues, but worse.