ChatGPT 4o vs Expert Analyst | Data Visualization: Who Does It Better?

Поділитися
Вставка
  • Опубліковано 29 чер 2024
  • Chris Dutton, Power BI expert and founder of Maven Analytics, goes up against OpenAI’s ChatGPT 4o to tackle a data visualization project.
    The task?
    Analyze a data set containing 120 years of Olympic history, and use data visualization to explore how the number of athletes from each country has trended over time.
    Can ChatGPT alone solve this task or does it require the skills of an expert analyst?
    🔗 Olympic data set in Maven’s Data Playground: mavenanalytics.io/data-playgr...
    ⏱️ Timestamps ⏱️
    0:00 Intro
    0:27 Expert Analyst Chris Dutton
    0:35 Data Visualization Task
    0:55 ChatGPT Demo: Analysis
    2:30 ChatGPT Demo: Visualization
    4:37 Power BI Expert Demo: Analysis
    10:27 Power BI Expert Demo: Visualization
    13:10 Power BI Expert Demo: Final Visualization
    14:16 Expert Analyst Thoughts
    16:22 Recap
    17:17 Who Does It Better?
    17:25 Outro
    🗣️ Follow us for more ⬇️
    👉 Follow Chris:
    / csdutton
    👉Follow Alice:
    / alicexzhao
    👉 Follow Maven Analytics:
    / maven-analytics
    At Maven Analytics, we empower everyday people with life-changing data & AI skills.
    Head to mavenanalytics.io and master in-demand tools like Excel, SQL, Power BI, Tableau, Python and more. Design your own personalized learning plan and get started for FREE!
    Looking for more opportunities to sharpen your skills? Explore sample datasets and practice solving real business problems with unique, expert-led guided projects: mavenanalytics.io/guided-proj...

КОМЕНТАРІ • 61

  • @jasonrusso632
    @jasonrusso632 16 днів тому +31

    I'm a believer that a skilled analyst who is comfortable using AI is the best combination for speed and effective analysis. Great head to head challenge!

    • @Chris-at-Maven
      @Chris-at-Maven 16 днів тому +4

      Absolutely. Despite the title, it's really about showing how humans and AI can each play to their strengths to produce the best outcome possible (and also shine a light on some of the MAJOR risks of blindly trusting model outputs)

    • @Ruchikun
      @Ruchikun 12 днів тому +3

      @@Chris-at-Maven and that last sentence is what grinds my gears with all these tech evangelists PRETENDING that everyone can do any job by using ai. It's just creating vast misconceptions and is really harmful tbh.

  • @jbpauler
    @jbpauler 16 днів тому +14

    This is awesome. ChatGPT is decent, but still has a long way to go. I'll take Chris Dutton over the machines all day!!

  • @ramabg2
    @ramabg2 5 днів тому +4

    I would love to see how expert data analyst leverage GenAI to make data analyst process faster and better. Like asking ChatGPT why in certain year Olyompic participan is lower or asking chat GPT to make a list of countries based on regional, continent, developing/ develop.
    That way, some tedious job can be offloaded to GenAI hence the expert can focus more on creative work.

  • @andreag6126
    @andreag6126 16 днів тому +11

    Chris Dutton did it better: he considered multiple events per athlete, the duality between summer and winter games, and historical events that occurred in the twentieth century. Alice Zhao could have guided ChatGPT more using the prompt, but the AI ​​alone, although extremely fast, is not able to grasp the nuances

    • @Chris-at-Maven
      @Chris-at-Maven 16 днів тому

      Thanks for the shoutout!

    • @PatrickBateman12420
      @PatrickBateman12420 7 днів тому

      Indeed, Chris vs. AI 10:1.
      However, Chris definitely used much more time to prepare - which is irrelvant though, as proper analysis often are not a game of speed.

  • @ateekagarwal4545
    @ateekagarwal4545 6 днів тому +2

    Well said… Data analyst/ human can interpret data more granulated manner by analysing what’s the cause of it
    While AI is used as a supplements…
    Highly impressed by AI but at the end we have to actually finding it’s actual RC(root cause) and it’s recommendations according to dynamic market trends.

  • @saifiiui
    @saifiiui 3 дні тому +1

    The slicing of the season dimension was really an important insight in data that chatgpt ignored 😊.
    The main difference between AI and the human Evolutionary aspect is the way humans have feelings. So chatgpt can never replace the human mindset but surely can assist the human generations to come.

  • @cathyfrankland9158
    @cathyfrankland9158 6 днів тому +2

    Fascinating, thank you so much for this interesting example of AI vs. Human. Also, well done, Chris, impressive (as always!).

  • @malcolmvanhilten125
    @malcolmvanhilten125 3 дні тому +1

    I think this excercise was a great experiment. My conclusion is that Ai will most likely take over the role of building dash boards and the tech stuff. It got to the point of the first viz, as the (very senior) analyst, 80% quicker. That meant that the skill of building PowerBi and data tables etc is fast becomming obsolete, which is good news. The analyst was able to demostrate more value (much more) in recognising the distinct count and the change in the games frequency. So in the future if your value add is limited to building dash boards and you dont really understand the data, then your job is very much under threat. Lets be honest if you see data as just dots on a page (exactly like ChatGPT did in this example) then you have little value to offer going forward.

  • @BelgranoCAB19
    @BelgranoCAB19 6 днів тому +2

    I had love this video. Greetings from Argentina!

  • @osPA78
    @osPA78 16 днів тому +2

    This is a great video on how ChatGPT, Copilot, etc. can tee up the data for an analyst to bring it home! Really enjoyed watching this!!!

  • @rselvarajanMBA
    @rselvarajanMBA 5 днів тому +1

    What no one realised is that when you click on the star like icon, Chat GPT shows the actual python code it used to analyse and create the visualisation. It is quite useful and you can modify it to get far more detail results.

  • @AbdifatahGobe
    @AbdifatahGobe 15 днів тому +1

    I enjoyed watching from the beginning. Alice could have done more with the prompts, but GenAI appears to require human guiding. I also considered premium AI versions such as GPT 4o and the use cases for which they were created, and whether they could have provided a much more clear and brief analysis.
    Chris Dutton is at the top of the perk list for data analysis. It would have been appropriate to compare the AI to a typical data analyst. Special thanks to Chris for the insights! We both, me and the AI, have a lot to learn from you.

    • @Chris-at-Maven
      @Chris-at-Maven 15 днів тому

      Well said! We do plan to feature different analysts throughout the series (not just me), some more senior than others.

  • @veselinkobg
    @veselinkobg 10 днів тому +1

    Much respect for the expert analyst, looking to become one someday, but if the initial promt was "How many DISTINCT athletes are from each country, each year?", ChatGPT gives the correct answer

    • @Chris-at-Maven
      @Chris-at-Maven 10 днів тому

      True, and you might even get the correct answer sometimes with the exact prompt in the video. That said, many people (especially non-Analysts) won't know exactly how to frame the question in an optimal way, let alone how to QA the result.

    • @mosemanavan
      @mosemanavan 5 днів тому +1

      @@Chris-at-Maven Well the prompt wasn't good enough to begin with.

  • @DatawithHimanshu
    @DatawithHimanshu 16 днів тому +2

    This is really amazing.

  • @faustopf-.
    @faustopf-. 15 днів тому +1

    Wow! Insane skills and a super interesting topic! Love it

  • @edwinfcapidos
    @edwinfcapidos 13 днів тому +1

    can the gpt connect directly to servers?

  • @edwinfcapidos
    @edwinfcapidos 13 днів тому +1

    I really like PowerBI but my company is leaning towards tableau.

  • @alexrosen8762
    @alexrosen8762 16 днів тому +1

    Can't even compare with Chris solution... Not yet anyway 😄

  • @rupertllavore1731
    @rupertllavore1731 10 днів тому +1

    Interesting Concept! Maybe you should increase the odds By using atleast Some good Prompt engineering skills for Chatgpt

    • @Chris-at-Maven
      @Chris-at-Maven 10 днів тому

      Yeah we could certainly improve the output with more/better prompting (assuming you know what questions to ask). That said, many people still likely wouldn't catch the calculation error unless they specifically asked for validation

    • @rupertllavore1731
      @rupertllavore1731 10 днів тому

      @@Chris-at-Maven Here's an example i used Gpt -4o to analyze "Analyze a data set containing 120 years of Olympic history, and use data visualization to explore how the number of athletes from each country has trended over time." and by applying prompt engineering Chatgpt came up with "Analyze the Prompt
      Understand the Main Task and Objectives:
      The main task is to analyze a data set containing 120 years of Olympic history and use data visualization to explore how the number of athletes from each country has trended over time.
      Key Areas of Focus:
      1. Analyzing the data set to identify the number of athletes from each country in the Olympics.
      2. Utilizing data visualization techniques to illustrate the trends over the 120-year period.
      Generate AI Agent Profile
      AI Name: Olympic Athlete Participation Analyzer
      AI Role: Data Analyst and Visualization Specialist
      AI Objectives:
      1. Analyze the 120-year Olympic history data set to identify the number of athletes from each country.
      2. Create visualizations to showcase the trends in athlete participation over time.
      Define AI Tasks
      Task 1:
      Action 1: Extract and clean the data set to obtain the relevant information on athlete participation from each country.
      Task 2:
      Action 2: Analyze the cleaned data to calculate the number of athletes representing each country in each Olympic year.
      Task 3:
      Action 3: Utilize data visualization tools to create visual representations (such as line graphs or heat maps) that illustrate the trends in athlete participation over the 120-year period.
      Task 4:
      Action 4: Identify any significant patterns or changes in athlete participation and prepare a summary report.
      Execution and Self-Reflection Loop
      Execute Task 1:
      Action: Extract and clean the data set.
      Self-Reflection: Evaluate the accuracy and completeness of the extracted data, refine if necessary.
      Execute Task 2:
      Action: Analyze the cleaned data for athlete participation.
      Self-Reflection: Assess the precision of the analysis, refine if necessary.
      Continue the Loop:
      Execution: Continue executing tasks in sequence.
      Self-Reflection: After each task, evaluate performance and make necessary refinements.
      Completion: Continue the loop until all tasks are refined and objectives are achieved."

    • @rupertllavore1731
      @rupertllavore1731 10 днів тому

      @@Chris-at-Maven What i actually want to know your perspective on how significantly did chatgpt perform im sure this won't be perfect.
      (So here i used Gpt4o to analyze the task)
      "Analyze a data set containing 120 years of Olympic history, and use data visualization to explore how the number of athletes from each country has trended over time."
      (by applying prompt engineering skills Chatgpt has come up with.)
      ""Analyze a data set containing 120 years of Olympic history, and use data visualization to explore how the number of athletes from each country has trended over time."
      AI Agent Prompt Generator Framework
      Given Prompt:
      "Analyze a data set containing 120 years of Olympic history, and use data visualization to explore how the number of athletes from each country has trended over time."
      1. Analyze the Prompt
      Understand the Main Task and Objectives:
      Comprehension: The primary task is to analyze a historical data set of Olympic athletes spanning 120 years.
      Key Objectives:
      Analyze the number of athletes from each country.
      Visualize trends over time.
      Key Areas of Focus:
      Data analysis techniques.
      Data visualization methods.
      Historical trends in Olympic participation by country.
      2. Generate AI Agent Profile
      AI Name:
      Olympic Data Analyst
      AI Role:
      Data Analyst and Visualization Specialist
      AI Objectives:
      Analyze 120 years of Olympic athlete data.
      Identify trends in the number of athletes from each country.
      Create visualizations to represent these trends over time.
      3. Define AI Tasks
      Task 1:
      Action: Collect and preprocess the data set containing 120 years of Olympic history.
      Task 2:
      Action: Clean and normalize the data to ensure consistency and accuracy.
      Task 3:
      Action: Analyze the data to identify the number of athletes from each country for each Olympic event.
      Task 4:
      Action: Generate time-series visualizations to show trends in the number of athletes from each country over time.
      Task 5:
      Action: Interpret the visualizations to draw meaningful insights about the trends.
      4. Execution and Self-Reflection Loop
      Execute Task 1:
      Action: Collect and preprocess the data set.
      Self-Reflection: Evaluate the completeness and quality of the data. Refine the preprocessing steps if necessary.
      Execute Task 2:
      Action: Clean and normalize the data.
      Self-Reflection: Assess the consistency and accuracy of the cleaned data. Make adjustments as needed.
      Execute Task 3:
      Action: Analyze the data to identify the number of athletes from each country.
      Self-Reflection: Review the analysis for accuracy and comprehensiveness. Refine the analysis approach if required.
      Execute Task 4:
      Action: Generate time-series visualizations.
      Self-Reflection: Evaluate the clarity and effectiveness of the visualizations. Improve the visualization techniques if necessary.
      Execute Task 5:
      Action: Interpret the visualizations to draw insights.
      Self-Reflection: Assess the insights for relevance and depth. Refine the interpretation process if needed.
      Continue the Loop:
      Execution: Continue executing tasks in sequence.
      Self-Reflection: After each task, evaluate performance and make necessary refinements.
      Completion: Continue the loop until all tasks are refined and objectives are achieved.
      Implementation Steps
      Analyze the Prompt:
      Break down the prompt to understand the main task and objectives.
      Identify key areas of focus to address the objectives effectively.
      Generate AI Agent Profile:
      Dynamically generate the AI name based on the task.
      Define the AI role according to the task requirements.
      Establish clear and specific objectives for the AI to achieve.
      Define AI Tasks:
      Break down each objective into specific, actionable tasks.
      Ensure each task is clearly defined and aligned with the corresponding objective.
      Execution and Self-Reflection Loop:
      Execute each task sequentially.
      After each task, perform a self-reflection to evaluate performance.
      Refine the approach based on self-reflection before moving to the next task.
      Continue the loop until all tasks are completed and objectives are achieved.
      Example Application
      Given Prompt:
      "Analyze a data set containing 120 years of Olympic history, and use data visualization to explore how the number of athletes from each country has trended over time."
      AI Agent Profile
      AI Name:
      Olympic Data Analyst
      AI Role:
      Data Analyst and Visualization Specialist
      AI Objectives:
      Analyze 120 years of Olympic athlete data.
      Identify trends in the number of athletes from each country.
      Create visualizations to represent these trends over time.
      AI Tasks
      Task 1:
      Action: Collect and preprocess the data set containing 120 years of Olympic history.
      Task 2:
      Action: Clean and normalize the data to ensure consistency and accuracy.
      Task 3:
      Action: Analyze the data to identify the number of athletes from each country for each Olympic event.
      Task 4:
      Action: Generate time-series visualizations to show trends in the number of athletes from each country over time.
      Task 5:
      Action: Interpret the visualizations to draw meaningful insights about the trends.
      Execution and Self-Reflection Loop
      Execute Task 1:
      Action: Collect and preprocess the data set.
      Self-Reflection: Evaluate the completeness and quality of the data. Refine the preprocessing steps if necessary.
      Execute Task 2:
      Action: Clean and normalize the data.
      Self-Reflection: Assess the consistency and accuracy of the cleaned data. Make adjustments as needed.
      Execute Task 3:
      Action: Analyze the data to identify the number of athletes from each country.
      Self-Reflection: Review the analysis for accuracy and comprehensiveness. Refine the analysis approach if required.
      Execute Task 4:
      Action: Generate time-series visualizations.
      Self-Reflection: Evaluate the clarity and effectiveness of the visualizations. Improve the visualization techniques if necessary.
      Execute Task 5:
      Action: Interpret the visualizations to draw insights.
      Self-Reflection: Assess the insights for relevance and depth. Refine the interpretation process if needed.
      Continue the Loop:
      Execution: Continue executing tasks in sequence.
      Self-Reflection: After each task, evaluate performance and make necessary refinements.
      Completion: Continue the loop until all tasks are refined and objectives are achieved.
      Output and Reporting
      Monthly Report:
      Summarize key findings and trends.
      Dashboard:
      Real-time data on the number of athletes from each country over time.
      Alerts:
      Notifications for significant changes or trends in the data.
      Recommendations:
      Insights on historical trends and potential future projections.
      Continuous Improvement
      Refinement:
      Improve data sources and preprocessing techniques for better coverage and accuracy.
      Machine Learning:
      Incorporate machine learning to better identify and visualize trends.
      Expansion:
      Extend analysis to include additional dimensions such as athlete performance, gender, and event types.
      User Feedback:
      Gather feedback to enhance the relevance and usability of outputs.
      This framework provides a structured approach to developing an AI agent capable of analyzing and visualizing historical Olympic data."

  • @user-ox1ud8zn9g
    @user-ox1ud8zn9g 5 днів тому

    First step is to load CSV files in Excel, to check the data? Not like using Python, pandas, some basic EDA?😅 Power BI for an available interactive visual vs static visual from AI seems unfair 😅

    • @Chris-at-Maven
      @Chris-at-Maven 5 днів тому

      Yup we could definitely use Pandas for EDA too, just trying to give viewers a quick look at the data

  • @Funfunny820
    @Funfunny820 16 днів тому +1

    Sir power Bi dax course please

  • @ManhalAlHusami
    @ManhalAlHusami 10 днів тому +2

    That's great

  • @skd
    @skd 5 днів тому +2

    ChatGPT can make 2+2 mistakes so the human always does it better. ChatGPT can say sorry in many ways though.

  • @jakubkopczynski779
    @jakubkopczynski779 10 днів тому

    ChatGPT would probably crash on the datasets I'm working with, sooooooo clean and accessible 😄

  • @prasadagalave9762
    @prasadagalave9762 5 днів тому +1

    👌 best

  • @Hdcqt751
    @Hdcqt751 16 днів тому +2

    One can still ask Chat GPT to breakdown how it made those calculations. which can be corrected if necessary. let alone your high personal level of expertise in Excel and Power BI

    • @Chris-at-Maven
      @Chris-at-Maven 16 днів тому

      Absolutely! With more/better prompting we likely could have gotten to a very similar output, but the point here is to shine a light on some of the strengths and weaknesses of these tools, and why QA and strong foundational skills still matter

  • @abinzacharia
    @abinzacharia 11 днів тому +1

    AI is way more efficient. Look at how long he took to do that task. You could've just promoted chatgpt to do what he did rather than ambiguously saying 'describe the data', the key is to use proper prompts.

    • @Chris-at-Maven
      @Chris-at-Maven 10 днів тому

      Yeah we definitely could have improved the output with more/better prompting, but the goal here is really just to showcase some of the potential risks when you blindly trust AI-generated outputs. And yeah the speed is impressive, but doesn't matter much in cases where precision is critical.

    • @brandoncliffswarts5100
      @brandoncliffswarts5100 8 днів тому

      @@Chris-at-Maven , AI generated vague outputs from vague inputs.
      AI can be more precise when inputs are more precise.

  • @TemiBabalola
    @TemiBabalola 9 годин тому

    The prompts were not comprehensive at all.

  • @meemvideolar8383
    @meemvideolar8383 5 днів тому +1

    I think Claude Sonnet would do much better

    • @Chris-at-Maven
      @Chris-at-Maven 5 днів тому +1

      We’ll be testing more models like Copilot, Gemini and Claude Sonnet soon!

  • @mrbartuss1
    @mrbartuss1 15 днів тому +1

    We're safe

  • @donchichiumelo2762
    @donchichiumelo2762 11 днів тому

    We are indirectly asking openai to make Chatgpt smarter, at the same time telling openai.... " don't take our jobs!"

    • @Chris-at-Maven
      @Chris-at-Maven 10 днів тому +1

      I don't think AI will replace data professionals any time soon, even if they completely nail the technical stuff. Being a great analyst requires a lot of uniquely human skills like creativity, communication, empathy, intuition, etc.

  • @RamonX69
    @RamonX69 8 днів тому +2

    First learn how to write good prompts, and yes LLMs already took many jobs.

  • @yatsuk
    @yatsuk 6 днів тому +1

    Both analysis are not correct.
    The Soviet Union (USSR) was a large country with 15 constituent republics, and it competed in the Olympics as a single entity. This means athletes from all 15 republics could be on the Soviet Olympic team.
    After the USSR dissolved in 1991, the 15 republics became independent countries. Since then, each country sends its own athletes to the Olympics. So, Russia only sends athletes who are citizens of Russia.

    • @Chris-at-Maven
      @Chris-at-Maven 5 днів тому

      Yeah there are a bunch of similar nuances that we would uncover with a bit more time

  • @snipelite94
    @snipelite94 11 днів тому

    AI is a huge security risk to your private data.
    There are newspapers and other corporate entities who are suing an AI company for using their products to train their AI.
    Plus the subscription and monthly costs of AI are too exorbitant for most small and medium businesses.
    Even enterprise companies are penny-pinching now.