Working with CSV files | Day 15 | 100 Days of Machine Learning

  • Published 11 Jul 2024
  • The CSV file format is a popular format supported by many machine learning frameworks. The name is variously expanded as "comma-separated values" or "character-separated values."
    A CSV file stores tabular data (numbers and text) in plain text form. A CSV file consists of any number of records, separated by line breaks of some kind. Each record consists of fields, separated by a literal comma. In some regions, the separator might be a semicolon.
    Typically, all records have an identical number of fields, and missing values are represented as nulls or empty strings. There are a number of ways to load a CSV file in Python.
    ============================
    Do you want to learn from me?
    Check my affordable mentorship program at : learnwith.campusx.in/s/store
    ============================
    E-mail us at support@campusx.in
    ⌚Time Stamps⌚
    00:00 - Intro
    00:54 - Process of Gathering Data
    04:45 - Different types of file formats
    05:45 - Code Demo with Jupyter Notebook
    06:35 - Methods to handle CSV files
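
The description notes that there are a number of ways to load a CSV file in Python. A minimal sketch of two common ones follows, using the standard-library csv module and pandas. The sample data and column names are hypothetical, and an in-memory string stands in for a file on disk.

```python
import csv
import io

import pandas as pd

# Hypothetical sample data standing in for a CSV file on disk.
CSV_TEXT = "name,cgpa,placed\nAsha,8.1,yes\nRavi,,no\n"

# Option 1: the standard-library csv module yields one dict per record.
# Every field stays a string, so the missing cgpa is an empty string.
rows = list(csv.DictReader(io.StringIO(CSV_TEXT)))

# Option 2: pandas parses the same text into a DataFrame, infers
# numeric types, and turns the empty cgpa field into NaN.
df = pd.read_csv(io.StringIO(CSV_TEXT))

# A semicolon-separated variant only needs a different `sep`.
semicolon_text = CSV_TEXT.replace(",", ";")
df_semicolon = pd.read_csv(io.StringIO(semicolon_text), sep=";")

print(rows[1])
print(df)
```

With a real file, the io.StringIO(...) argument would simply be replaced by the file path.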

COMMENTS • 112

  • @749srobin
    @749srobin 2 years ago +42

    Brother, you won my heart. I wish I had found your channel back in March.

  • @ashutoshkanojiya7380
    @ashutoshkanojiya7380 1 year ago +8

    31:10 If you want to shorten all the team names, you can use the following code:
    def rename(name):
        # Build an abbreviation from the first letter of each word.
        words = name.split()
        temp = ''
        for word in words:
            temp += word[0]
        return temp

  • @harneksingh960
    @harneksingh960 2 years ago +20

    Literally speaking, your way of teaching is commendable, sir. I learned ML from ZTM, but your course is far more useful and understandable. Warm respect to you, sir.

  • @jinks3669
    @jinks3669 2 years ago +5

    Best channel on YouTube.
    It's channels like yours that make social media worthwhile.
    Thank you, friend.

  • @shamaila3
    @shamaila3 1 month ago +2

    You are doing a great job brother, respect and appreciation from Pakistan👍✊

  • @nihalwaseem7106
    @nihalwaseem7106 10 months ago

    The content was excellent; as a beginner I learned a lot. Thank you sir, we want more ML videos like this 💜

  • @nishitaverma8805
    @nishitaverma8805 2 years ago +21

    Amazing! I was in search of such detailed explanation video and I found this playlist. Thank you for your effort.

  • @yandeshubham8657
    @yandeshubham8657 1 year ago

    One of the best channels for learning ML.
    Thank you so much sir for improving my knowledge. God bless you.

  • @AbdulHannan-dg6dl
    @AbdulHannan-dg6dl 1 year ago +4

    Brother, you are great!!
    Stay happy. I get to learn a lot from you.
    hats off to you
    Love and lots of respect from a student from Pakistan 😍

  • @shariqmeyraan2373
    @shariqmeyraan2373 2 years ago +11

    You are a gem.. Kudos for such amazing content!!

    • @shariqmeyraan2373
      @shariqmeyraan2373 2 years ago

      How can I contact you? I want to be your student!!

  • @user-ni4qi9gf6c
    @user-ni4qi9gf6c 10 months ago

    Sir, your explanation is very clear and at the right pace, with clear, understandable content. Warm respect from me.
    Waiting for the complete deep learning and NLP playlists. Thank you very much.

  • @viditvaish7317
    @viditvaish7317 7 months ago

    Amazing sir, you explained some very fine details. Thank you sir.

  • @zkhan2023
    @zkhan2023 3 years ago +3

    sir, you put all your effort to produce good content keep it up sir

  • @tanveerbashir8393
    @tanveerbashir8393 1 year ago +2

    interesting, informative and amazing lecture

  • @user-bm2fi5wm8m
    @user-bm2fi5wm8m 8 months ago

    Thanks a lot for making this playlist. really appreciated.

  • @DataScience111
    @DataScience111 2 years ago

    Amazing sir. Hats off to you

  • @narendraparmar1631
    @narendraparmar1631 7 months ago

    Very Easy Explanation.
    Thanks sir ji😀

  • @sidindian1982
    @sidindian1982 1 year ago

    one of the best content for ML platform ,.... Sir you are GOD 🙂

  • @swethasree5684
    @swethasree5684 2 years ago

    Very useful video. Thank you sir

  • @vishnujatav6329
    @vishnujatav6329 2 years ago

    Sir, you are a real hero

  • @sanskaripatrick7191
    @sanskaripatrick7191 1 year ago +2

    For anyone confused: skiprows is effectively 1-indexed in use. If you pass 0, it removes the column names, because the parameter treats the header line as row 0.
    So while it is technically 0-indexed, index 0 is the column-names row, which we rarely want to skip, so the data rows start at 1.

    • @abdulaziz-vm7yh
      @abdulaziz-vm7yh 8 months ago

      Yeah, I noticed that too. I don't know why they treat the column names as the 0th index.
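
The skiprows behaviour discussed in this thread can be checked with a small sketch. The sample data below is hypothetical; the point is only that skiprows counts lines of the file from 0, and line 0 is the header line.

```python
import io

import pandas as pd

# Hypothetical data: line 0 is the header, lines 1-3 are data rows.
CSV_TEXT = "team,runs\nA,10\nB,20\nC,30\n"

# Skipping line 0 drops the header, so the first data line ("A,10")
# is promoted to column names.
skipped_header = pd.read_csv(io.StringIO(CSV_TEXT), skiprows=[0])

# Skipping line 1 keeps the header and drops the first data row.
skipped_first_row = pd.read_csv(io.StringIO(CSV_TEXT), skiprows=[1])

print(list(skipped_header.columns))
print(skipped_first_row["team"].tolist())
```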

  • @PyMLHub
    @PyMLHub 1 month ago

    Learning with great understanding with you Sir, I hope this will help me in my next Course of Generative AI.

  • @jawadchaudhry2648
    @jawadchaudhry2648 2 months ago

    You are amazing, sir... God bless you.

  • @RajatKumarhey_sparkerr
    @RajatKumarhey_sparkerr 1 year ago

    Love your contents ❣️

  • @preetisrivastava1624
    @preetisrivastava1624 1 year ago +1

    Sir, you are the best.

  • @prathmeshfutane1247
    @prathmeshfutane1247 1 year ago

    Awesome Explanation
    Javascript Object Notation

  • @akashgawande4118
    @akashgawande4118 2 years ago

    sir you are the best......

  • @shubhamchoudhary5461
    @shubhamchoudhary5461 2 years ago +2

    luckily i got your channel..thanks !!

  • @niteshbutola8753
    @niteshbutola8753 2 years ago

    awesome explanation bro
    keep it up

  • @tahimakhatun3208
    @tahimakhatun3208 2 years ago

    It was very useful. Thanks a lot...

  • @user-tx2rk9uu5b
    @user-tx2rk9uu5b 3 months ago

    I was looking for a fix for the UTF-8 error. Thank you, boss 🙏🙏🙏🙏🙏

  • @sahibnoorsingh2432
    @sahibnoorsingh2432 1 year ago

    Loved your videos, Please make videos on advanced robotics as well

  • @ArunKumar_237
    @ArunKumar_237 1 year ago

    Thank you so much sir 🙏🙏

  • @paragvachhani4643
    @paragvachhani4643 1 year ago

    Really enjoyed it, sir...🥰

  • @vivekyadav-zx1kw
    @vivekyadav-zx1kw 2 years ago

    U just doing great..

  • @Naman_Bansal102
    @Naman_Bansal102 7 months ago

    Very interesting video

  • @MuhammadJunaid-yr8jd
    @MuhammadJunaid-yr8jd 1 year ago

    thank you so much

  • @sameerabanu3115
    @sameerabanu3115 9 months ago

    superb

  • @sandipansarkar9211
    @sandipansarkar9211 1 year ago +1

    finished watching

  • @avinashglpaccountancy999
    @avinashglpaccountancy999 1 year ago

    thank you for showing direction, #ML, #Machine learning,#MACHINE LEARNING, # csv, #HOW TO OPEN CSV IN PANDAS...... ---
    following the path

  • @JustPython
    @JustPython 10 months ago

    very good videos

  • @mahfuzraihan8690
    @mahfuzraihan8690 11 months ago

    love it

  • @manujkumarjoshi9342
    @manujkumarjoshi9342 10 months ago

    Lovely

  • @umangsinghal5403
    @umangsinghal5403 1 year ago

    Really enjoyed it.

  • @monoarul_islam_3
    @monoarul_islam_3 2 years ago

    19:01 If you use 0, it refers to the row that holds the column names, so the actual data rows start from 1. Using 1 onwards resolves the issue.

    • @monoarul_islam_3
      @monoarul_islam_3 2 years ago

      35:26 You need to use chunks instead of chunk.

  • @parikhgaurav2111
    @parikhgaurav2111 1 year ago +2

    Please create a video on data warehouses, NoSQL databases and BigQuery [explain data marts, data lakes, pipelines, etc. in detail] [data engineering in depth].

  • @atharvkazarid2-354
    @atharvkazarid2-354 1 month ago +1

    In the skiprows part, the main data rows start from index 1, not 0. When you did skiprows=[0,5], you were deleting the header row and the 4th row, sir...

  • @x2diaries506
    @x2diaries506 1 year ago

    I have some questions regarding the header. It is associated with the OS and browser. How can we use the same header from different OS and browsers?

  • @UnpluggedSalman
    @UnpluggedSalman 3 months ago +1

    sir can you please share the ipynb file probably with all the dataset link.

  • @aayushisolanki2935
    @aayushisolanki2935 1 year ago

    greattttttt

  • @stevegabrial1106
    @stevegabrial1106 3 years ago

    How do I filter a CSV file for words that mix letters and numbers?
    Can we see GNNs in upcoming videos, with lots of examples?
    Thanks

  • @balkrushnaparkale7094
    @balkrushnaparkale7094 2 years ago

    Thank you so much for the beautiful content.
    But how do I handle a txt file?

  • @Garrick645
    @Garrick645 3 months ago +6

    Where can we find the Jupyter code for this video?

    • @mango-strawberry
      @mango-strawberry 8 days ago +1

      were you able to find it?

    • @mango-strawberry
      @mango-strawberry 8 days ago

      @Garrick645 which link?

    • @Garrick645
      @Garrick645 8 days ago +1

      I'm pasting the link but YouTube is not letting me post the comment

    • @Garrick645
      @Garrick645 8 days ago +1

      @mango-strawberry just see his next video in the series; the link to his GitHub is given in the description box. You'll find the files for all the days over there.

    • @mango-strawberry
      @mango-strawberry 8 days ago

      @Garrick645 thanks, I was able to find it.

  • @rishavchaba3565
    @rishavchaba3565 6 days ago

    At 35:20, how the hell is this not throwing an error?
    I mean, the loop iterates over dfs with the variable "chunk", but "chunks" is used inside the loop.
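
For reference, a minimal sketch of chunked reading follows, with hypothetical data and names. Passing chunksize makes read_csv return an iterator of DataFrames, and the loop variable has to match the name used in the loop body. One possible explanation for the missing error in the video, which is only an assumption since the notebook is not shown here, is that the other name was still defined by an earlier notebook cell.

```python
import io

import pandas as pd

# Hypothetical data: a header plus ten data rows.
CSV_TEXT = "id,value\n" + "".join(f"{i},{i * 2}\n" for i in range(10))

# With chunksize set, read_csv returns an iterator of DataFrames
# instead of a single DataFrame.
dfs = pd.read_csv(io.StringIO(CSV_TEXT), chunksize=4)

# The body uses the same name as the loop variable (`chunk`). A
# mismatched name raises NameError unless it was defined earlier.
shapes = []
for chunk in dfs:
    shapes.append(chunk.shape)

print(shapes)
```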

  • @NickMaverick4
    @NickMaverick4 5 months ago

    Sir, could we instead paste the CSV content into Notepad, open that text file in Excel, and save it as CSV? That way we don't have to write code to open a CSV file from the URL. Please reply: is this the right way to do it, or is writing code necessary?

  • @noorelahi4792
    @noorelahi4792 1 year ago +3

    Why are people spending lakhs of rupees at institutes? I have gone through videos from various institutes, but here I found the real thing. Within a month I'll be a data scientist.

  • @krishnakanthmacherla4431
    @krishnakanthmacherla4431 2 years ago

    Done

  • @sachin2725
    @sachin2725 1 year ago

    Hello sir, XGBoost is not included in the playlist. Could you please make a video on XGBoost?

  • @saumyashah6622
    @saumyashah6622 3 years ago +1

    Sir, can you please share a Google Drive link with the Google Colab notebook covering all the parameters discussed today, along with the datasets?

  • @user-lw6ng1vd7v
    @user-lw6ng1vd7v 8 months ago

    Where do I find my headers, and how do I get the header?

  • @aviralasthana3672
    @aviralasthana3672 1 month ago

    Where can I get the Jupyter notebook?

  • @whatdidilearntoday6369
    @whatdidilearntoday6369 1 year ago +1

    Bro, if I just run pd.read_csv(url), it also works, doesn't it?

  • @SATYAMSINHA-qi5vj
    @SATYAMSINHA-qi5vj 4 months ago +1

    Sir, where will the GitHub code for all of these be available?

    • @mathics2869
      @mathics2869 3 months ago

      If you got the link, please share it with me.

  • @drawing8780
    @drawing8780 1 year ago

    How do I take user input for a CSV file and process it?

  • @kushagra4401
    @kushagra4401 1 year ago

    23:00

  • @madankhatri7727
    @madankhatri7727 9 months ago

    encoding='ISO-8859-1' works for all files
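
A small sketch of what this tip does, using a hypothetical temporary file. ISO-8859-1 maps every possible byte to some character, so decoding never raises an error. That is not the same as decoding correctly: bytes written in another encoding can come out as the wrong characters, as the last lines show.

```python
import os
import tempfile

import pandas as pd

# Hypothetical file content encoded as Windows-1252, not UTF-8.
raw = "text,label\ncafé,ham\n".encode("cp1252")

with tempfile.NamedTemporaryFile(suffix=".csv", delete=False) as handle:
    handle.write(raw)
    path = handle.name

try:
    # The default encoding is UTF-8, which rejects the byte for "é".
    try:
        pd.read_csv(path)
        utf8_failed = False
    except UnicodeDecodeError:
        utf8_failed = True

    # ISO-8859-1 accepts any byte, so this read succeeds.
    df = pd.read_csv(path, encoding="ISO-8859-1")
finally:
    os.remove(path)

# Caveat: success is not correctness. The Windows-1252 euro sign
# becomes a control character when decoded as ISO-8859-1.
euro_as_latin1 = "€".encode("cp1252").decode("ISO-8859-1")

print(utf8_failed, df.loc[0, "text"], repr(euro_as_latin1))
```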

  • @shubhamdhole5160
    @shubhamdhole5160 1 month ago

    Where is the link to the code and datasets? Please help.

  • @prithvigupta8215
    @prithvigupta8215 1 year ago

    25:00

  • @suhaibdata
    @suhaibdata 1 year ago

    amazing.....superb

  • @shahmdnazmulalam9620
    @shahmdnazmulalam9620 1 year ago

    Sir, how do I load the dataset from GitHub, please?

  • @parthpatel473
    @parthpatel473 1 year ago

    HI, SIR HEADER IS SAME WHEN WE OPENING A CSV FILES FROM AN URL.

  • @shiwamkumar5979
    @shiwamkumar5979 3 months ago

    Where is the notebook?

  • @shreyasur1534
    @shreyasur1534 4 months ago +1

    Sir, please share the dataset.

    • @mathics2869
      @mathics2869 3 months ago

      If you got the link, please share it with me.

  • @prithvigupta8215
    @prithvigupta8215 1 year ago

    29:38

  • @23MMA002_IQRAKHAN
    @23MMA002_IQRAKHAN 26 days ago +1

    Where is the GitHub notebook link?

  • @user-iv5fr9mr2n
    @user-iv5fr9mr2n 1 year ago

    11:00

  • @anshukumaria4-115
    @anshukumaria4-115 24 days ago

    Can you please share the GitHub link of this notebook?

  • @user-zc3pg2fj4o
    @user-zc3pg2fj4o 4 months ago

    Can you provide the CSV files?

  • @Headstrongyt
    @Headstrongyt 20 days ago

    Sir, please provide the notebook. Thank you.

  • @satyazigyansu6873
    @satyazigyansu6873 11 months ago

    Sir, I am not able to read the CSV from the URL.

  • @sigma5391
    @sigma5391 1 year ago

    Sir, the aug_train.csv file is not opening.

  • @shubhankarsharma2221
    @shubhankarsharma2221 1 year ago

    error_bad_lines is deprecated; use on_bad_lines instead.
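
A minimal sketch of the replacement parameter, with hypothetical data. In current pandas, on_bad_lines takes over the role of the older error_bad_lines and warn_bad_lines flags; "skip" drops lines that have too many fields instead of raising an error.

```python
import io

import pandas as pd

# Hypothetical data: the third line has one field too many.
CSV_TEXT = "a,b\n1,2\n3,4,5\n6,7\n"

# The default (on_bad_lines="error") would raise a ParserError here.
# "skip" silently drops the malformed line; "warn" would also report it.
df = pd.read_csv(io.StringIO(CSV_TEXT), on_bad_lines="skip")

print(df)
```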

  • @SauravSharma-c9o
    @SauravSharma-c9o 3 days ago

    Where is the link to the notebook?

  • @vinodbarti9237
    @vinodbarti9237 4 months ago

    import pandas as pd
    import numpy as np
    pd.read_csv('placement.csv', na_values=['cgpa'])
    Sir, this isn't working.

    • @iamsachinnegi
      @iamsachinnegi 1 month ago

      What are you trying to do here? You should give the file name and then run it.
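
One possible reason the call in this thread has no visible effect, offered as an assumption since the actual file is not shown: na_values expects the cell contents that should count as missing, not a column name. A sketch with hypothetical data and placeholder markers:

```python
import io

import pandas as pd

# Hypothetical data where "-" and "missing" mark absent values.
CSV_TEXT = "name,cgpa\nAsha,8.1\nRavi,-\nMeena,missing\n"

# na_values lists cell contents to treat as NaN, not column names.
df = pd.read_csv(io.StringIO(CSV_TEXT), na_values=["-", "missing"])

# A dict limits the markers to specific columns.
df_per_column = pd.read_csv(
    io.StringIO(CSV_TEXT),
    na_values={"cgpa": ["-", "missing"]},
)

print(df)
```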

  • @harshshukla980
    @harshshukla980 1 month ago

    Did you forget to add the GitHub link?

  • @rajchoure03
    @rajchoure03 4 months ago

    Sir, please share the source code for this tutorial.

  • @quancume
    @quancume 1 month ago

    who noticed ' hyd trip ' 🤚🤣

  • @alokdhiman755
    @alokdhiman755 2 years ago

    Javascript object notation

  • @deeptisharma90
    @deeptisharma90 3 months ago

    At 17:32: the squeeze parameter of read_csv is no longer supported. The updated version is:
    pd.read_csv('aug_train.csv', usecols=['gender']).squeeze()
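
The same idea as a self-contained sketch, with hypothetical data in place of aug_train.csv. Calling squeeze("columns") on a one-column DataFrame returns a Series, which is what the old squeeze=True argument used to do.

```python
import io

import pandas as pd

# Hypothetical stand-in for the dataset used in the video.
CSV_TEXT = "gender,city\nMale,A\nFemale,B\n"

# Read a single column, then collapse the one-column frame to a Series.
one_column = pd.read_csv(io.StringIO(CSV_TEXT), usecols=["gender"])
series = one_column.squeeze("columns")

print(type(series).__name__, series.tolist())
```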

  • @sasta-koder
    @sasta-koder 1 year ago

    Hi, here is a code snippet that can also be used to detect a file's encoding:
    Code:
    import chardet
    # Sample the first 100,000 bytes and let chardet guess the encoding.
    with open('spam.csv', 'rb') as rawdata:
        result = chardet.detect(rawdata.read(100000))
    result
    Output:
    {'encoding': 'Windows-1252', 'confidence': 0.7272080023536335, 'language': ''}
    So the encoding here is Windows-1252.
