Working with CSV files | Day 15 | 100 Days of Machine Learning
Вставка
- Опубліковано 11 лип 2024
- The CSV file format is a popular format supported by many machine learning frameworks. The format is variously referred to "comma-separated values" or "character-separated values."
A CSV file stores tabular data (numbers and text) in plain text form. A CSV file consists of any number of records, separated by line breaks of some kind. Each record consists of fields, separated by a literal comma. In some regions, the separator might be a semi-colon.
Typically, all records have an identical number of fields, and missing values are represented as nulls or empty strings. There are a number of ways to load a CSV file in Python.
============================
Do you want to learn from me?
Check my affordable mentorship program at : learnwith.campusx.in/s/store
============================
📱 Grow with us:
CampusX' LinkedIn: / campusx-official
CampusX on Instagram for daily tips: / campusx.official
My LinkedIn: / nitish-singh-03412789
Discord: / discord
Instagram: / campusx.official
E-mail us at support@campusx.in
⌚Time Stamps⌚
00:00 - Intro
00:54 - Process of Gathering Data
04:45 - Different types of file formats
05:45 - Code Demo with Jupyter Notebook
06:35 - Methods to handle CSV files
bhai dil jeet liya tune , kash march mein hi mil jata tera channel
31:10 If you want to shorten all the team names, you can use the following code:
def rename(name):
list1 = name.split(" ")
temp = ''
for i in range(len(list1)):
temp += list1[i][0]
return temp
Literally Speaking. the way of teaching by u Sir is appreciable, I have learned ML from ZTM but your course is enormously so much useful and understandable.....Warmed Respect for u Sir
Best channel on youtube.
It's channel's like yours that make social media worthwhile.
Dhanyavaad dost
You are doing a great job brother, respect and appreciation from Pakistan👍✊
Bahut badiya content tha, as a beginner bahut kuch seekhne ko mila. Thank you sir , aise hee ML videos aur b chahiye 💜
Amazing! I was in search of such detailed explanation video and I found this playlist. Thank you for your effort.
metoo
One of the best channel for learning ML
Thank u so much sir for improve my knowledge. God bless u
Bhai ap great ho !!
khush rho.. bht kch seekhne ko milta hai ap sy
hats off to you
Love and lots of respect from a student from Pakistan 😍
You are a gem.. Kudos for such amazing content!!
How can I contact you? I want to be your student!!
sir your explanation is very clear and in right pace, clear understandable content and warmed respect from me
waiting for deep learning and nlp complete playlist thankyou very much.
amazing sir ,bahut zayda baareek cheeze explain kari hai aapne thanqu sir
sir, you put all your effort to produce good content keep it up sir
interesting, informative and amazing lecture
Thanks a lot for making this playlist. really appreciated.
Amazing sir. Hats off to you
Very Easy Explanation.
Thanks sir ji😀
one of the best content for ML platform ,.... Sir you are GOD 🙂
Very useful video. Thank you sir
Sir, you are a real hero
For anyone confused, Skiprows is a parameter which is 1-indexed, if you pass in 0 then it will actually remove the column names since the parameter treats the column names as the 0th row.
So in a sense, while it technically is 0-indexed, it considers the column names as the 0th index and so we have no use of using that. This makes it 1-indexed in use.
yeah even i did noticed that, i dont know why they consider column name as 0th index
Learning with great understanding with you Sir, I hope this will help me in my next Course of Generative AI.
Kamaal ho Sir g ap ... God Bless u
Love your contents ❣️
Sir aap best ho
Awesome Explanation
Javascript Object Notation
sir you are the best......
luckily i got your channel..thanks !!
awesome explanation bro
keep it up
It was very usefull. thanks a lot...
i was looking for UTF-8 error .thank you boss🙏🙏🙏🙏🙏
Loved your videos, Please make videos on advanced robotics as well
Thank you so much sir 🙏🙏
Maja aagya sir...🥰
U just doing great..
Very Intereseting Video
thank you so much
superb
finished watching
thank you for showing direction, #ML, #Machine learning,#MACHINE LEARNING, # csv, #HOW TO OPEN CSV IN PANDAS...... ---
following the path
very good videos
love it
Lovely
Maza aa gya
19.01. If you use 0 then it will refer to the row which incorporates the column names. So actual rows start from 1 onwards. And this will resolve the issue.
35.26 you need to use chunks instead of chunk.
create a video on data warehouse , no sql data bases and big queary . [ explain data marts , data lakes, pipelines and etc in details ] [data engineering in depth]
in skiprow part the main rows of data are starting from index 1 not 0 , when u was doing 'skiprow[0,5]' u was deleting the header row and the 4th row sir...
I have some questions regarding the header. It is associated with the OS and browser. How can we use the same header from different OS and browsers?
sir can you please share the ipynb file probably with all the dataset link.
greattttttt
How to filter csv file in which words with mixing of aplha and numbers.
Can we see GNN in upcoming videos with lot of examples.
Thax
Thank you so much for beautiful content.
But how to handle txt file?
Where can we find the jupyter code for this video
were you able to find it?
@@Garrick645 which link?
I'm pasting the link but UA-cam is not letting me post the comment
@@mango-strawberry just see his next video in the series, the link to his GitHub is given in the description box. You'll find all the days files over there.
@@Garrick645 thanks I was able to find it.
At 35:20 how the hell is this not throwing an error ?
i mean "chunks" in dfs and inside loop there is variable "chunk"
Sir can we also do that we can paste that CSV file in notepad and then, open that text file in Excel and save it as CSV. By this we don't have to write code for opening a CSV file from the url.... Please reply is the right way to do.. or writing code is necessary..
Why people are spending lakhs of rupees in the institute? I have been going through various institute videos but now here i got the real thing within a month I'll be the data scientist.
Did you become one?
Did you become data scientist or data analyst
Done
Hello Sir, XGBoost is not included in playlist, could you please make a video on XGBoost ?
sir can you please share google drive link which contains google colab of examples of all the parameters which we discussed today along with the datasets
Check description
Where i find my headers and how to get the header??
Where to get the jupyter notebook
Bro , if i just run pd.read_csv(url). Then also it works. isnt it?
Yes
inn sab ke github codes kaha available honge sir ?
If u got the link pls share with me
How to take User input for CSV file and process it
23:00
encoding='ISO-8859-1' works for all file
where is link of codes and datasets? plz help
25:00
amazing.....superb
Sir, How do I load the dataset from the github. please ....
HI, SIR HEADER IS SAME WHEN WE OPENING A CSV FILES FROM AN URL.
Notebook kha hai?
sir dataset share kar dijiye pls
If u got the link pls share with me
29:38
kahan hai github ka nb link?
11:00
can you please share github link of this notebook?
can you provide the csv files
Sir, please provide notebook. Thank u.
ir url se csv read nehin ho pa raha hai
sir aug_train.csv file khul ni rahi hai
erorr bad line deprecated to on_"bad_line
where is the link of notebook ?
have you got it ?
import pandas as pd
import numpy as np
pd.read_csv('placement.csv', na_values=['cgpa'])
sir this isnt working
What thinking u performing here u should give name to file then run ..
github link bhul gaye kya daalna?
sir please share the source code for this tutorial
who noticed ' hyd trip ' 🤚🤣
Javascript object notation
at 17:32 squeeze is no longer supported that way. updated version is like this -
pd.read_csv('aug_train.csv',usecols = ['gender']).squeeze()
Hi, here is a code snippet which can also be used to get encoding info :
code:
import chardet
with open('spam.csv', 'rb') as rawdata:
result = chardet.detect(rawdata.read(100000))
result
O/p:
{'encoding': 'Windows-1252', 'confidence': 0.7272080023536335, 'language': ''}
here this is encoding: Windows-1252
Amazing sir. Hats off to you