Exploratory Data Analysis with Pandas Python

May 20, 2024
413,859 views

In this video about exploratory data analysis with pandas and Python, Kaggle grandmaster Rob Mulla will teach you the basics of how to explore data using Python and pandas. Exploratory data analysis is a necessary tool for any data scientist. Pandas is a MUST for anyone getting into data science with Python. Python is the #1 coding language for data science and has been growing over the years as an essential tool, with pandas being the main data wrangling module. Kaggle Grandmaster Rob goes over it all in this video. In this video we discuss the basics of how to explore data including...
Timestamps:
00:00 Introduction
01:00 Imports and reading data
03:35 Data Understanding
06:40 Data Preparation
20:57 Feature Understanding
27:35 Feature Relationships
35:30 Asking a Question about the Data
40:00 Final Thoughts
Follow me on twitch for live coding streams: / medallionstallion_
Intro to Pandas video: • A Gentle Introduction ...
Link to kaggle notebook used in the tutorial: www.kaggle.com/robikscube/int...
* KZhead: / @robmulla
* Twitch: / medallionstallion_
* Twitter: / medalliondata
* Kaggle: www.kaggle.com/robikscube
#Python #Coding #DataScience #Kaggle

Comments
  • Chapters don't appear to be working for my videos for some reason. Here are the timestamps for the video: 00:00 Introduction 01:00 Imports and reading data 03:35 Data Understanding 06:40 Data Preparation 20:57 Feature Understanding 27:35 Feature Relationships 35:30 Asking a Question about the Data 40:00 Final Thoughts

    @robmulla · 2 years ago
    • amazing video, please can you make available the dataset used? thank you

      @oshogweikekhai5499 · 1 year ago
    • Hi Rob. I'm new to notebooks. Could you please explain why you don't need an explicit print statement to view cell output?

      @Pluvo2for1 · 1 year ago
    • This was one of the bad video and non industry knowledge

      @cocgamingstar6990 · 7 months ago
    • 😮

      @ABYHYDROS · 6 months ago
  • As a beginner in data this really opened my eyes as to how things work. Your explanations are very clear and I can feel how passionate you are. Great video

    @saintsaens3517 · 1 year ago
    • Glad it was helpful! I am passionate about it, and excited when I hear people are learning from my videos.

      @robmulla · 1 year ago
    • So true, though there are many projects and training videos out there. The way you think, the step-by-step approach, and the reasons behind each step are so relatable and feel very natural. Awesome video, thank you so much

      @arunkumark6351 · 11 months ago
  • There are a ton of EDA videos on KZhead. This is one of the best I have ever come across. You just nailed it, Rob.

    @saptarshidey7672 · 9 months ago
    • Thanks so much!

      @robmulla · 9 months ago
  • Lucidly explained. One thing I have learned is that in order to be a great data scientist, what matters is your problem-solving skills, understanding the business requirements, and curiosity to dive deep into data (true to the name data scientist). There is no need to remember this code as long as you know what to look for.

    @darshantawte7435 · 11 days ago
  • This is a great refresher guide! Very nice coding style and I appreciate you using a simple Kaggle dataset to follow along. Great stuff - thanks!

    @nigelkiernan1321 · 2 months ago
  • Hi Rob, this was super useful to me as a tired Excel veteran and python beginner. You explain and demonstrate everything so clearly, thank you

    @Aarron-io3pm · 1 year ago
  • Great channel! Very helpful for beginners and for those who, like me, are digging deeper and moving forward into the DS industry! Thanks Rob!

    @romanrodin5669 · 1 year ago
  • I have tried plenty of tutorials by now. This is the most precise and to-the-point tutorial so far. Well done.

    @silver_soul98 · 10 months ago
  • I did not practise pandas regularly, so I had almost forgotten the syntax and its applications. Now that I have found your video with its very clear instructions, it helps me remember better. Thanks a lot

    @vietndk5437 · 3 days ago
  • I can’t get enough of your videos, especially the very hands-on practical approach to learning. Your explanations are clear and easy to follow along with. Please make more of these types of videos. You are definitely making a difference and contributing to the KZhead knowledge pool. Thank you so much.

    @chrisosomo2856 · 1 year ago
  • This is some of the best content related to Data Analysis and Python/Pandas, I am really glad I found it! Thanks!

    @adamvoltemar420 · 2 months ago
  • This is such an amazing guide! I’m new to data analysis and had limited python exposure and have taught myself most of these things so far by googling or just reading the pandas documentation. Watching someone familiar with the process do it all together was really helpful and gave me a lot of insights as to how I can improve my skills and workflow. Thank you so so much!

    @jackgarn8392 · 1 year ago
  • Great video: informative and fun; easy to follow along. Helped me feel motivated to tackle more Python Pandas. Thanks so much!

    @lindyhopwithliz · 6 months ago
  • Thank you for sharing this, Rob! This is wonderful content. Keep up the good work. Cheers!

    @jcgdt94 · 6 months ago
  • this is absolutely amazing! Following your video step by step actually makes me more confident in my coding!

    @user-et7zv5rs3q · 10 months ago
  • By far one of the most clear and concise ways of teaching in a computer science related field I've come across in a while. I'll be binging all your tutorials for sure!

    @sa-pt3kf · 1 year ago
    • Whoa. I love this feedback. I'll try my best to keep them coming.

      @robmulla · 1 year ago
  • Thanks a lot, you explain concepts like no one, subscribed!

    @9jorge · 10 months ago
  • This video makes me feel glad to be alive. Great explanation, amazingly fast and on point. Thank you!

    @Eysh2009 · 8 months ago
  • Hands down one of the best tutorials I ever saw. Basic enough to follow as a newbie but demanding enough to be useful. ❤

    @itm1996 · 3 months ago
  • Wow ! this is such a clean run through. You make it look so easy and easy to learn ! Thank you so much. This is giving me the confidence to finally start something on my own.

    @SearchingforScraps · 1 year ago
  • thank you so much, you have made my EDA easier and faster. :) also, it's easy to digest as I go along with the data you are working with. thanks a lot. you are helping a lot of analysts and people who want to study data analytics. Great video, keep them coming.

    @jmmj5018 · 5 months ago
  • This is perfect for my interview tomorrow. I just needed a refresher on how to approach the problem, ask right questions and then come up with exploratory options. Thank You so much for this video

    @sheshankjoshi · 1 year ago
  • Thank you for the video. You have combined all my knowledge into one comprehensible picture.

    @user-ei9jd7pw4s · 3 months ago
  • It was fantastic. Every step you took was kind of amazing, especially the last bit where you visualized average coaster speed by location. Thanks.

    @alikakavand3165 · 6 months ago
  • This is GOLD! Thank you

    @MCMMADDOGXCV · 8 months ago
  • Great stuff man, thanks so much for this. You're great at teaching beginners!!!!

    @iiN1GH7M4R3ii · 11 months ago
  • Definitely amazing. Thank you so much, Rob!

    @Dongnanjie · 4 months ago
  • You are the best coach. Thank you, sir!

    @jsplayground241 · 1 year ago
  • Great stuff - thank you, Rob!

    @kmvkmv3433 · 9 months ago
  • Thank you! that was very informative!

    @linux2350 · 1 year ago
  • Pair-plot looked absolutely beautiful!

    @vishwathapa6626 · 9 months ago
  • Great! Thanks a lot for this tutorial, so helpful for me as a beginner!

    @Anarky35 · 1 year ago
  • wow... thank you so much rob. I come from a frontend background but just began a data analytics bachelor at SJSU. I was trying to grasp at a high level what DA might look like as it pertains to conducting an explorative project. This tutorial completely cleared up those questions!

    @Rantalytics · 5 months ago
  • Very well explained and quite nice difficulty level! Brilliant!

    @ricardorockthem3339 · 5 months ago
  • I have watched this more than 5 times. It's a real eye-opener, with step-by-step teaching. Well done Boss

    @Thorne2610 · 6 days ago
  • Clear and applicable to any type of analysis. Thank you

    @aydanlopresti2879 · 7 months ago
  • Really loved seeing the pairplot. Will definitely try this out this week

    @TheMonieray · 12 days ago
  • Great video. Clear explanation! You just earned a new subscriber

    @olusolafatoye9691 · 23 days ago
  • This video is.... PERFECT! Thanks ^^

    @sergiopellitero4136 · 9 months ago
  • Excellent work and introduction. Very well done!

    @rdatta · 9 months ago
  • I dabbled in this 4 years ago at EDX. This is a wonderful refresher. Thanks Rob!

    @tomparatube6506 · 6 months ago
  • Thank you so much. I appreciate the work you put into your videos. It shows.

    @bradleyfrueh2761 · 1 year ago
    • I really appreciate the feedback! Please share with anyone you think might also learn from it.

      @robmulla · 1 year ago
  • Thanks for this lesson. It’s very valuable.

    @guilhermedesanctis · 1 year ago
  • Awesome! Love the method chaining

    @kapamagicman · 1 month ago
  • Thank you Rob for your explanation. Before this it was hard for me to study, and I just felt pressured by the question of how to do EDA in Python. This video opened my mind to studying it!

    @muhammadfadliaktsar7172 · 10 months ago
  • Amazing, Thank you so much, the best tutorial!! :)

    @fabiolasilva6623 · 5 months ago
  • This is very helpful. Thank you, Rob!

    @panneerselvamposangu9929 · 8 months ago
  • Really cool and looks so easy now, thank you Rob

    @mario1ua · 4 months ago
  • Thanks Rob, you’re doing a great job for the data science community. Your videos here and on TikTok are helping me a lot in this journey. Thank you.

    @wahabamin6946 · 1 year ago
    • Love to hear that Wahab! Glad you learned something, and thanks for posting the feedback.

      @robmulla · 1 year ago
  • @Rob Mulla: Excellent video on EDA

    @Al-Ahdal · 9 days ago
  • Just completed it along with coding it all!

    @anishkumaranjan · 5 months ago
  • amazing video, appreciate it a lot!

    @ThePablo505 · 9 months ago
  • Hey Rob, really admired the way you explained complicated topics with ease!! Looking forward to learning from you more :)

    @aayaanhasnain5143 · 1 year ago
    • Thanks so much for that feedback. I really appreciate it.

      @robmulla · 1 year ago
  • This is a really good tutorial. I am new to Python and data analysis, and was completely lost. It was so hard to find a good, reliable source about it. This source just clarifies the basics for beginners so that I can start off with my own project.

    @deneskalnoky7939 · 5 months ago
  • Excellent explanation Rob. Learned a lot from this video. Keep it up.

    @anuragarunedlabadkar8889 · 1 year ago
  • Great video. Look forward to your twitch streams!!

    @hardikacharya2664 · 2 years ago
    • Thanks so much. Hope to see you during one of the twitch streams soon.

      @robmulla · 2 years ago
  • The quality of your content is only surpassed by the ease with which it can be assimilated. Keep up the great content Rob, cheers!

    @wesleyweel8007 · 1 year ago
    • Wow. Thanks for the positive feedback!

      @robmulla · 1 year ago
    • ​@@robmulla bi L😅

      @yjgg5882 · 1 year ago
    • 🎉😊

      @yjgg5882 · 1 year ago
  • Thank you so much. I am happy with your teaching about EDA and data analysis with pandas. You explain things clearly. I can continue my hands-on experience with EDA

    @NeerajKumardeaf · 4 months ago
  • Clear explanation for beginners.. will follow you more for tutorials

    @thaanathaana4522 · 4 months ago
  • This is a brilliant video, helped a lot, thank you!!

    @Dean-nz9ld · 1 month ago
  • I'm impressed! your videos are excellent. Thanks, Rob

    @edusheffer · 8 months ago
  • Perfect stuff what I love about this video is the simplicity and the clearness of the way you talk

    @chess6802 · 1 year ago
    • I appreciate that! That's how I learn best so it's also how I try to explain things.

      @robmulla · 1 year ago
    • @@robmulla your response reflects both your knowledge and your wisdom, please keep it up 💞

      @chess6802 · 1 year ago
  • This was a really nice tutorial, Rob. Had fun coding along, thanks for doing it :)

    @siddhant0701 · 1 year ago
    • Thanks for watching and providing feedback. Feel free to share with anyone else you think might also learn from it.

      @robmulla · 1 year ago
  • well organized, concise, very helpful to get grounded in Pandas. my explorations will continue. Thanks!

    @walterpark8824 · 1 year ago
    • Glad it helped! Thanks for watching. Share with anyone else you think might also learn from it.

      @robmulla · 1 year ago
  • Awesome video! Thanks for putting the time into this. Very helpful

    @mschuer100 · 1 year ago
    • Glad it was helpful! Share with a friend!

      @robmulla · 1 year ago
    • @@robmulla I certainly will. Thanks

      @mschuer100 · 1 year ago
  • this is a wonderful video!

    @user-ki9vt2jc2t · 10 months ago
  • Thank you very much Rob for this wonderful walkthrough and explanation! Really Appreciate it!!!!

    @shihaosun6861 · 1 year ago
    • Thanks for the feedback! Glad to hear you learned something from it.

      @robmulla · 1 year ago
  • An absolute legend, thank you

    @adityagavali3158 · 21 days ago
  • This is the best reference guide. I always find myself rewatching this whenever I'm cleaning a dataset.

    @chrismagee5845 · 1 year ago
    • So glad you find it helpful.

      @robmulla · 1 year ago
  • It has been great to refresh some topics and learn new ones. Thanks a lot :)

    @pdrcouto · 1 year ago
    • Thanks Pedro. So glad you’ve found these as a good refresher.

      @robmulla · 1 year ago
  • This is the second one of these I have now watched and coded along with! Genuinely awesome content, so precise and simple to follow. You make daunting tasks (for beginners getting into data) really accessible which is a sign of a great teacher!

    @JHornsby89 · 1 year ago
    • Comments like this make me really happy that I made this video. So happy it helped you in your coding journey. Did you use the Kaggle notebook when you followed along?

      @robmulla · 1 year ago
  • Great Job Rob. I got vast knowledge about pandas from this video

    @jikarun · 3 months ago
  • - import data
    Data understanding
    - filter columns by need
    - convert dtype of certain columns
    - rename columns
    - check isna in columns and dropna on row or column accordingly
    - locate duplicated rows in single or multiple columns
    - drop duplicated rows from dataset and reset index
    Data prep
    - univariate analysis of features - kde, histogram, box plot
    - use value counts to determine duplicates and unique values in feature
    - he creates bar plot for top 10 years introduced to highest # of coasters
    - he creates histogram to bin speeds of roller coaster and view their frequency distribution
    Feature understanding
    - scatterplot, pairplot, correlation, groupby
    - he creates scatterplot for speed and height with year based hue of points
    - he create pairplot to compare correlation between features, alongside hue from material type
    - creates a correlation heatmap for selected features
    Ask question
    - he uses groupby and query to create bar plot with sorted descending data on mean speed of roller coasters by location.

    @grandselenium296 · 4 months ago
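
The outline in the comment above can be sketched in pandas roughly as follows. This is a minimal sketch, not the notebook from the video: the tiny DataFrame, its column names (`coaster_name`, `speed_mph`, and so on), and the cutoff of two rides per location are all made-up assumptions for illustration. Plotting steps are left out so the block only needs pandas.

```python
import pandas as pd

# Hypothetical stand-in data (not the Kaggle dataset used in the video).
raw = pd.DataFrame(
    {
        "coaster_name": ["A", "B", "B", "C", "D", "E"],
        "Location": ["Park X", "Park Y", "Park Y", "Park X", "Park Z", "Park X"],
        "year_introduced": [1999, 2005, 2005, 2010, 2010, 2015],
        "speed_mph": [50.0, 65.0, 65.0, None, 40.0, 70.0],
        "height_ft": [100.0, 150.0, 150.0, 120.0, 80.0, 200.0],
        "notes": ["x", "y", "y", "z", "w", "v"],  # a column we do not need
    }
)

# Keep only the columns of interest and give them consistent names.
df = raw[
    ["coaster_name", "Location", "year_introduced", "speed_mph", "height_ft"]
].rename(
    columns={
        "coaster_name": "Coaster_Name",
        "year_introduced": "Year_Introduced",
        "speed_mph": "Speed_mph",
        "height_ft": "Height_ft",
    }
)

# Count missing values per column.
missing = df.isna().sum()

# Drop rows that repeat the same coaster, location and year, then reset the index.
df = df.loc[
    ~df.duplicated(subset=["Coaster_Name", "Location", "Year_Introduced"])
].reset_index(drop=True)

# Univariate view: how many coasters were introduced in each year.
year_counts = df["Year_Introduced"].value_counts()

# Feature relationships: pairwise correlation of the numeric columns.
corr = df[["Year_Introduced", "Speed_mph", "Height_ft"]].corr()

# Question: which locations have the fastest coasters on average?
avg_speed = (
    df.groupby("Location")["Speed_mph"]
    .agg(["mean", "count"])
    .query("count >= 2")  # assumed cutoff: skip locations with too few rides
    .sort_values("mean", ascending=False)
)
print(avg_speed)
```

In a notebook, plotting calls such as `year_counts.plot(kind="bar")` would typically follow each step; those need matplotlib installed, which is why they are omitted here.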
  • Your explanation is easy to understand and also shows how things work. Thank you, please make more videos about EDA in Python, Rob!!

    @ilmankhairusidqi9146 · 6 months ago
  • Cheers bro~! M grateful for your tutorial =)

    @terencelim9889 · 9 months ago
  • you are the best teacher ever, I'm not good at English but I am trying to write this down to show my appreciation to you. I'm still waiting for a new lesson on using pandas for Data Analysts

    @Jack-bs2mx · 11 months ago
  • Terrific introductory survey that answered so many of my questions, moving from SQL. Looks extremely efficient. Now, to plug into my data! Thanks.

    @walterpark8824 · 1 year ago
    • Glad you liked it. SQL still has a place but when working with the data for EDA pandas can’t be beat.

      @robmulla · 1 year ago
  • This was the greatest tutorial I ever had. Thank you. Here I got to know about correlation and some pandas functions and plotting. The part on Power counting and correlation between variables was very pleasant and satisfied my expectations. From Bulgaria Volga Sauvete! Thank you! 👑👑👑👑👑👑

    @ZeuSonRed · 8 months ago
  • #1 Data science youtuber!!! You made it easy to understand the basic commands and syntax. Thanks a lot, Rob. 😉

    @rrestituti · 1 year ago
    • Tell all your friends. 😆

      @robmulla · 1 year ago
    • Agreed, this is the best Python content on the entire internet, hands down. I'm going to be carefully watching these videos over and over for a long time.

      @lashlarue7924 · 1 year ago
  • Excellent tutorial and immediately useful. Thank you!

    @zhaozheng7704 · 1 year ago
    • Glad it was helpful! Thanks for watching Zhao.

      @robmulla · 1 year ago
  • your content is pure gold! thank you

    @Davlet · 1 year ago
    • Glad you enjoy it! This comment is gold. 😎

      @robmulla · 1 year ago
  • Very nice explanations , thank you so much!

    @rr2b · 1 month ago
  • Wow so many things are covered, it's a great tutorial for getting started with EDA.

    @soumyadrip · 2 years ago
    • Thanks @somuSan. Glad you liked the tutorial. It took me waaaaay longer to film than I expected but I'm happy with the result. I hope more people in the future find it helpful.

      @robmulla · 2 years ago
  • Thank you for the tutorial, Rob!

    @MrEdinaldolaroque · 1 year ago
    • Thanks for watching and commenting. Share it around if you want to.

      @robmulla · 1 year ago
  • This type of video is amazing to follow, I am starting to use Python for data analysis and I could not be happier! Your channel is helping me a lot, thank you!

    @nunolopes3910 · 1 year ago
    • So happy to hear this. Let me know what you would like to see in future videos.

      @robmulla · 1 year ago
    • @@robmulla One question I have is about the safety of using Jupyter while working with company data. I am just starting to use Jupyter and that is a big question that I'm sure other beginners would like to know about too! Can you give your opinion on it? Thanks in advance

      @nunolopes3910 · 1 year ago
  • Thank you!!! I learned a lot from your course.

    @user-tc9lk6iy2h · 1 year ago
    • Glad you found it helpful!!

      @robmulla · 1 year ago
  • Wow, what an informative fun tutorial. Thanks Rob!

    @alisonhenley2551 · 11 months ago
    • Glad you learned from it and I appreciate the comment.

      @robmulla · 11 months ago
  • This is a great video. So helpful and informative. Thank you.

    @282OJK · 1 year ago
    • Glad it was helpful! Thanks for watching and tell your friends!

      @robmulla · 1 year ago
  • Ohhhh my goodness ❤️‍🔥 what a Quality of contents❤️, Thx for your effort sir

    @abhishekyadav2041 · 1 year ago
  • Way to go, Rob! Excellent!

    @naturfagstoff · 1 year ago
    • Thanks for watching!

      @robmulla · 1 year ago
  • I was smiling at 39:23. How easily you answered the question. Thanks for this amazing video tutorial.

    @ranahuzaifa147 · 1 year ago
    • My pleasure 😊 Glad you liked seeing it all come together at the end.

      @robmulla · 1 year ago
  • Thank you, Jim Halpert

    @captaincal6447 · 4 months ago
  • I have been grinding through your videos lately in preparation for my data science job and you have been an absolute blessing! Thanks a bunch!!

    @adarshtiwari7395 · 1 year ago
    • Wonderful! Glad I could help.

      @robmulla · 1 year ago
    • Hey Adarsh! Other than this what else are you learning that will help you in the data science job, I'm also preparing for the same but kinda new to data science so any guidance would be appreciated. Cheers!

      @hmx21 · 1 year ago
    • @@hmx21 Hi Hemang. I'm a fresher in data science as well. I started with Python and statistics. Then moved on to EDA followed by Machine Learning algorithms. I then made a few projects on ML. Also tools like SQL, Power BI, Excel are preferred

      @adarshtiwari7395 · 1 year ago
    • @@adarshtiwari7395 Hey Adarsh! Thanks for the reply. I'm done with EDA and made a dashboard using Power BI, and don't know how much machine learning or SQL is required for the role, as I've studied SQL in college and know how to work with joins, etc. Any tips or resources you'd like to share would be a great help. Also, where did you learn stats for DS? Whenever I try to learn stats online I get overwhelmed by the sheer number of tutorials.

      @hmx21 · 1 year ago
    • @@hmx21 depends on what you're going for. If you are interested in a data analyst position, EDA through Power BI is great but if you want to go down the data scientist or machine learning route you need to be hands-on with Python. EDA using Python is much more nuanced compared to visualisation tools like Power BI. SQL is essential in all contexts so it's a must. But whether you should study machine learning depends on your career goal.

      @adarshtiwari7395 · 1 year ago
  • Late to the party but this is really really good. Helps you dig into the detail (rather than you thinking, how do I do what I'm thinking I need to do). This should be a template to use, as it is general enough for you to pick up but specific enough, with examples, to be used elsewhere

    @leonrobinson2053 · 8 months ago
  • This is the best video I have watched so far. Thanksss!

    @nyadokuamponsah04 · 1 year ago
    • Thanks so much!

      @robmulla · 1 year ago
  • Great video man. Kudos.

    @arunavamukherjee7549 · 11 months ago
    • Thanks for watching

      @robmulla · 11 months ago
  • very useful tutorial video, 😍 this makes me less scared of learning Python than before thank you for your great work !!

    @ShiNguyenchu · 3 months ago
  • excellent job, I learned a lot from this video, thank you

    @haydercorleoni6128 · 6 months ago
  • This video is AWESOME! Keep it up!

    @alfredoch3811 · 1 year ago
    • So glad you liked it!

      @robmulla · 1 year ago
  • Really great video. Clear and well presented. Thank you.

    @alasdairmunro1953 · 1 year ago
    • Glad you enjoyed it!

      @robmulla · 1 year ago