Complete Python Pandas Data Science Tutorial! (Reading CSV/Excel files, Sorting, Filtering, Groupby)

2024 ж. 21 Мам.
3 076 917 Рет қаралды

Practice your Python Pandas data science skills with problems on StrataScratch!
stratascratch.com/?via=keith
Data & code used in this Tutorial: github.com/KeithGalli/pandas
Python Pandas Documentation: pandas.pydata.org/pandas-docs/...
Let me know if you have any questions!
In this video we walk through many of the fundamental concepts to use the Python Pandas Data Science Library. We start off by installing pandas and loading in an example csv. We then look at different ways to read the data. Read a column, rows, specific cell, etc. Also ways to read data based on conditioning. We then move into some more advanced ways to sort & filter data. We look at making conditional changes to our data. We also start doing aggregate stats using the groupby function. We finished the video talking about how you would work with a very large dataset (many gigabytes)
I realized as I upload this video there are some additional things I want to talk about in a later video. The first thing that comes to mind immediately is using the apply() function on a dataframe to alter the data using a custom or lambda function. If you have questions on this or anything else before I get around to making a part 2, feel free to write me a note in the comments.
If you enjoyed this video, be sure to throw it a like and make sure to subscribe to not miss any future videos!
Thanks for watching friends! Happy coding! :)
Join the Python Army to get access to perks!
KZhead - / @keithgalli
Patreon - / keithgalli
---------------------------------------------
Follow me on social media!
Instagram | / keithgalli
Twitter | / keithgalli
---------------------------------------------
Link to original source of data from Kaggle: www.kaggle.com/abcsds/pokemon
---------------------------------------------
Video Outline!
0:00 - Why Pandas?
1:46 - Installing Pandas
2:03 - Getting the data used in this video
3:50 - Loading the data into Pandas (CSVs, Excel, TXTs, etc.)
8:49 - Reading Data (Getting Rows, Columns, Cells, Headers, etc.)
13:10 - Iterate through each Row
14:11 - Getting rows based on a specific condition
15:47 - High Level description of your data (min, max, mean, std dev, etc.)
16:24 - Sorting Values (Alphabetically, Numerically)
18:19 - Making Changes to the DataFrame
18:56 - Adding a column
21:22 - Deleting a column
22:14 - Summing Multiple Columns to Create new Column.
24:14 - Rearranging columns
28:06 - Saving our Data (CSV, Excel, TXT, etc.)
31:47 - Filtering Data (based on multiple conditions)
35:40 - Reset Index
37:41 - Regex Filtering (filter based on textual patterns)
43:08 - Conditional Changes
47:57 - Aggregate Statistics using Groupby (Sum, Mean, Counting)
54:53 - Working with large amounts of data (setting chunksize)
-------------------------
If you are curious to learn how I make my tutorials, check out this video: • How to Make a High Qua...
*I use affiliate links on the products that I recommend. I may earn a purchase commission or a referral bonus from the usage of these links.

Пікірлер
  • Hey ya'll! I created a second channel with more Python content (including additional Pandas tips & tricks). Please consider subscribing 😊 kzhead.info

    @KeithGalli@KeithGalli3 жыл бұрын
    • You cleverly edited the code between 25:50 to 25:59 list(df.columns.values) to list(df.columns)😉👍

      @sam7250ii@sam7250ii3 жыл бұрын
    • Error:Cannot mask with non-boolean array containing NA / NaN values - gives me error when usinf df.loc (on 40:49 in video)? df.loc[df['Our Global Company'].str.contains('Smith', regex=True)]: this is code, I imported another .xlsx table when practising.

      @Vribejs@Vribejs2 жыл бұрын
    • @@Vribejs go google it... you can't expect him to do it for you. He checked the documentation just to give us a good overview of pandas.... google out your error if not you will not learn.

      @yidizhou9899@yidizhou98992 жыл бұрын
    • i have been working on a excelworkbook having 8 worksheet and i m performing operations on data nd want to place dataframe in the 6 sheet in place of its data .but everytime i do all other sheets gets vanished nd a single gets get formed with the dataframe .plzz help me in appending df into an existing excel

      @chiraggupta1897@chiraggupta18972 жыл бұрын
    • Hey Keith , can can please help me to download the csv.file on an android tablet. sorry for bad english.

      @benten5018@benten50182 жыл бұрын
  • I know this is 5 years old but I learned more about using Pandas from this one video than all the other videos ive watched on the topic combined! Just awesome! Thank you!

    @jcspaziano@jcspaziano3 ай бұрын
    • Glad that it is still helpful!!

      @KeithGalli@KeithGalli3 ай бұрын
  • This video was super helpful, thank you Keith! In case anyone gets to the end of this video, around 48:00, Keith talks about the groupby operator and starts to go over the section "Aggregate Statistic using Groupby (Sum, Mean Counting)". You might run into errors due something that changed after Pandas version 2.0.0. Instead of writing: df.groupby(["Type 1"]).mean() Try writing: df.groupby(["Type 1"]).mean(numeric_only=True) After version 2.0.0 the numeric_only value was changed to False versus True as it's default, causing errors such as "can not convert strings". Hope this is helpful, have a good one!

    @_Nelyen@_Nelyen6 ай бұрын
    • Thank you very much, I ran into the problem, this is really helpful! :)

      @ljubicabrenjo2794@ljubicabrenjo27946 ай бұрын
    • Thank you

      @mayowafele9691@mayowafele96915 ай бұрын
    • Thanks Man .

      @NachtHere@NachtHere4 ай бұрын
    • Thanks Dude

      @mrme2120@mrme21204 ай бұрын
    • Was facing the same issue, thanks a lot.

      @rajdevanshu65@rajdevanshu654 ай бұрын
  • This 1 hour video did more for me than entire semester of my Data Analysis course... Amazing

    @RisingLoaf@RisingLoaf Жыл бұрын
    • SAME DUDE omg

      @juliachristiaanse2985@juliachristiaanse2985 Жыл бұрын
    • me too!

      @brahyamalmonteruiz9984@brahyamalmonteruiz9984 Жыл бұрын
    • Same here bro

      @ahtesamalamdxb@ahtesamalamdxb Жыл бұрын
    • Were you paying attention?

      @mrman880@mrman880 Жыл бұрын
    • am a self taught but this one saved me

      @uchindamiphiri1381@uchindamiphiri138111 ай бұрын
  • A strugling biologist here thanks you! We are mostly dealing with big data and it can get a little overwhelming, but you made it a lot easier!

    @nikithroumpari2553@nikithroumpari25532 жыл бұрын
    • Awesome

      @goSomewhereElse@goSomewhereElse2 жыл бұрын
    • big data in a csv file? lol

      @jankucera8505@jankucera8505 Жыл бұрын
    • Hey, you might enjoy SAS

      @mfundokhumalo3933@mfundokhumalo393310 ай бұрын
    • they dont lie when they say data is everywhere and every field needs data scientists

      @manan-543@manan-5438 ай бұрын
  • Best Pandas tutorial on KZhead, especially 24:25

    @shayonghoshroy7208@shayonghoshroy72084 жыл бұрын
    • hahahahaa

      @evangelosspyromilios5994@evangelosspyromilios59943 жыл бұрын
    • LOL

      @cr2093@cr20933 жыл бұрын
    • jaajajajaaj

      @fernandourrutia2566@fernandourrutia25663 жыл бұрын
    • Huge hhhhh

      @arielsfitness7073@arielsfitness70732 жыл бұрын
    • 29:19 is where he's getting texts from actual pandas

      @pappymu@pappymu2 жыл бұрын
  • Mannnn your one of the best Python go-tos PERIOD. Straight to the point and easy to understand. thanks for teaching us all!

    @Orion3000k@Orion3000k3 жыл бұрын
  • Hi Keith - not sure you will read this but wanted to sincerely thank you for this tutorial. 3 years ago this was the first python video I ever watched after graduating from unrelated subject. Today I'm typing this from a business class lounge at JFK, on my way to London where I just got a job as a quant developer at a hedge fund, building pricing models and infra for trading. Worked hard for this but if not for your videos I could be at a very different place. Thank you from the bottom of my heart, your work means a lot to many people. Cheers!

    @piotr5830@piotr583010 ай бұрын
    • Bro hire me

      @Eizengoldt@Eizengoldt7 ай бұрын
    • can you tell us more about your journey? :)

      @mydailystruggles@mydailystruggles2 күн бұрын
  • Keith You are more than a teacher. Your level of simplicity in explaining Python in details is out of the moon. Keep up the good work. Your video is always my “go to” any time. Again, thanks a lot for using your skills as a blessing to people around the world.

    @dicspringdkz8234@dicspringdkz82342 жыл бұрын
  • One of the best tutorial that I've ever seen in KZhead! Thumbs UP!

    @LureUnitFtw@LureUnitFtw5 жыл бұрын
  • Excellent!! I like the way you organize the videos on different topics and functions of working with data. Please make more videos on how to work data science in Python. E.g. Statistical analysis (descriptive statistics, t-test, linear regression) or data processing tutorial (like what we do in SQL).

    @klauscheang7063@klauscheang70634 жыл бұрын
  • Wasted an hour watching a completely useless video on pandas, didn't understand a thing...... Then found this pure gold of a video, it really helped me a lot. Why didn't I click it earlier............

    @not_proton@not_proton3 жыл бұрын
    • lol you had me in the first half 😂

      @KeithGalli@KeithGalli3 жыл бұрын
    • glad it helped!

      @KeithGalli@KeithGalli3 жыл бұрын
    • @@KeithGalli yeah, really nice job explaining it Currently watching the other pandas video (real life problems)

      @not_proton@not_proton3 жыл бұрын
  • Great video! One of the best pandas tutorials I've seen. I have one comment though. When you run (at 40:00) df.loc[df['Name'].str.contains('Mega')]) You are actually including Meganium in this filter, even though it is not a Mega pokemon. So, one needs to include a space after Mega, such as: df.loc[df['Name'].str.contains('Mega ')]) One can see that this makes a difference because when you run len(df.loc[df['Name'].str.contains('Mega')])) and len(df.loc[df['Name'].str.contains('Mega ')])), to know the number of rows, there are two distinct outputs (respectively 49 and 48)

    @prubin18@prubin182 жыл бұрын
  • this is an excellent tutorial, especially the filtering/conditional changes section. I have always loved how google sheets has built in queries, and I wanted to be able to do a lot of the same things using pandas. This essentially gave me all of the power I needed! thanks!

    @nikluz3807@nikluz38074 жыл бұрын
  • Awesome tutorial! One advice I'd have for any python developers is to get in practice of working within virtual environments. Really helps to avoid conflicts when you're working on a project which may require some older versions of a library but your other projects may require latest ones, stuff like that.

    @bentrash7885@bentrash78854 жыл бұрын
  • I have bought multiple Udemy courses on pandas and this one blows them all out of the water, and it’s free! I’m deff subbing!

    @hughjazz8416@hughjazz84163 жыл бұрын
  • I haven't started this yet, but based on your previous videos I know this is going to be great. Thanks Keith, you are a great teacher.

    @brandongarza1366@brandongarza13662 жыл бұрын
  • Video Outline! 0:45 - Why Pandas? 1:46 - Installing Pandas 2:03 - Getting the data used in this video 3:50 - Loading the data into Pandas (CSVs, Excel, TXTs, etc.) 8:49 - Reading Data (Getting Rows, Columns, Cells, Headers, etc.) 13:10 - Iterate through each Row 14:11 - Getting rows based on a specific condition 15:47 - High Level description of your data (min, max, mean, std dev, etc.) 16:24 - Sorting Values (Alphabetically, Numerically) 18:19 - Making Changes to the DataFrame 18:56 - Adding a column 21:22 - Deleting a column 22:14 - Summing Multiple Columns to Create new Column. 24:14 - Rearranging columns 28:06 - Saving our Data (CSV, Excel, TXT, etc.) 31:47 - Filtering Data (based on multiple conditions) 35:40 - Reset Index 37:41 - Regex Filtering (filter based on textual patterns) 43:08 - Conditional Changes 47:57 - Aggregate Statistics using Groupby (Sum, Mean, Counting) 54:53 - Working with large amounts of data (setting chunksize) Thanks for watching friends! :) Let me know if you have any questions

    @KeithGalli@KeithGalli5 жыл бұрын
    • YES!!! THANK YOU!

      @dtran288@dtran2885 жыл бұрын
    • Thank you so much for posting this! I have a test in Python soon, so I've been watching this for a review. You explain everything so well and make it easy to follow. I also like how the data was from Pokémon - it makes it more relatable.

      @shadow2frost325@shadow2frost3255 жыл бұрын
    • great tutorial

      @dchitan1234@dchitan12344 жыл бұрын
    • A reference notes to help you while you watch the video. docs.google.com/document/d/16qcfjwLp1vV-5VnIOGuDC2vxkHQ534_RzQd2Gihk7x8/edit?usp=sharing

      @tejasnareshsuvarna7948@tejasnareshsuvarna79484 жыл бұрын
    • Hey dude, love this video by the way but I have a question, can this data be used for machine learning? I have my exams coming up where I have to find a dataset to make predictions and stuff. Are these pokemon cards, do they have label and features if you understand what i'm talking about? Any help would be greatly appreaciated. Thanks in advance.

      @Tropax1@Tropax14 жыл бұрын
  • This video helped my massively! Been learning through online python courses with people trying to act and saying unnatural jokes, but your video felt super natural and easy to watch. Many thanks!

    @MiguelMusic123@MiguelMusic1234 жыл бұрын
  • There is something to the way Keith teaches that keeps me coming back. Besides being a good teacher and utilizing techniques which help people grasp the material quickly and remember for long time, he sends forth a wave of positivism. He is such a positive, energetic person. Thanks for sharing your knowledge. May it grow and enable you to bless more people with it.

    @faizalimuhammadzoda4731@faizalimuhammadzoda47312 жыл бұрын
  • I watched the entire video in 30 minutes and learned more than I did with hours of video content. Amazing work.

    @adedokunagunbiade5324@adedokunagunbiade5324 Жыл бұрын
  • I can't believe I watched this for free, thank you so much!

    @pivo6499@pivo64995 жыл бұрын
    • This was pretty good. I would also check udemy or r/learnpython for other free resources. Found a 30 hour FREE pandas course there the other day

      @johnwiley1221@johnwiley12214 жыл бұрын
    • www.udemy.com/course/the-ultimate-pandas-bootcamp-advanced-python-data-analysis/?couponCode=FF041817B54B4BC9EB6B

      @johnwiley1221@johnwiley12214 жыл бұрын
    • @@johnwiley1221 It's not free now, unfortunately :(

      @quartercast@quartercast3 жыл бұрын
    • ki

      @musclemusic123@musclemusic1233 жыл бұрын
    • The documentation is also free.

      @shambhav9534@shambhav95343 жыл бұрын
  • Dude you deserved all the subs for this video alone. You explained everything so good. keep it up :)

    @nimaonta1725@nimaonta17253 жыл бұрын
  • Started my PhD in hydrogeology and learning Python from the scratch. I love your work, keep it up!

    @crtnnn@crtnnn Жыл бұрын
  • 53:30 you can use .size() to get the count of each Pokemon type instead of adding a new column. It would look like this: df.groupby(['Type 1']).size() Great tutorial!!

    @gustinelimurilo@gustinelimurilo3 жыл бұрын
  • Coming from the R environment, I must say this is an excellent tutorial to learn about Pandas. I'm very happy to learn that the tools I use in R for data management can be implemented in a similar way in Python. Thanks for taking the time to put this together! Great job.

    @rutzyco@rutzyco3 жыл бұрын
    • Same here

      @konata_fan@konata_fan2 жыл бұрын
    • I agree - coming to Python from RStudio and after looking at videos all day this is definitely the most helpful and intuitive video!

      @bretfolger631@bretfolger631 Жыл бұрын
    • sometimes the syntax may be getting confused for python and r right? if you use both

      @ratansharma8026@ratansharma8026 Жыл бұрын
    • can someone tell me why is r so encouraged in the data science/analysis circle when python can do everything and more and it is so intuitive

      @manan-543@manan-5438 ай бұрын
    • @@manan-543 I think Python is far more general and overall can do a lot more, but in my field, packages associated with statistical models are far more abundant in R than in Python. For example, I'm not sure Python comes even close to R for the implementation of Bayesian hierarchical models, GLMMs, GAMMs, etc. Also, methods papers often publish packages in R, so it seems to remain the default for statistics. Until the statisticians start switching in large numbers I'm not sure this is gonna change anytime soon; and when it does, it probably will be Julia, not Python.

      @rutzyco@rutzyco8 ай бұрын
  • Between jobs for the first time in decades I wanted to learn data science using software other than just Excel and Access. Your video was well explained and frankly better than anything else I have seen so far involving Python and Pandas. Thank you for a job well done.

    @AndrewMann205@AndrewMann2055 жыл бұрын
  • Great video to get people up and running. It took me two hours to watch, take notes, and test out some examples. I feel like this was time very well spent. Thank you for this.

    @ProdMGD@ProdMGD2 жыл бұрын
  • I like the way he interacts with his viewers

    @RockIT1@RockIT13 жыл бұрын
  • This is an amazing tutorial! Please keep publishing like this. very well explained! I would love to see about matplotlib, numpy and if you can get inside machine learning

    @nutrathriveyoutube7056@nutrathriveyoutube70565 жыл бұрын
  • Very simple yet comprehensive tutorial on Pandas. You had my attention throughout. I do use Pandas for data analytics along with numpy. That said I learnt quite a few tips and tricks. Thank you for sharing your knowledge. Way to go Keith! Liked and subscribed.

    @bijoysaraf650@bijoysaraf6504 жыл бұрын
  • I have been learning python and using pandas for about 3 months now and done innumerable searches on the internet with questions regarding use of specific statements and coding. I wished I had come across your video earlier! You are a born teacher and know how to layout and explain complex terms and concepts. How can someone that looks so young have such a strong grasp on presentation and user needs? The concepts you explain are the same things I have sought information on for 3 months but all in one place and succinctly explained. Thank you for all your work.

    @garthhorne617@garthhorne6172 жыл бұрын
  • Thank you, Keith, for making this super helpful tutorial. You're a great teacher!

    @cindyshaw2485@cindyshaw24853 жыл бұрын
  • dude this is an amazing introduction to pandas. Really helpful, thanks a lot

    @orfeaspapaioannou2755@orfeaspapaioannou27554 жыл бұрын
  • Excellent Tutorial Keith. Very clear, at the right speed and interesting to learn from. This material is very suitable for a self learner. Keep it up.

    @bensondube5646@bensondube56465 жыл бұрын
  • Wow, thanks for this tutorial. I'm starting on python and took a course of udemy, but it was confusing, with your explanations many doubts are cleared up. Thanks Keith:)

    @Diegtz555@Diegtz5552 жыл бұрын
  • Thank you so much for your time and effort. This is the best python tutorial I have watched. Straight forward and well organized. I appreciate the time stamps.

    @paulblades2325@paulblades23252 жыл бұрын
  • Excellent tutorial; exactly what I was looking for. Liked and subbed. Thank you for sharing your expertise.

    @micsierra806@micsierra8065 жыл бұрын
  • On point Keith. 5 hrs worth training covered in an hour. Made my day.

    @bharathianjeneya2111@bharathianjeneya21114 жыл бұрын
  • Saved my day! I started learning Pandas, but when I missed several months during circumstances and this video about basics helped me quick comeback. Thank you!

    @kanstantsinhupalau6337@kanstantsinhupalau6337 Жыл бұрын
  • This 1 hour course is all I need for my data analysis course. This is the best video I found on KZhead. Thanks ❤️❤️❤️

    @remy0705@remy07054 ай бұрын
  • Comprehensive, perfectly paced.... Lovely tutorial!

    @viveknayak9899@viveknayak98994 жыл бұрын
  • WOW! This was just what I have been looking for! Fantastic tutorial! You explained everything very well and clear from start to finish. Best Pandas tutorial on youtube for sure! Thanks man :)

    @andyn6053@andyn60534 жыл бұрын
  • Awesome video Keith! I'm a beginner programmer but your explanation is super clear! Thanks for the videos:)

    @takako230@takako2302 жыл бұрын
  • Bro I started a data science internship in the beggining of the Year, we use a lot of pandas and you are saving my life from day 1. Thanks again, you are a god send! Subbed on both channels, cheers!

    @MatBat__@MatBat__3 жыл бұрын
  • Ok I've been learning Pandas for a while now, over many different sources, and this one video has shown me much more helpful little hints and tips than all of the other material I've looked at previously!!! Thannnnnk you! Please do more Pandas stuff as this has been so awesome =]

    @jamesdonly518@jamesdonly5185 жыл бұрын
  • Been looking for a complete tutorial on Pandas. This is amazing! Thank you.

    @takakosuzuki2514@takakosuzuki25144 жыл бұрын
  • You break down all the details in a way that I can't believe this is for free. Very high quality stuff. I was up and running with this library in short order

    @DennisGorshteyn@DennisGorshteyn3 жыл бұрын
  • This is such high effort content. I was amazed that you actually went back and sped up the video where you said you would. 11/10 great tutorial. Thanks!

    @zacharyyarost5804@zacharyyarost58043 жыл бұрын
  • Great tutorial. I'm just starting with Python, and this is a great video for picking up a lot of knowledge fast. You asked for suggestions about other videos. I'd definitely like to see more tutorials that are about cleaning data. That's the hardest part, I think. The most laborious and time consuming. I'm learning the re module (regular expressions) for Python, but it's going SLOWLY.

    @joelprestonsmith@joelprestonsmith4 жыл бұрын
  • Dude... you should make more videos... you are a natural born teacher!!

    @DavidWhitt@DavidWhitt5 жыл бұрын
  • I have my final exam in my data structures course soon, this was a great way to study and work through things my class covered months ago, thanks so much!

    @woaq4486@woaq44863 жыл бұрын
  • thanks for useful video If anybody have a problem with calculating the mean of Type 1 grouped data, use this: df= pd.read_csv('modified.csv') df.groupby(['Type 1']).mean(numeric_only=True) instand of this: df= pd.read_csv('modified.csv') df.groupby(['Type 1']).mean() That way, it won't include string-type data in the mean and sum functions.

    @amiliavachford183@amiliavachford1836 ай бұрын
    • thanks it helped a lot...can't understand the error while all the values are numreic already

      @vissokis@vissokis3 ай бұрын
    • so is it got updated now, since you can only perform the method on int or float columns ...

      @llamaland1737@llamaland1737Ай бұрын
  • Makes me want to play the old Emerald games again, wonderful tutorial, keep them coming

    @bidhanbhattarai8863@bidhanbhattarai88634 жыл бұрын
  • Thank you Keith for this video, absolutely amazing and valuable for many! THANK YOU!

    @skyblue021@skyblue0214 жыл бұрын
    • Glad you found it helpful! :)

      @KeithGalli@KeithGalli4 жыл бұрын
  • The best pandas tutorial, best mentor having strong grip on subject

    @rehanbaig71@rehanbaig713 жыл бұрын
  • Keith, I stumbled across your video from random search on deeper understanding of pandas . I felt like you did a great job presenting the material. Well done man, it was easy to follow and understand. I did appreciate the part at the end with the chunk size and group by - would like further explanation of those concepts. Thanks.

    @HansOnProduction1984@HansOnProduction19842 жыл бұрын
  • I just went through your numpy tutorial. And that's the reason I come here. Thumb up!

    @jiangxu3895@jiangxu38954 жыл бұрын
    • Appreciate it!!

      @KeithGalli@KeithGalli4 жыл бұрын
  • This is an extremely usefull tutorial. You explain so good bro. Thank you very much. Like and subscribed. Hugs.

    @jasonaraosfuentes2130@jasonaraosfuentes21305 жыл бұрын
  • I swear this is the most useful python channel on KZhead. Top stuff.

    @MichaelPeterDalsgaard@MichaelPeterDalsgaard3 жыл бұрын
  • What an excellent video. I have watched so so many tutorials, but this is definitely one of the very best.

    @stephenbouldin8163@stephenbouldin81632 жыл бұрын
  • Wow man! Holy smokes that was such an amazing breakdown. I came into this knowing nothing about Pandas and now I want to get back to work with my personal data! Thank you so so so so much. I’m off to find the documentation!

    @modernafsolutions3233@modernafsolutions32334 жыл бұрын
    • Glad you enjoyed! Your comment made my day :)

      @KeithGalli@KeithGalli4 жыл бұрын
  • 27:15 It seems that the dataframe got scrambled up a bit there, most likely from having the cell running multiple times. Even when there was an error message, it appears that either the Total or the Legendary column was moved to the left of HP. Upon running the cell again (with the corrected version?) it calculated a new Total adding the previous values and generating corrupted results.

    @xnick_uy@xnick_uy2 жыл бұрын
  • Like, seriously, The best of the best Pandas Course I have done so far! Starts off easily and basically and the explodes!

    @vzntoup@vzntoup3 жыл бұрын
  • The best tutorial I have seen so far on data analytics. I now see how python/pandas helps in data analytics. Thank you very much for making and sharing this video.

    @8rameshb@8rameshb3 жыл бұрын
  • Awesome tutorial Keith, I learnt a lot by following your hour long tutorial. Created a new notebook instead of using the GIT version as it doesn't show what happens before you commented the code.

    @mdhidayat5706@mdhidayat57062 жыл бұрын
  • I came for the tutorial, stayed for the cutesy pokemon stuff, really warmed my heart

    @disagio9517@disagio95173 жыл бұрын
  • Gold medal bro, I was searching extensively for a good data science resource and reddit just sent me to random coursera/edx courses that used to be free but don't appear to be anymore. Your content is highly organized, extremely concise, and well thought out. There is a reason that only .01% of the votes are downvotes. THANK YOU!

    @bjbmbc@bjbmbc3 жыл бұрын
  • Well Done Keith, you truly nailed pandas and made python learners life much easier :) I have started learning Python a month ago and have no language background, and I'm glad i find you.

    @NaziaFarooqui@NaziaFarooqui2 жыл бұрын
  • I've been looking for a good pandas and python video for quite sometime now. I have to say that this is really amazing. You've explained it so well that a beginner like me could easily understand. Great job and thank you. Can't wait for more videos. (if possible, matplotlib)

    @cdgxflower2679@cdgxflower26795 жыл бұрын
  • This tutorial helped me alot. Thank you so much!

    @saurabh-patil@saurabh-patil4 жыл бұрын
  • Your iterative approach is very effective to expand concepts without overloading. Great job. On to the next one.

    @nuclearhotel2172@nuclearhotel21723 жыл бұрын
  • Great job dude.

    @yomajo@yomajo5 жыл бұрын
  • I learned so much, thank you. Then at the end...that music tho. I lost it! LOL! Did not see it coming.

    @BrandonS-lk2qc@BrandonS-lk2qc3 жыл бұрын
  • Keith, you are the best Python instructor! Very easy to follow. Thank you!

    @gegao3198@gegao3198 Жыл бұрын
  • Day 1 on my journey to learn data analysis with python, this vid and kaggle's free pandas course is just what i needed to give me more motivation to keep learning.

    @atraps7882@atraps78823 жыл бұрын
  • finally.. a new video... I was waiting for a Long Time😍😋

    @mohitjain4943@mohitjain49435 жыл бұрын
  • "stop texting me! I'm making a video!" "who has the nerve" hahahahahahha you explained well, thank you.

    @nicoledeasis664@nicoledeasis6643 жыл бұрын
    • that is funny :))

      @fixithvac9360@fixithvac93602 жыл бұрын
  • you've been really helpful mate, been struggling with pandas/numpy and finally im getting somewhere! more of this please!!!

    @idonotcomplyrevolution@idonotcomplyrevolution2 жыл бұрын
  • This helped me so much Keith thank you. Work does not want me to use excel anymore for any data frame so I have to learn python. I have spent nearly a month trying to learn, even something as simple as adding a data set to python I was getting wrong. Now I am so confident with the work I have produced.

    @johnwalton1656@johnwalton16562 жыл бұрын
  • In the chunksize section, you pick a well-documented bad practice, namely calling pd.concat inside a for loop. As the loop runs repeatedly, this operation becomes more and more expensive (because new_df gets longer and longer). Per the pandas documentation, the better approach is to append each df to a list and then pd.concat the list elements just once, after the for loop.

    @DrewLevitt@DrewLevitt2 жыл бұрын
    • Hello, can you please provide with a tutorial for that? Quite new and clueless here.

      @terabhaininja9@terabhaininja9 Жыл бұрын
    • dataHere = [] for chunk in pd.read_csv('modified.csv', chunksize=5): dataHere.append(chunk) newnew = pd.concat(dataHere) This looks right?

      @terabhaininja9@terabhaininja9 Жыл бұрын
  • Thanks for posting! As an MIT student taking a data analysis class, this video was very helpful, more useful than the other tutorials online!!

    @kylieying2@kylieying25 жыл бұрын
    • Found it very useful too!

      @kipishism@kipishism5 жыл бұрын
    • "As an MIT student" Weird flex but ok

      @kregg34@kregg345 жыл бұрын
  • From the bottom of my heart, Thank you very much. May you never lack. May the elements, forces, and the entire Creation align itself for your own good.

    @Chuukwudi@Chuukwudi3 жыл бұрын
  • 2 years after this video was posted, I'm here watching and learning Tons of stuff. Thanks man!!!!

    @elwinmentaram6031@elwinmentaram60313 жыл бұрын
    • Me tooo today i watched it Comedy 😂😂😂

      @narayangautam6955@narayangautam69553 жыл бұрын
  • I loved the fact you used pokemon as data set it was fun learning I could also check my knowledge about pokemon hahha Love love

    @pemadechen9901@pemadechen99014 жыл бұрын
  • What the hell, I imagined this topic in afternoon and video recommended after only few hours. And the shocking fact I didn't even searched about this topic from many days.

    @NoName-fi2ow@NoName-fi2ow3 жыл бұрын
    • thast AI at work

      @sahajsandhuv8@sahajsandhuv83 жыл бұрын
  • This was such a good walk through for Pandas covering so much information. Thank you so much, I hope you continue to do more videos.

    @bencole8301@bencole83013 жыл бұрын
  • No BS, To the point! Man, probably the best tutorial out there. 🔥🔥 You have a sub right here. Thank you so very much for explaining this like no one ever has!

    @budwhyy9016@budwhyy9016 Жыл бұрын
  • SQL person w/ limited exposure to Python here. This was useful as hell.

    @philipcoppage3592@philipcoppage35923 жыл бұрын
  • This was such a great introduction to pandas and on DataFrame. This is exactly what I was looking for. Since I hadn't previously downloaded pandas onto my mac, and didn't feel like installing anaconda either, I was running into some troubles installing pandas with just "pip install pandas" so I thought I would include the instructions as to how I did it. simply do: pip install pandas --user If nose and tornado aren’t downloaded do: pip install nose --user then pip install tornado --user (nose needs to be installed first) then terminal also suggested I add it to my path, so I did: sudo nano /etc/paths add the path at the end of the file do ^X and then Y then hit enter

    @stephanierodriguez1035@stephanierodriguez10354 жыл бұрын
  • You've just got me 30% of my whole assignment. Thanks dude

    @joashbrijit8742@joashbrijit87423 жыл бұрын
  • Thanks for this content. I really apriciate when people pass their knowledge ahead. I'm starting with Data Analytics, after getting to know the basics of Python. Your video helped me a ton! Hope we get to see other videos with panda a cases of use.

    @yagovpf@yagovpf2 жыл бұрын
  • 39:27 i was looking for .str.contains('*') filter thanks

    @chandraprakashv2044@chandraprakashv20444 жыл бұрын
  • F for "MEGAnium" that got filtered out while being old school Poke 😂 Great course! Looking forward to learn more from you!

    @RPGMadnessVX@RPGMadnessVX3 жыл бұрын
    • I fixed it by writing "Mega " in the code.

      @iftrejom@iftrejom3 жыл бұрын
  • This was genuinly so helpful, thank you! I am mostly through a data science course and have been struggling to figure out actual applications for the information I have learned. This was excellent!

    @TomNeedhamNeDrum@TomNeedhamNeDrum3 ай бұрын
    • Glad you found it helpful!!

      @KeithGalli@KeithGalli3 ай бұрын
  • Thanks Keith for this video! I'm new on data science I'm using your videos to practice and learn a lot more. Really thanks!

    @franciscoortega104@franciscoortega1045 күн бұрын
  • When I start making money with these knowledge, I'll give you some share!

    @SMFahim-vo5zn@SMFahim-vo5zn4 жыл бұрын
  • damn I found the right one!!

    @mrxgmn@mrxgmn4 жыл бұрын
  • Working on my CS Bachelor final project, this video has been so useful to me as I'm using Python. Thank you very much!

    @eristonmansambu8225@eristonmansambu82252 жыл бұрын
  • Omgeeeeee!!!! Thank you so much!!! I've searched sooooo many videos trying to help with the delimiter problem I've had (i didn't know that was the problem) and you're the ONLY one I've found that even mentions it!!! 🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾🙌🏾

    @martistarti2374@martistarti23742 жыл бұрын
KZhead