Computer Vision: Crash Course Computer Science #35
Today we’re going to talk about how computers see. We’ve long known that our digital cameras and smartphones can take incredibly detailed images, but taking pictures is not quite the same thing. For the past half-century, computer scientists have been working to help our computing devices understand the imagery they capture, leading to advancements everywhere, from tracking hands and whole bodies, biometrics to unlock our phones, and eventually giving autonomous cars the ability to understand their surroundings.
Check out Origin of Everything here!
/ @pbsorigins
Produced in collaboration with PBS Digital Studios: / pbsdigitalstudios
Want to know more about Carrie Anne?
about.me/carrieannephilbin
The Latest from PBS Digital Studios: • All PBS Digital Studio...
Want to find Crash Course elsewhere on the internet?
Facebook - / youtubecrash. .
Twitter - / thecrashcourse
Tumblr - / thecrashcourse
Support Crash Course on Patreon: / crashcourse
CC Kids: / crashcoursekids
Your computer will detect when you are happy and start a forced 10GB update to swipe off the smile on your face.
lol
@@Elfezen or you can upgrade to Linux
@@user-rj8th9mr3s Well, being incapable to run games it doesn't necessarily mean that is worse, ain't only videogames out there , ya know
Your Windows computer*
This is probably the best explanation of computer vision I've ever seen in my life.
This is by far the greatest course that I had on my entire life about computers. I work with full flight simulators for pilot training and many things that I learnt here became so clear for me... We see many systems in a very superficial way due to those abstraction levels and with those classes I can see what's behind the scene, what's going on in a deeper way. Thank you, guys. Thank you very much for sharing all this knowledge and in a way so simple and easy to understand. You're the best!!! And I'm recommending the channel for everybody I know that likes computer science on any level of understanding!
"not to ask for updates if you are frustrated" LOL this course is so informative and entertaining at the same time. Very good job!
Our university's robotics team is currently using OpenCV so our autonomous drone can see and navigate the world. Lots of theory, documentation reading, and pulled hairs come along with computer vision, that's for sure.
Dude, I know what you feel I have been learning machine learning and most of the times it gets very frustrating.
I really love this show, it's a great way to introduce concepts before having a full lecture at a college class, or to have a wide general idea of what the career path will include.
Seems like a convoluted way to process images.
Come up with a more efficient algorithm I'll wait
I see what you did there.
@@dustinjames1268 Still, you've got to *recognize* that there's a *kernel* of truth to the criticism.
I am currently studying Imaging Science at RIT and this is the best explanation I have ever found. One of the greatest refreshers of what’s going on sense I got here.
I *totally* understood all of this. Yeah, thats it...
Facinating to get to this one in 2023 in the context of where things have gone since.
For anyone who's interested, there's a (relatively) recent system called YOLO: You Only Look Once. Version 2 came out less than a year ago, if I remember right, and basically it uses computer vision techniques to classify many different objects in a scene in real-time video. As in, it's fast enough to fairly accurately detect and label many different objects in an arbitrary scene 24 times per second (24fps is a standard video frame rate). It's super interesting! :D
Do you know if it uses the information it got from previous frames?
Awesome naming for it!
Only the current frame, also you use in general, downloaded pre-trained models for weights, so you can start detecting things easily, you can add if want new detection, but of course it takes a lot of work, real time HD video needs around the GPU power of the GTX 1080 Ti, depending on the complexity of the weights, but you can trans-code a video, with the labels added on them, and watch later the final results.
@buda20, Thanks for referring to the type of gpu required for this, which answered one of questions as I'm building my own workstation for deep cnn, video object detection, ... Can you recommend minimum hardware specs? Seems to me a capable system has always been taken for granted. Thanks
Yes! 😁
Ooo, speech recognition and synthesis! I'm super excited for next week now - I'm a computational linguist, so this is my jam. Can't wait!
I found the narrator very pleasant to listen to. Also the video was very good.
Great video! I'm taking a Computational Vision course right now. It was nice to know what you were talking about.
Tahsin Loqman May I have your email address . I am interested in this course
Great lesson. I can't wait 'til next week. Thanks!
Wow, you did a great job of making something difficult easy to understand! This video was a great help!
Anyone else watch these on 0.75 speed?
I've used Photoshop for years, it's really cool took take a look under the hood of image processing.
I love computer vision with maths and all ❤
First couple seconds of the Video, what a second that looks familiar, then realise it’s a footage of my hometown.
The best online program, don't stop doin it!
Just wondering where have you been 😊 Happy to see you again
Thanks a lot! It was a great introductory video to computer vision.
The computer in the thumbnail looks like the one in Don't Hug Me I'm Scared Part 4. Which makes the topic even scarier.
Can't wait for next week!
funny and clear! This series is the best.
Paused because I noticed the Ghost in The Wires book on your shelf. Bought this book after a Kevin Mitnick conference I saw last year :)
Best videos series ever about computer science,.,, Thank you..
wonderfully explained
Very excellent explanation. Thanks for your videos. Please upload videos on machine learning and artificial intelligence.
More useful than my whole semester CV course...
Apart from face recognition, OCR is another nice field of research for 'teaching computers' to see !
thank you, this was helpful
Way to go Carrie Anne!
Love to see the passion this woman have for her job. I lost my passion somewhere along the way.
Great video , very informative.
This is awesome
You're an absolutely brilliant communicator! I'm doing a computer vision specialization on Coursera with the University of Buffalo and your high level intuition just gave me oodles of excitement. I dream of one day developing my own algorithm for real time navigation for data constrained systems. Thanks, really, this was a fabulous primer video, and certainly one I'll show my best friends. ☺️
This could be were Quantum computers shine. It can analyze all that data all at once basically seeing the bigger picture.
Really interesting !
Extremely excellent. Thankyou.
YOU ARE AMAZING!
That's super cool^^ Thank you!!!
what a sweet world would be one that has computers capable of awareness of their surroundings
Self driving cars often (also) use LIDAR, which has the great advantage of knowing distances, so the car is able to see in 3d. The (biggest) exception to this is Tesla, which decided that normal cameras work just fine, to which I say sure, but why not make it even better?
Good quality LIDAR sensors needed for automotive applications are expensive. One of Tesla's goals was to ship all of their cars with the hardware needed for automated driving, long before their cars were capable of automated driving. Shipping the initial cars without this feature but with LIDAR might have been too expensive to be feasible. Using this approach, Tesla collected millions of miles of consumer data about typical driving conditions. This treasure trove of data enables them to understand under what conditions what sensors fare poorly and gives a large test set to compare algorithms on. I don't know if Tesla made the right decision. They might have problems getting their system working in urban areas or other challenging environments without the more detailed data that LIDAR provides, but the low cost of Telsa's sensor package enabled them to use consumers to collect data for Tesla, and that is a significant advantage for Tesla's engineering design team.
Good video Anne.. i need your insight on something... am working on recognizing partial occluded license plate. can you contribute to my research. thanks
Thanks for the greate video!
Can anybody recommend a minimum hardware requirements for computer vision/object detection? Thanks
Mam very nice video, Mam please also made full course videos also with very easy explanation & cover only those maths which require for that course. Because your explanation is very simple
Will you guys be uploading after 2 weeks from now on as you did with this video ?
At 5:52 you forgot to mention the bias value.
You guys rock!!!!
*connects a function generator to an oscilloscope in the background for some fun sciency atmosphere *
thats really convoluted
When my Windows laptop will be able to recognize I'm not in the mood for an update, only then I'll pull that sticky tape off my webcam. That also means I'll never get updates :(
Awesome video! How exactly are these image processing softwares implemented - would it be a low-level programming language like C, a high-level like Python or would it even be at the hardware level?
Shame no one answered before. Anyway, can be both. Python can be used in production and in testing (Really good libraries for complex computer vision like convolutional neural networks, object segmentation and so on). However when really high control over performance is needed, be it memory or computing speed, a low level language like c or c++ are used. Hope that helped :D
@@TalSoikis Yeah awesome, thanks :D
Would you share a link for further reading?
Hey, i know that place! Sydney Olympic park!
I love convolutional neural networks
Convolution just happened to pop out from nowhere. In case you are wondering, convolution is the operation that maps a set of values (also called N-tuple where N stands for the quantity of elements) to another set of values. Very simple example: _1,2,3,4_ is a 4-tuple _+1,+1,+2,+2_ is a simple convolution _2,3,5,6_ is a 4-tuple as result of applying the above convolution
Wasn't AlexNet responsible for CNNs becoming a thing?
Great !!
very like this video
GREAT-VIDEO!!😁💻👀👂👍
CVision + Neural Network + Bad AI = me by the way 5th
Lol. 😂
Lol I just imagined this in the next patch of Windows: If(user.faceEmotion=frustrated){ restartNotice.Postpone(2) }; Not that they would ever do that though...
Where can i find the sources for this video???
My PC is already quite aware of it's suroundings. Usually there's me and there will be a hammer if computer starts to misbehave.
When I started watching this video, I did not expect it would actually help me with my physiology course. I finally understand receptive fields :-D
Plz leave a link to The Origin of Everything, would love to check it out.
The link is in the description.
Carrie Anne you look so cute with your glasses on, you should keep them on for all your videos
Big brother 😎
She said kernel so many times i can’t quit thinking about popcorn
Nice👍
KinaTrax uses computer vision to record kinematic data on baseball pitchers. Biomarkers are no longer a requirement and data can be tracked accurately in game. Computer vision is revolutionizing the game!
yay!
Make a video on Mercury cycle! Please
you make it
Does anyone know the titles of all the books in the background of the videos? The only ones I can make out are "Ghost in the Wires" and "Linear Systems and Signals".
The orange one is *Robot Builder's Bonanza* . The others are way too blurry to read unless someone recognizes the spine design. (I've tried extracting frames from the downloaded video and enhancing them... no luck :C) I guess we could always ask CrashCourse themselves? ^^
Yay Fei-Fei Li! Watch her TED talk too.
I would trade all my privacy just so Windows do not ask to install updates when I'm mad!
So the government is watching me through my webcam?
"Abstraction is the key to build complex systems"
Machine vision will match ours when we can shrink 1000s of processors each capable of 1000s of petaflops to the size of an eyeball connected to the equivalent of the human brain's vision center.
The clip of the tracking of the fingers, arms, and face of the guy reading from the book makes me think that some day soon there will be a presentation or something where they show a computer detecting sleight of hand in a magic trick. Would be a pretty neat way to show off the accuracy, anyway.
That Macintosh in the back needs some serious retrobright treatment.
Dang, you're right. I shall watch an 8BitGuy restoration video to soothe myself now.
Ah I knew there'd be an 8 Bit Guy fan around here somewhere :D
I suppose these are the same kernels used in Photoshop
Is the guy in the middle the secret brother Dave?
Designer is a Liverpool FC fan I see.
....are internet connected microwaves a real thing?
How did you get 147 ? I can't understand. -185-186-186+233+233+233 = 142
May be the presentation error, but still the concept is clear with the next example which equals to 1
isn't it upper left corner?
Amazon Go is an example
我想字幕 Who stole the subtitles?
I was 100% in until 80% of the video. Then, it was like...
CCTV camera?
So.... How do you play sudoku
According to this, I should never be asked to update...
YOLO (you only look once)
I was going to mention that.
No edge!
A computing device should never change behavior depending on highly subjective factors, it should only do what it is explicitely told to do.
If my computer can't lie then it's not really alive!
But then it's just a faster calculator :(
Even if something that is subjective is in the code, it’s still *in the code*, meaning that the computer is not making its own decisions. Only the programmer makes decisions based on subjective ideas.
The point is that the user's inputs to the program are no longer under the complete control of the user. Typing & clicking the same things might not get the same behavior/output, based on some ML classifier trying to interpret your mood/intentions/... more or less well. This *affective computing* approach is very different from regular UI design.
PogChamp