Training AI to Play Pokemon with Reinforcement Learning

10 May 2024
6,300,122 views

Code:
github.com/PWhiddy/PokemonRed...
Discord:
/ discord
Collaborations, Sponsors:
See channel email
Buy me a tuna melt:
www.buymeacoffee.com/peterwhi...
Sections:
0:00 - Intro
1:20 - How it works
2:54 - Let the games begin
4:04 - Exploration, distraction
5:46 - Level reward
6:38 - Viridian Forest
8:06 - A new issue
8:44 - PC Trauma
10:10 - Healing
10:45 - Gym Battle
12:43 - Route 3
14:44 - Mt Moon
15:54 - Map Visualizations
18:53 - RNG manipulation
20:07 - First Outro
20:26 - Technical Intro, Challenges
21:44 - Simplify
22:43 - Efficient Iteration
23:56 - Environment, Reward function
26:26 - Metrics & Visualization
27:46 - Future Improvements
29:24 - Run it yourself
32:58 - Final Outro

Comments
  • An AI being traumatized by using a pc is the most ironic thing I've heard in a while

    @deesh6378 6 months ago
    • Haven't you seen twitch plays pokemon? PCs are a death sentence!

      @mcstrategist 5 months ago
    • I didn't even make that connection 😂

      @nimi-nae 5 months ago
    • @mcstrategist I remember that. People were spamming to get rid of Pokemon. They had to ban people and make rules. That was pretty hilarious though.

      @derrickkamphaus8743 5 months ago
    • @nimi-nae Same. But yeah, that’s pretty funny

      @derrickkamphaus8743 5 months ago
    • Sudden excessive punishment for curiosity traumatizes a first-time experiencer. Seems all too legit 😅

      @Kurayamiblack 4 months ago
  • I laughed so hard when the AI refused to press the A button when it lost.

    @markcooper4876 5 months ago
    • Stalling to avoid the outcome confirmation. Reminds me of young children, actually. Haha

      @MP-lv5vk 4 months ago
    • @MP-lv5vk Sometimes the sound of a door slamming because of a gust of wind can remind me of children slamming their hands on a table. There is ZERO connection/homology between anything in the bot-produced behavior and the realm of human motivation or other emotions. It is logically impossible to learn anything about humans from literally anything in this showcase, except by observing the actual human who decided to create this mathematical formula of instructions (algorithm) for a low-level brute-force bot.

      @JohnnyNatrium 4 months ago
    • @JohnnyNatrium Yeah but it reminds me of children's stubbornness lmao

      @kphaxx 4 months ago
    • Children can be the sorest losers, refusing to keep playing is hilarious 😂

      @marcelgonzalez1151 3 months ago
    • The only winning move is not to play.

      @Loliconman 3 months ago
  • I’m so glad you didn’t stop when you said “this sounds like a reasonable stopping point”

    @Toolazytothinkofagoodhandle 2 months ago
    • But then he stopped not too long after 🥲

      @BorrisBackyardigan 5 days ago
  • I dunno why, but the clips where all the AIs aimlessly walk around like a colony of small ants are unbelievably adorable to me

    @cappuccinocappy 1 month ago
    • holy shit ai are the ants. or are ants the ai?

      @sugabopp 1 month ago
    • Is this a subtle nod to @SmallAnt ?😂

      @shreyandas4243 4 days ago
  • it was unreasonably adorable when the AI stopped in Pallet Town to enjoy the scenery

    @Hitmonstahp 7 months ago
    • Seconded

      @azukar8 7 months ago
    • The AI is cute

      @AyaxTheDragon 7 months ago
    • Based AI knows true happiness.

      @htspencer9084 7 months ago
    • Ok but did you see the little dance after beating the bug catcher on the first try?

      @Trashley652 7 months ago
    • Yeessss I envision it taking everything in with a solemn smile, knowing that it’s about to leave this quaint town on a grand adventure of trials and learning. ‘Just one more moment at the banks of this familiar lake, then I’ll be off…’

      @kratangg-arang 7 months ago
  • “The ai is learning how to move, and is just walking around” really explains a lot of my online teammates in first person shooters.

    @butterfish6799 6 months ago
    • Bots

      @jeffwooten6888 5 months ago
    • Like my team mates in LoL

      @Johnrick90 5 months ago
    • Lvl 1 lukes in star wars battlefront 2 hvv

      @hallmark1 5 months ago
    • Npcs playing npcs 😢😮

      @porkhill6665 5 months ago
    • @jeffwooten6888 "bot" sounds so negative. Maybe we should start calling them "reinforcement learners" instead.

      @IschmarVI 1 month ago
  • i love that the AI decided to just hang out and watch the scenery. reminds me of my favorite poem “Stopping by the woods on a snowy evening” by Robert Frost

    @brandonbrsndon 2 months ago
    • Everybody likes Robert Frost

      @piciperkuadrik4636 1 month ago
    • I’ve done this many times in my playthroughs of Pokémon; it’s actually scary how much the AI “mimics” human behavior.

      @danielserrano929 1 month ago
    • @piciperkuadrik4636 Not true, I actually HATE Robert Frost

      @aceq361 1 month ago
    • You have good taste. That's a beautiful poem

      @TheMysticalmax 4 hours ago
  • Since I'm all into both Pokémon and coding, YouTube suggested your video just minutes after you uploaded it. I subscribed after a few minutes of watching it, and now I've watched it again and noticed you have almost 50k subscribers! With just one video! Please take that as a public, worldwide testament to the effort you have put into this. Thank you so much!

    @lateusbetelgeuse 5 months ago
    • Broke yt 😂

      @DruggiePlays 1 month ago
    • If you like Pokemon and AI, you'll love this: kzhead.info/sun/fruweqixeXpunJ8/bejne.html&ab_channel=Spawnvilley

      @t2g648 1 month ago
  • This must've taken an insane amount of time to not only simulate but also edit, really good video, nice work

    @DolanDarker 6 months ago
    • Omg Dolan you fucking legend where you been

      @MoazSalama-ly5jf 6 months ago
    • rN6media does the edits

      @MrGoodeats 6 months ago
    • Have you forgotten your password?

      @anouaressanoussi 6 months ago
    • @anouaressanoussi obviously not

      @Hawk7886 6 months ago
    • @MrGoodeats yeah, most YouTubers don't edit their own content anymore

      @itsOZone 6 months ago
  • The ai discovering rng manipulation is mindblowing. I wonder if games in future could use ai to learn tedious or very specific glitches during beta testing.

    @pengwino828 7 months ago
    • They already do!

      @antoinecharbonneau5108 7 months ago
    • Dude it clicked as he was explaining it "wasn't optimal" but also repeating and I was like "NOOOOOOO!!!"

      @NinjaArmy36 7 months ago
    • Why bother?

      @user-kt6ne3fx6u 7 months ago
    • @user-kt6ne3fx6u simple, AI thinks and tries things differently from a human; it could discover stuff the devs wouldn't even imagine was possible

      @veto_5762 7 months ago
    • This is an elaborate version of fuzz testing, which is the act of feeding random data to a program to see how it reacts.

      @snowolf494 7 months ago
  • Your findings, implementation, logic, and ANIMATION are incredible. 👏👏

    @brandonvolesky9867 4 months ago
  • Extremely impressive visualization of the simultaneous iterations. It can be hard to grasp that machine learning happens in batches of massively parallel attempts, not one scenario after another. Excellent video!

    @e4e5e2e7 3 months ago
  • The AI is cool and all, lots of comments discussing it, but I just wanna say, the editing is so awesome for a video like this; you don't often see such excellent presentation

    @Tommybgoode 7 months ago
    • I'm honestly baffled by how this was animated. How did you get the scenes with the thousands of character sprites moving about, all overlapping one another?

      @Lone.Willow 7 months ago
    • @Lone.Willow all is revealed at 26:27

      @patrickjones3826 7 months ago
    • 200% this. Not only taking on the entire workload of the project, but taking the time making such an enjoyable and informative visual aid is stellar!

      @scotthuber8536 6 months ago
    • @Lone.Willow Yeah that's what's wild, the AI stuff is sick, but the editing to show the iterations had me fucking floored.

      @XistenceX1 6 months ago
    • I just thought the same, the presentation is amazing 👏

      @maxmuller6730 6 months ago
  • That whole traumatic experience with the PC and the Pokecenter was fascinating. Thank you for making this

    @olemew 6 months ago
    • The poor AI aww 😢❤

      @jeremycontreras6229 5 months ago
    • It triggered my Twitch Plays Pokemon PTSD

      @whirlpoolstudio97 5 months ago
    • the analogies between human behavior and AI behavior were quite interesting in general, though the trauma sticks out. also kinda makes you think about ourselves, doesn't it? after all, this is ultimately just a statistical algorithm with a simple reward system, but it manages to show some rather lifelike emergent behaviors, which weren't inherently programmed into it. then again, pretty much all of life is not that different, the model and algorithm are just much bigger and more granular and complex.

      @Spooglecraft 5 months ago
    • Indeed, it happened to me when I was young. I didn't know how to withdraw pkmn bc the storage system was a mess, so I didn't use the PC anymore xd

      @perrowason5096 5 months ago
    • Reminds me of the trauma triggered whenever Twitch plays pokemon went near the computer after they accidentally released all those pokemon haha!

      @shanemorris3554 5 months ago
  • As a Pokemon enthusiast with 4 Pokemon tattoos and a data analyst aspiring to become a data scientist, this project was one of the coolest to watch! I was so fascinated that I decided to replicate the project myself. I encountered some difficulties along the way, but the Discord community was incredibly helpful. Congratulations on the project! 🙌

    @willianrocha5475 2 months ago
  • 5 mil on your first video. Great quality, good research and breakdown. Congrats, can't wait to see what you bring next!

    @menerdo 4 months ago
  • This was extremely well made. Great job

    @kylehill 6 months ago
    • Holy cannoli it's science boi Kyle "Thor" Hill with his locks in the wild.

      @GameTimeWhy 6 months ago
    • I see we spend our sunday nights similarly. Lmfao.

      @draaaven157 6 months ago
    • This is honestly one of the best endorsements this video could have

      @buddycal1 6 months ago
    • I’m certain the algorithm recommended me this video because of your comment

      @BigBaadMark12 6 months ago
    • Its the goat 🐐

      @Big_Biba 6 months ago
  • Honestly the AI becoming traumatized from the PC was heartbreaking. Poor lil guy didn't understand what happened

    @ArmoredarmadilloX 6 months ago
    • My heart dropped when it was revealed he never went back to the Pokémon center afterwards, I felt so bad for the guy.

      @istumby 6 months ago
    • @istumby right? Just imagine how rewarding it would’ve been to gain those total levels back! Probably would’ve broken the reward system, as there’s nothing keeping the AI from depositing the Pokémon just to get rewards for pulling it back out.

      @hunterwylie6969 6 months ago
    • @hunterwylie6969 Deposit, withdraw, deposit, withdraw like a junkie.

      @Yelonek1986 6 months ago
    • "The Pokémon center stole my only squirtle!"

      @nousukas 6 months ago
    • Don't feel bad, they learn as they go!

      @kdsavage1991 6 months ago
  • I’m not sure if you noticed this or not Peter, but this is historic. In terms of R&D and just human science. Very impressed with this creativity and passion. Cheers 🥂

    @elrudiiisimo3066 5 months ago
    • Genuinely blown away by the many high level skills this takes. On top of that, you have an incredible ability to teach high level concepts to a lay audience. Very rare!

      @kaComposer 3 months ago
    • @kaComposer Agreed. This level of technical ability plus storytelling ability is magnificent.

      @PantheraTK 12 days ago
  • Incredibly well made video! I think your resourcefulness and ability to explain things in non-technical terms shows a deep understanding of the topic. Plus the storytelling is top notch

    @McDonaldsCalifornia 2 months ago
  • Not that I don’t love the videos that just say “I applied an AI to this game and here’s how long it took to finish it” but this video (in addition to its high quality visuals and great script) is so much beyond that. Instead of just watching a video on AI, we’re learning about reward implementation, the human condition, curiosity, and more and more. This went above and beyond, I was so rooting for our AI buddy by the end of this lol.

    @timothypickarski5234 7 months ago
    • You're right! This feels like an in depth, academic essay

      @Fractisdnb 7 months ago
    • I want to see the AI beating the game

      @wiiu-theunderratedconsole7569 6 months ago
  • "Just hanging out and admiring the scenery, is more rewarding than exploring the rest of the world." Never have I felt more like a machine learning algorithm than this sentence right here.

    @DigitalIndra 6 months ago
    • The digital world is more rewarding than the real world

      @harm991 6 months ago
    • Very relatable outcome!

      @ceigey-au 6 months ago
    • Me too, why bother capturing and fighting when you can just chill and enjoy the motion of leaves and waves? Quite poetic

      @counterleo 6 months ago
  • I’m thoroughly impressed by this video. Rarely someone executes an idea to this level of clarity. Visually it is very easy to understand what’s going on as well. Plus you’re sharing the project with everyone. Keep up the hard work

    @Lelouch999 4 months ago
  • everything about this video is extremely well done, even the editing, i think big bro is an actual genius

    @dtolud 4 months ago
  • This was edited and put together so amazingly well. I haven’t even finished yet, I just needed to express my gratitude that you took the time to not only complete this project but edit the process in such a visually appealing way. Thanks for 33 genuinely enjoyable minutes!

    @aylakoch4516 6 months ago
    • shit was boring asf, felt like a lecture lol

      @dognigga@dognigga6 ай бұрын
  • If you ever do have the AI finish the game, I think it would be really cool if you let the same AI try Pokemon Gold. Seeing whether an AI trained on Gen 1 could play Gen 2 would be an interesting experiment

    @hornoxthekingslayer8100 6 months ago
    • Obviously it'd have to relearn how to navigate the map, but it'd probably do well in battles since it already knows how

      @zeebo30 5 months ago
    • It wouldn't be able to catch the Farfetch'd or use Cut. This game would fail too with the HMs

      @johnhamilton5431 5 months ago
    • I'm going to do this as a project for my machine learning class, and I am planning on trying the same algo on Gen 2.

      @geekygecko1849 5 months ago
    • @geekygecko1849 do make a video

      @lpsfoxstar8454 5 months ago
    • @geekygecko1849 how can I follow along?

      @MizChivVvOzZz 4 months ago
  • I can't wait for this project to hopefully continue in the future. This was so well done

    @000glowinthedark000 4 months ago
  • Such an amazing first video - can't wait to see what you do next.

    @telprydain1 3 months ago
  • watching the little reds go round like an ant colony brings me so much joy and i don't know why. look at them all exploring. learning. discovering the world. lil guys. thank you for spending at least 1000USD and several hours putting this together just for me to uncontrollably laugh at the reds for 20 minutes ..with that out of the way, fantastic video. incredibly readable visuals and clear voiceover, awesome topic, understandable for several levels of previous knowledge. can see this hitting the high hundred thousands.

    @turingtestingmypatience 7 months ago
    • I was looking for this comment because I thought the same!!! It was like watching ants!! Just amazing!! This video blew my mind... Imagine a Pokemon game where you can compete against a real "rival" (Blue) in real time just to see who wins the league first... And every run the rival gets different Pokemon with different moves... This guy is just insane, this is like a Pandora's box!!!! New sub for sure!!!! And thank you for this video Peter!!!!!

      @raula6533 6 months ago
  • I'm so glad the YouTube algorithm decided to recommend your video and I clicked on it. It's fascinating to watch the process and journey that the AI goes through, while the presentation of the whole video is equally fantastic. Great video, you all deserve a round of applause for the effort and quality put into this whole project.

    @fartmicrowave 7 months ago
  • This is a masterpiece! I really respect that you made all of this, including the video editing. What a talent!

    @kboss1998 2 months ago
  • I cannot believe I am actually watching this intently. It's so fascinating! And how you relate the AI's experiences to a human's experience in the real world is exceptionally well done. Good job mate!

    @Prince_Oli 3 months ago
  • As a physicist i appreciate those visualizations. This is truly remarkable content.

    @jondebeer6863 7 months ago
    • wtf does you being a physicist have to do with anything? guess you just wanted attention.

      @aurelia8028 6 months ago
  • The AI naming Squirtle “AAAAAAAAAA” killed me! 😂Thanks, amazing content.

    @MichaelCrecker 6 months ago
    • AI picked the Squirtle in Pokemon Red lol what a contrarian

      @RevanBC 6 months ago
    • i was hoping someone else had mentioned this

      @Tropictopic69 6 months ago
    • @RevanBC that was its only option...

      @Tyler-qh7bf 6 months ago
    • @Tyler-qh7bf No you can pick 2 other pokemon! idiot.

      @RevanBC 6 months ago
    • Pigeoto was ‐-----------

      @sergeantjoe6802 6 months ago
  • Insanely well made video, rare that I want to go out of my way to share videos with people but this is really impressive. Lots of extra "human lessons" in here that you glossed over, like the Pokémon Center trauma for example. Super good man

    @songofalchemy 4 months ago
  • It's a very high-quality video. All the screen views and the different AIs moving around make it very pleasant to watch! Thanks

    @ToGham21 5 months ago
  • I honestly expected this video to be from a youtuber with thousands of subscribers, to see that you only have 60 baffles me, this is an incredibly well-made and well-put together video.

    @eddie7252 7 months ago
    • yeah i thought the same, it's gone up to 400 now but still nuts

      @viperific3410 7 months ago
    • tbf, it's his first video.

      @androsp9105 7 months ago
    • @androsp9105 yeah I only realised that after I left this comment, even more nuts lmao

      @viperific3410 7 months ago
    • He’s gained nearly 5,000 in a few days. Very good going.

      @Station9.75 7 months ago
    • Misuse of commas.

      @AwesomeHairo 7 months ago
  • It's one thing to set all this up, and it's another to visualize and present it in such a coherent and digestible way. You did both so well! Hope to see more content from you in the future

    @jdllim 6 months ago
    • Agreed! This video is insane!

      @user-vs3fv1ii1o 6 months ago
    • I can't believe it's done by one individual. Super high quality.

      @youngsdiscovery8909 6 months ago
  • This might be the coolest video of AI playing a video game I've ever seen. I love all the fascinating emergent behaviours (especially the RNG manipulation), as well as the analogies you draw to humans. I also love that you presented the technical explanations in a way that allowed me understand almost everything without any programming knowledge, just a decent understanding of AI. Genuinely amazing job, I hope to see more like this in the future! :)

    @user-io6ww9uv7e 3 months ago
  • This was absolutely amazing, my friend! Please do more of these! I must admit I was disappointed that you didn't do the whole game 😂

    @Distractionn-CG_5945 21 days ago
  • Seeing high effort videos like these from relatively low sub channels always surprises me. Definitely deserves more recognition/subs.

    @steven-mz3jf 6 months ago
    • It's the only video on bro's account lmfao wdym

      @napoleonbonerfarte6739 6 months ago
    • @napoleonbonerfarte6739 lol was about to write this too

      @RandyGBH 6 months ago
    • Good things take time

      @abhishekkoundal584 6 months ago
    • And people who overreact to low sub channels being high quality don't surprise me. Lots and lots of dumdums out there

      @absolutelyfookinnobody2843 6 months ago
  • Haven't even finished the video yet, but I want this to pop off in the algorithm. This video had tons of effort put into it and deserves to get out there.

    @BlackScytheLP 7 months ago
    • I've got some good news for you, that's how I found this video

      @Neo_Data 7 months ago
    • The algorithm brought me here

      @esotericraime1441 7 months ago
    • Guess I'll throw on a comment too then. This is great!

      @auraonline9073 7 months ago
    • yesss this was so cool

      @pionaiz 7 months ago
    • Thanks then

      @aaronhpa 7 months ago
  • The fact you walk through running everything for everyone else is so generous. Thanks!

    @woodybutler 4 months ago
  • superb video, text and learning, good job! Also, on the relatableness of experience, occurrences and patterns, there's memetic we use now more than ever to render perceptible the archetypal moment in videogames where a point of inflection occurs and hold a certain humour that we choose to transmit via memes later on.

    @dead0barbie 2 months ago
  • As a psych prof I'm always trying to think of different ways to explain certain concepts and give relatable examples, and this one is perfect!

    @SplishySploshy 6 months ago
    • They tell me I’m crazy here 🤪

      @MasteringSilence 6 months ago
    • @MasteringSilence Crazy? I was crazy once

      @norabarlow17 6 months ago
    • @norabarlow17 you only lose your mind once… They put me in a rubber room with rubber rats…

      @MasteringSilence 6 months ago
    • @norabarlow17 they locked me in a room. A rubber room with rats.

      @knockout8157 6 months ago
    • As a psych professor can you explain the appeal to these people repeating the copy paste comments? Also just to be clear I'm also asking out of genuine curiosity if there may be psychological reasons past the basic wanting to be a part of something, and not just trying to hate on them or anything ✌

      @charpool169 6 months ago
  • Very impressive. I'm looking forward to any future content you upload

    @Liquid_Joe 3 months ago
  • First and only video on the channel and it's already a banger like this... Great work there amigo. Btw, I really appreciate you leaving the project open source. I hope I can improve it in some way in my own experiments...

    @mateuscrevelin3394 4 months ago
  • They told me my Pokémon phase would pass. Little did they know, it was just evolving into an AI obsession!

    @BryceHuston 6 months ago
    • Pokémaniac Bryce Huston wants to battle!

      @sanjaywilson8232 5 months ago
    • @sanjaywilson8232 LMAO!

      @ashashii911 5 months ago
    • *Pokemon Trainer Battle Theme starts playing*

      @30303Steve 4 months ago
    • Edit : Go Lucario ! Fight Pokemon Bag Run away

      @821aq 4 months ago
  • This is their first YouTube upload; it’s crazy to me how much work, effort and money went into its production without having built an audience on an already successful channel before. Mad props to you Peter.

    @chrispyvolterra 6 months ago
    • I am looking forward to see what else you will create.

      @chrispyvolterra 6 months ago
    • He is an employee of Amazon Headquarters in Seattle 👏🏽👌🏽 He is smart af

      @notavailable947 6 months ago
  • We need more! This was so fascinating, informative and entertaining. I hope you're able to make an AI finish the game one day, it would make for one hell of a video! Thank you!!

    @yusukeurameshi500 3 months ago
  • More more more! What an outstanding first video! Can't wait for more from you!

    @freshcupofjoel3000 21 days ago
  • That was incredible! I’ve always wondered if this was possible, I’m blown away by what the AI was able to learn! The visualizations and presentation were excellent, I hope this video reaches a wide audience!

    @joshuasims5421 7 months ago
  • This was awesome, I'd love to see a full series of the AI completing the game.

    @RageAgainstTheTards 7 months ago
    • Yes!!

      @user-pv4cw3du2p 7 months ago
    • download it and train the AI more

      @tsunalein 6 months ago
    • And then i'd like to see it completing the game as fast as possible. An AI speedrun competition: winner gets 100,000 arbitrary points

      @cassidy8307 6 months ago
  • Breaking down some really cool tech for the layman just earned you another sub! Great video mate 👍

    @QoStoOds 4 months ago
  • This video is amazing! The AI part must have taken hundreds of hours or even over 1000 and the editing must have taken dozens of hours. Almost 5 million views and 50k subscribers in 2 months is a lot and well deserved.

    @NaudVanDalen 5 months ago
  • This was an amazing project and explanation. You should submit this to The Journal of Geek Studies if you don't have a publication lined up already.

    @henriquemagalhaessoares8739 7 months ago
    • Wah is that a thing?

      @seveneyes77 6 months ago
    • @seveneyes77 Yep! They are an online publication that uses geek culture as a way to popularize science. They have had a bunch of articles, from the biology of Final Fantasy monsters to the effectiveness of Superman's disguise.

      @henriquemagalhaessoares8739 6 months ago
  • Dropping a comment to help the algorithm. This video honestly deserves millions of views. I love the part where the AI learned to RNG manip to catch a Rattata. It's one of those moments that's unexpected at first but when you go back and look at it it's like, "oh, of course it would react like that!" Moments like those are why I love AI learning videos like this.

    @Solsumi 7 months ago
  • Amazing model organization and execution, amazing post production, amazing communication of novel and/or complex concepts in a way everyone can understand. Well done and well worth the multi million likes you’ve gotten

    @MrRaveHaven 3 months ago
  • how many grew a gambling addiction trying to win an eevee?

    @DNAngelOtaku 2 months ago
    • No one since you get it for free as a gift in one of the rooftops lol 😂

      @livingdamen4363 1 month ago
    • @livingdamen4363 Still six, gambling addiction doesn't work logically 😢

      @looppooper2306 3 days ago
  • Holy crap this is your very first YT video? I can't wait to see what you cook up if you continue to create! Outstanding work!

    @LPcrazy_88 6 months ago
    • Tbh I didn’t know youtube algorithm allowed channel with 1 video to pop off like this. Over 1 million views in 7 days?? If this video was posted in a sizable channel, it might have been even 10 times more.

      @clickpwn 6 months ago
    • @clickpwn he paid for the views XD

      @bilibangbang@bilibangbang6 ай бұрын
    • @@bilibangbang how do you know that?

      @sarkhaaan 6 months ago
    • It gets better when you go into the GitHub project and find out that he has been working on this for the last 2 years...

      @stonybaboon 6 months ago
    • @@bilibangbang mald

      @blake..- 6 months ago
  • I really like how grounded and transparent your breakdown of the AI capabilities and limitations is, it shows it as a tool and not as a magical solve-all-problems strategy. Also, what a masterful storyteller and explainer you are. This video is very well paced and laid out, congrats!

    @trbremm 6 months ago
    • Yes it's limited but imagine what it could become in a few more years 🤖

      @SSGoatanks 6 months ago
  • I love this video, idk what you have planned for the future of this channel, but I can't wait to see it!

    @jonathanlunger2775 3 months ago
  • I love how you equated the AI actions with human actions. This was fascinating. Great editing work.

    @elizabethburns-gundel1052 5 months ago
  • Fellas, I'm an AI engineer with a short background in reinforcement learning, from a period when I interacted with Sony for a job. I need you to understand the MAGNITUDE of these results. It's insane work, and I'm sad that probably only a few will understand the sheer amount of skill required to do this. Insane job man, you are a goat

    @matteoemanuele-gi4jk 6 months ago
    • This is no understatement. This takes a level of focus and problem solving that is just not normal. Savage!

      @dkm9090 6 months ago
    • I’m not even an engineer, and my jaw is on the ground. I genuinely would love to learn how to become a part of this world. I wish there were more people in my circle with hobbies and fascinations like this. I used to help write XML code for World of Warcraft bots when I was a kid. Now I'm lying in bed with an alarm set for five hours from now. I’ve got a sales job… is 33 years old too old to learn how to work in this scene? This video drips with knowledge, and a wisdom and understanding of something that I have no idea how to even begin to approach. Kudos!!

      @bricegardner7815 6 months ago
    • I wouldn't say those results are impressive theory-wise? The impressiveness of the work comes from a technical point of view: how well he managed to link the RL model with the game and the fine-tuning he put into it. By the way, AI engineer doesn't really mean anything. What is your job title? Out of curiosity

      @alr9447 6 months ago
    • @bricegardner7815 No age is too high. With enough determination and curiosity you can definitely pivot. Look into videos explaining the skills required to get a job in game development/AI.

      @harshrajjadhav940 6 months ago
    • @@alr9447 I am officially a data scientist, but within the team I'm the guy responsible for training the ML models, so I make this distinction because nowadays "data scientist" is too broad. In most big tech companies, AI engineer is a common title used to distinguish among the data science folks

      @matteoemanuele-gi4jk 6 months ago
  • Just 10 minutes in, and it has already gotten so damn interesting! The behaviors, the systems, the events, the unexpected but explainable scenarios, the AI literally experiencing something comparable to trauma? I want to see more!

    @dralinkushinen 6 months ago
    • The Red swarm wasn't enough?

      @kaelthunderhoof5619 6 months ago
    • Me too, I was sad when he stopped at Mt. Moon.

      @kaio0777 6 months ago
    • The AI doesn't experience anything because it's not a conscious entity. It experiences as much as Microsoft Word when you open it.

      @Elintasokas 6 months ago
    • @@Elintasokas 😅😂😂

      @alexb8926 6 months ago
    • @@Elintasokas Based on my PC's heavy breathing when I open Word I assume it's orgasming.

      @JamanWerSonst 6 months ago
  • This man releases a single video in his career and makes the front page. Keep the sick vids coming, clearly a big name in the making. Dude, you got this

    @ideannassiri9672 2 months ago
    • 1 video, 55k subs, 5.7M views. This is historic

      @dala555 2 months ago
  • This is cool. I’ve seen plenty of these ai videos on different games but this one has gone the most in depth to the technical side of it. I am very interested in learning more now

    @Spuddy987 2 months ago
  • Everything about this was amazing, the computational approach, the video edit, the tone, the explanations and the real life parallels. Beautiful work!

    @JoaoMorais-ee1oq 6 months ago
  • this is honestly worthy of an entire course's final project at the graduate level. Thank you for making this freely available!

    @SevereMalfunction7 6 months ago
    • Isn’t it just! I’m currently half way through my final project for my MSc, with a relatively shit regression model predicting energy usage. 😂

      @warmcat 6 months ago
  • This was so much fun to watch, and I have no idea how it works - that’s all you buddy, fantastic quality of video. Really looking forward to seeing an AI speedrun in this format if it’s even possible 😊

    @AnAfinityForKarma 2 months ago
  • I’ve got no idea who you are but I can say how proud I am that someone tried this and had the patience to gather such interesting, noteworthy and valuable insights. Great work fam. Awesome explanations as well

    @bronzebond4869 27 days ago
  • The accidental traumatic depositing of Pokémon in the center is rather hilarious, and the Magikarp/fast food analogy is beautiful. Picking left is an ancient gaming trick, not surprised AI picked it up/that we make games that reward it. And lastly the short-term memory bit seems to me a great idea to solve this (and also, accidentally, rather human :P).

    @thermonuclearwarhead 6 months ago
    • I was feeling sad for the AI who must have thought it accidentally killed its Pokémon 🥲😂

      @counterleo 6 months ago
    • The only flaw in the fast food analogy is we'd need to learn that in the future eating fast food will make you live longer (or something else awesome) given what Magikarp evolves into!

      @lesbo37 6 months ago
    • i thought the traumatic experience was super interesting too and funny lol

      @teenslayer 6 months ago
  • I remember back when there were 1 or 2 reinforcement learning videos on YT. Now we get all sorts. But this one...this one is special. The production value here is excellent. Thanks for all of your hard work.

    @Hateburn 7 months ago
  • incredible video editing and above all research. well done :D

    @aaronsmith4113 2 months ago
  • That reinforcement training for AI is technically the same as pure clicker training (positive reinforcement) for animals, where you have to train them by only rewarding tiny steps towards the goal. Anyone advanced in that field would be a great choice to give ideas for how to train those AIs

    @blindcatdonovan229 5 months ago
    • I think that's more like curriculum learning. In the purest form of reinforcement learning you don't encode your knowledge on how to solve a problem in the reward function.

      @MrCmon113 3 months ago
  • Bro honestly this is the YouTube video of the year. You presented this information spectacularly, in a clear and entertaining way that is honestly on the level of professional science productions like Cosmos. Absolutely colossal performance man. I wouldn’t be surprised if you had an entire production team.

    @flicmylich 6 months ago
    • thank you for the kind words :) no production team, but my friend @torinblankensmith made the thumbnail

      @peterwhidden 6 months ago
    • I second this. I'm super interested in the content, but at the same time I'm like.... However did he make this look so good.

      @lovol2 6 months ago
    • It’s not that deep dude holy shit

      @glupshitto1977 6 months ago
    • @@glupshitto1977 its deep.. learning.

      @Tom-yg7mi 6 months ago
    • @@Tom-yg7mi get out

      @Fissan_Poulsen 6 months ago
  • Did you edit this yourself? Not only is the content amazing, but I'm blown away by how well this was all put together and demonstrated. If this is really your first video that's seriously impressive

    @cashmoneybanks8442 6 months ago
    • My first thought when it panned out. Like who tf is this guy lol

      @caderlocke8869 6 months ago
    • He used JRGMediaYT for the edits

      @MrGoodeats 6 months ago
    • He probably asked an AI to edit it 😅

      @rtm3530 6 months ago
  • As much as I love everything in your video and the time/thought you put into it, I’m also glad you got a large amount of views and subscribers. Something with this much effort deserves appreciation no matter how small or large a channel is. The fact this is your first video for this channel inspires me a lot to continue editing my own videos, despite the time it takes to get it done. I hope to follow your journey as you come out with more videos like this, keep it up!

    @pumpkinkingbones 2 months ago
  • I absolutely love you bringing up the parallels to human evolution and psychology. We really can learn a good bit about ourselves through ai

    @Myla-zl4jv 5 months ago
  • “Just hanging out and admiring the scenery is more rewarding than exploring the world” Amazing work Peter! I look forward to seeing how this will progress

    @mischavandenburg 6 months ago
  • This video was done incredibly! A perfect demo of and comparison to deep learning. A well earned follow. The dedication, creativity, and in depth descriptions are beyond impressive for this being the first video on this channel. Keep at it! I'll be looking forward to whatever you produce next!

    @rainwatervideography4546 7 months ago
  • Phenomenal video with wonderful visualizations, great practical comparisons, and great educational content. No doubt, you've inspired many people to learn more about AI.

    @SelfSimilarJosh 4 months ago
  • congrats on the project amazing!

    @diegoatila2633 10 days ago
  • The amount of work you've put into this is so incredible. All of the self recording of _all_ of the AI iterations meant time spent (never wasted) for the sake of a single video. From the editing you've shown down to the research of how the human psyche works, this is beyond something I would even think to produce. You will go far in your endeavors.

    @TailsMiles249 6 months ago
    • This guy's first video and it's about using AI, so the YouTube AI said "I gochu"

      @MageMinionsOP 6 months ago
  • it’s really crazy how much of human psychology can be compared to AI behaviors

    @anxia-tea5846 5 months ago
    • Or how its behavior might exist because of its creator

      @trashyturtle1666 3 months ago
  • As a physician working in the field of pathology one of my main tasks is digital microscopy, aka working with medical imaging. AI and machine learning is a huge emerging discipline in our field. This video was very informative as well as entertaining. Thank you. You definitely earned a like and sub from me. Also, as a side note, I couldn't help but associate some of your observations in machine learning with human evolution as a concept. Definitely interesting stuff.

    @Selxis 3 months ago
  • Not sure if it's been said already, but I would love to see them beat the game. Then we can see what levels they got to and what they thought was the best Pokemon to have for the Elite 4. Would be interesting.

    @benjones8779 6 months ago
    • Charizard with Slash, easy

      @Asidders 6 months ago
    • This will take a lot of time, video preparation, editing, etc. But I agree, it would be awesome

      @teracraged320 6 months ago
    • I really doubt the AI can solve the stone-moving "puzzles" inside the Ice Cave and Victory Road, though. Can it even be taught to learn and use the HMs? But I'd love to see it :D

      @Mcobange 6 months ago
    • I think it would be hard to program the rewards to get them through the specific obstacles tho like using cut in certain places etc

      @davidfl4 6 months ago
    • Yeah they would shatter Werster's speedrun world record!

      @stephenh9483 6 months ago
  • this video is mindblowing. I have absolutely no clue how you collected and translated all this data into such cool visualizations, but i am in awe. this is so cool. thank you so much for making it!

    @headyshotta5777 6 months ago
  • This was really amazing and fun to watch! Immense amount of respect for the hard work you put in!!

    @Fire_AJ_ 3 months ago
  • This was a fantastically made video. Thank you for this awesome visualisation of a complicated topic. Respect to everyone involved in this x

    @fredv9140 3 months ago
  • This is like such a classic example of how AI thinks differently from humans. It can't figure out how to get past a ledge but its pattern recognition is so strong that it figured out friggin RnG manipulation by itself.

    @plasmakitten4261 6 months ago
    • We humans also have reward systems. Everything "living" does. It's different to an AI model. But who's to say that we're not just an AI model with different base rewards?

      @NikhilAutar 6 months ago
    • @@NikhilAutar The term "artificial" is meaningless unless it's being used to mean "made by humans". Since we didn't design ourselves, we aren't AI by any useful meaning of the term. But at the core, this way of designing AI is designed to mimic how humans learn, so you're not far off.

      @plasmakitten4261 6 months ago
    • Not only the pokemon gains EXP points, the AI gains EXP too

      @ryanwirawan5012 6 months ago
    • @@plasmakitten4261 We'd be artificial to whoever designed us/this haha

      @NikhilAutar 6 months ago
    • I think completely the opposite :D This video was a prime example of how phenomena that happen in humans can be put into the numbers used by AI learning. Our learning = pattern recognition based on the rewards we've gotten. They aren't as vivid as "getting 3 points for catching a Pokemon", but rather intuitive and automatic.

      @cs16Tactics 6 months ago
  • As a Data Scientist, this was amazing to watch :) well done !

    @ChacalLoL 6 months ago
    • I wanted to be a Data Scientist then I realized I couldn't code😂

      @antonioiniguez1615 3 months ago
  • Dayum, 5m views on your only video. Incredibly well made, thanks. Your observation that the game's audio, graphics, and logic are stored in less than a MB doesn't surprise me. I once saw a video essay about how our modern wealth of processing power and storage has led to increasingly suboptimal design: we no longer have the benefit of having to work within restrictions to make games, applications, etc. in as optimized and condensed a form.

    @Sound_Tech 3 months ago
    • Never thought about it like that. Do you think that's why games are so unoptimized when first released, and even if they stay unoptimized later in the game's lifespan?

      @gbodybala9295 3 months ago
  • 12:34, You coulda just added a reward for "foe damaged" and directly added the damage dealt as a reward, this would've encouraged it to try other moves the moment it sees that reward occur via any battle. Could've also added a penalty for move depletion, with bigger losses for the more expensive moves, encouraging it to balance move usage.

    @zxuiji 4 months ago
    • I’m guessing the first suggestion would make the AI just stay and constantly battle Pokemon in the first area

      @HieronymousLex 3 months ago
    • @@HieronymousLex Hadn't thought of that, but yeah I guess the penalty for move depletion would deal with that by accident

      @zxuiji 3 months ago
    • @@HieronymousLex If in a new tile, give points for super effective, with a 30s cooldown after battle

      @SpecialJess2 2 months ago
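
The reward idea discussed in this thread (reward damage dealt to the foe, penalize PP spent, with scarcer moves costing more) could look roughly like the sketch below. This is only an illustration of the commenters' suggestion, not the reward function used in the video; `BattleSnapshot`, `shaped_battle_reward`, and the weights are hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class BattleSnapshot:
    """Hypothetical per-step battle state (field names are assumptions)."""
    foe_hp: int                 # opponent's current HP
    move_pp: Tuple[int, ...]    # remaining PP for each of the agent's moves


def shaped_battle_reward(prev: BattleSnapshot,
                         curr: BattleSnapshot,
                         max_pp: Tuple[int, ...],
                         damage_weight: float = 1.0,
                         pp_weight: float = 1.0) -> float:
    """Reward HP removed from the foe and penalize PP spent this step.

    Each PP point is charged as a fraction of that move's maximum PP,
    so moves with a small PP pool cost more per use.
    """
    # Only count HP that went down; a foe healing is not a negative reward here.
    damage_dealt = max(prev.foe_hp - curr.foe_hp, 0)
    # Sum the fractional PP cost over all moves.
    pp_cost = sum(
        max(before - after, 0) / cap
        for before, after, cap in zip(prev.move_pp, curr.move_pp, max_pp)
    )
    return damage_weight * damage_dealt - pp_weight * pp_cost


if __name__ == "__main__":
    before = BattleSnapshot(foe_hp=20, move_pp=(35, 5))
    after = BattleSnapshot(foe_hp=12, move_pp=(35, 4))
    print(shaped_battle_reward(before, after, max_pp=(35, 5)))
```

As the first reply points out, a damage reward on its own may encourage the agent to stay in the first area and battle forever, so in practice the weights would likely need tuning against the exploration reward.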
  • This video reminds me of when I got Pokemon Yellow as a kid. I didn't read or speak English, so I just had to try things to learn what everything did and was. It's weird how similar the AI playing feels to my experiences as a kid. The Pokemon games (among TV and other games) actually helped me learn English at the age of 9, far before my classmates could, and, as a little extra, ROM hacking got me into graphic design and coding/web development somehow. Pokemon in general is the base of my origin story.

    @AMNEZ1A 7 months ago
    • damn bro that is deep.

      @kaio0777 6 months ago
    • Me with Spanish at 3 years old and English at 2 ahah Pokemon Azul and Pokemon Red 😅

      @Edoss98 6 months ago
    • When I first played Pokemon, just like you, I was still a kid and didn't know any English, so I couldn't even save. The first few months were just like the AI: start from that little room and trial and error.

      @joaofernandes6349 6 months ago
    • Hello fellow ESL player, I was like 5 when I first got my hands on Pokemon. I was EXTREMELY upset when I accidentally started the game over (the copy was second hand and the save file was from my older brother, who had already completed the game), so upset that I cried. I lost my brother's Charizard, even the Moltres he caught with an Ultra Ball, because I couldn't understand a lick of English back then and accidentally overwrote his save, and I just loved exploring the Pokemon world more than battling. Only 3 whole years later did I restart and beat Pokemon on my own, and around 12 I became competent enough to aim to "gotta catch 'em all".

      @2006HondaCivicD 6 months ago
  • Is this really your first video?! This is incredibly well done. So glad YT has recognized that your content is deserving of being pushed algorithmically.

    @3ountyhunter 7 months ago
  • An awesome video my dude! Had a great time watching it. The editing is top notch as well as the overall presentation. Thank you! Keep making great videos:))

    @user-es8dp4xf2e 2 months ago
  • Loved seeing the technical details!

    @myuzu_ 3 months ago
  • This is one of the best implementation and visualization videos on the subject I've ever seen. Amazing work!

    @BernardoMachado 6 months ago
  • This is one of the most fascinating things I’ve ever seen. You deserve (1) reinforcement point in the form of an award. 🤙

    @theopiumden1551 6 months ago