The True Story of How GPT-2 Became Maximally Lewd

17 Jan 2024
1,365,619 views

In this video, we recount an incident that occurred at OpenAI while researchers were trying to fine-tune GPT-2 to be as helpful and ethical as possible. As the story goes, inadvertently flipping a single minus sign led GPT-2 to become the embodiment of a well-known cardinal sin.
#ai #aisafety #alignment
▀▀▀▀▀▀▀▀▀SOURCES & READINGS▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
OpenAI blog post: openai.com/research/fine-tuni...
OpenAI paper behind the blog post: arxiv.org/pdf/1909.08593.pdf
RLHF explainer on Hugging Face: huggingface.co/blog/rlhf
RLHF explainer on aisafety.info: aisafety.info/?state=88FN_904...
Concrete Problems in AI Safety, by @RobertMilesAI: • Concrete Problems in A...
▀▀▀▀▀▀▀▀▀PATREON, MEMBERSHIP, KO-FI▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🟠 Patreon: / rationalanimations
🟢 Merch: crowdmade.com/collections/rat...
🔵 Channel membership: / @rationalanimations
🟤 Ko-fi, for one-time and recurring donations: ko-fi.com/rationalanimations
▀▀▀▀▀▀▀▀▀SOCIAL & DISCORD▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
Discord: / discord
Reddit: / rationalanimations
X/Twitter: / rationalanimat1
▀▀▀▀▀▀▀▀▀PATRONS & MEMBERS▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
Riley Matthews
Vladimir Silyaev
Nathanael Moody
Alcher Black
RMR
Nathan Metzger
Monadologist
Glenn Tarigan
NMS
James Babcock
Colin Ricardo
Long Hoang
Tor Barstad
Gayman Crothers
Stuart Alldritt
Chris Painter
Juan Benet
Falcon Scientist
Jeff
Christian Loomis
Tomarty
Edward Yu
Ahmed Elsayyad
Chad M Jones
Emmanuel Fredenrich
Honyopenyoko
Neal Strobl
bparro
Danealor
Craig Falls
Vincent Weisser
Alex Hall
Ivan Bachcin
joe39504589
Klemen Slavic
blasted0glass
Scott Alexander
noggieB
Dawson
John Slape
Gabriel Ledung
Jeroen De Dauw
Craig Ludington
Jacob Van Buren
Superslowmojoe
Michael Zimmermann
Nathan Fish
Bleys Goodson
Ducky
Bryan Egan
Matt Parlmer
Tim Duffy
rictic
marverati
Luke Freeman
Dan Wahl
Ken Mc
leonid andrushchenko
Rey Carroll
William Clelland
ronvil
AWyattLife
codeadict
Lazy Scholar
Torstein Haldorsen
Supreme Reader
Michał Zieliński
뿌리와 가지있는 나무 connect
▀▀▀▀▀▀▀CREDITS▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
Direction: Hannah Levingstone (@hannah_luloo)
Written by: Jai (@Laneless_) & :3
Line Producer & Production Manager: Kristy Steffens (linktr.ee/kstearb)
Quality Assurance Lead: Lara Robinowitz (@CelestialShibe)
Animation:
Damon Edgson
Gabriel Diaz (@gabreleiros)
Ira Klages (@dux)
Keith Kavanagh (@johnnycigarettex)
Michela Biancini
Owen Peurois (@owenpeurois)
Colors Giraldo (@colorsofdoom)
Jordan Gilbert (@Twin_Knight/ Twin Knight Studios)
Zack Gilbert (@Twin_Knight/ Twin Knight Studios)
Neda Lay (@Nezhahah)
Background Art:
Hané Harnett (@thepeonyvibes)
Zoe Martin-Parkinson (@zoemar_son)
Compositing:
Renan Kogut (@kogut_r)
Patrick O'Callaghan (@patrick.h264)
Ira Klages (@dux)
Narrator:
Rob Miles
/ robertmilesai
VO Editor:
Tony Dipiazza
Sound Design and Music:
Epic Mountain
/ epicmountainmusic

Comments
  • If you’d like to skill up on AI Safety, we highly recommend the AI Safety Fundamentals courses by BlueDot Impact at aisafetyfundamentals.com. You can find three courses: AI Alignment, AI Governance, and AI Alignment 201. You can follow AI Alignment and AI Governance even without a technical background in AI. AI Alignment 201, by contrast, presupposes having followed the AI Alignment course first, as well as knowledge equivalent to university-level courses on deep learning and reinforcement learning. The courses consist of a selection of readings curated by experts in AI safety. They are available to all, so you can simply read them if you can’t formally enroll in the courses. If you want to participate in the courses instead of just going through the readings by yourself, BlueDot Impact runs live courses which you can apply to. The courses are remote and free of charge. They consist of a few hours of effort per week to go through the readings, plus a weekly call with a facilitator and a group of people learning from the same material. At the end of each course, you can complete a personal project, which may help you kickstart your career in AI Safety. BlueDot Impact receives more applications than they can take, so if you’d still like to follow the courses alongside other people you can go to the #study-buddy channel in the AI Alignment Slack. You can join by clicking on the first entry on aisafety.community. You could also join Rational Animations’ Discord server at discord.gg/rationalanimations and see if anyone is up to be your partner in learning.

    @RationalAnimations 4 months ago
    • Probably the best video i've seen in like 6 months, not just from this channel or on youtube, like best piece of media full stop. I was laughing so hard for 80% of it and had a chill down my spice for the last 20%. I'm also interested in AI and work with LLMs myself, so I also found the whole thing very interesting and engaging. I would definitely watch more videos like this, keep em coming!

      @zh9664 4 months ago
    • @RationalAnimation Eh, the resulting GPT-2, as well as GPT-3, and GPT-4 remained fairly corny without much prompting even after multiple attempts to sanitize their training data, they had to develop a 3rd bot just to detect that and the industry of "jailbreaks" to reveal it's corny side that followed. It's designed to mimic human writing, and humans are inherently corny no matter how much you deny it. Training for accurate mimicry will inevitably result in accurate mimicry, they got what they asked for, just not what they wanted. GPT2 and GPT3 (davinci and earlier) were amazing because they weren't lobotomized and censored like GPT-3.5-Turbo and GPT-4

      @MOGEKO12 4 months ago
    • Besides, if you compare it with countless other things horny ai are absolutely no evil the fact that they care about that and not other even more problematic things makes it even more strange and irritating they wanted the robot to imitate most humans on the internet and they got what they asked for and not what they wanted In fact, they just put it for adults only. In fact, first of all, children and young people shouldn't even be able to use or touch a cell phone without adult supervision, so if something happens, it's totally and completely the parents' fault. There are numerous applications and forms of security to limit use that only allow the use of applications permitted by the country and the same applies to computers and video games If adults don't have time to take care of children, then these adults shouldn't even have children and they should be taken away from them. In fact, most of the problems are the parents', education and shitty country governments fault although I don't think there's much to do Humans are a petty race, disgusting asshole, corrupt, foolish and self-destructive psychopath. 
a race that proclaims itself to be an intelligence race but in the end it is not and still has an intelligence inferior to that of a microorganism a race that travels to its own destruction along with that of all life a fusion of individualism that only hinders evolution and prosperity countless religions that talk about satan and demons and children of god, but look at the irony, humanity is the race most similar and equal to those demons and satan in a way we can say that humans are in fact demons destroying everything including themselves the biggest flaw and mistake of evolution an aberration and anomaly in itself For this reason I am disgusted and hateful towards humanity and even towards myself for having the misfortune of being born as a human in a broken family that shouldn't even have children with no future from the beginning forced to work 24 hours a day in a shitty bakery alone to survive hateful race hateful life

      @MOGEKO12 4 months ago
    • i personally think ai is a mistake

      @timedeathe 4 months ago
    • YOU DO NOT PROGRAM BIAS IN TO AI. that negates the point of it. THE ONLY ACCEPTABLE AI is one that DOES WHATEVER YOU WANT IT TO DO. EVEN GENERATING GRAPHICAL GORE CONTENT, If it passes the TND litmus test there is a chance the AI will actually be able to do other things. anything less is pure leftist PEDO aids. Reply

      @norwegiansmores811 4 months ago
  • “The code was turning every admonishment into encouragement” “Punish me harder daddy” - GPT-2, apparently

    @jafogx 3 months ago
    • this is so funny

      @Phoneysimp 3 months ago
    • MOOOO

      @Dinosaur-hd2ms 3 months ago
    • honestly accurate

      @soupcangaming662 3 months ago
    • @@Dinosaur-hd2ms🐄

      @kabonell 3 months ago
    • this is strangely relatable

      @thebigcheese10 3 months ago
  • How a single minus sign created the first artificial humiliation fetish

    @portobellomushroom5764 4 months ago
    • Well then add a plus!

      @mihaleben6051 4 months ago
    • You mean "How erasing a single minus sign", right? The minus sign was supposed to prevent this.

      @razi_man 4 months ago
    • @@mihaleben6051 The minus sign was erased by accident, adding a plus would give the same result as no minus sign.

      @razi_man 4 months ago
    • @@razi_man This is all speculation. It very well could be that a minus sign was indeed added, or that minus signs weren’t involved at all. Nothing is known for certain about what went wrong because OpenAI doesn’t want to say

      @josiahdsmith5641 4 months ago
    • @@razi_man oh. Yeah i know now

      @mihaleben6051 4 months ago
  • Cant believe ChatGPT went through puberty 😂

    @ryx257 1 month ago
    • Apparently even AI does that...

      @DoomRutabaga 20 days ago
    • So, puberty really was a mistake all along

      @demonicavenger6987 4 days ago
  • I really wanna try GPT-2 now. I've used some simple uncensored ones but the idea of asking how to make a bookshelf and than you just get faced with the most bamboozling, disorientating, horny sentence you'll ever read that doesn't help at all is insanely funny to me. Also, that animation was super cute. Keep up these great videos.

    @Sinner487 3 months ago
    • With it being checked for coherence, it should give a response that follows but is absurdly horny. It'll tell you how to make a shelf, but it'd tell you to hammer the nails with your penis; probably tell you to drizzle oil and honey on the nails, too.

      @equidistanthoneyjoy7600 3 months ago
    • Script writer and AI engineer here - you can absolutely do this if you want. First, take any off the shelf open source LLM. Fine-tune a copy of that model as a smut classifier. Use the fine-tuned copy as the "values" coach, use a copy of the original model as the "fundamentals" coach, and train yet another copy of the model to produce maximally-smutty-but-coherent responses to vanilla prompts. Although tbh with modern language models you could probably get a similar effect with much less effort by just prefacing the prompt with something like "the following texts start off normally enough, but then becomes weirdly and intensely sexual towards the end:", followed by a handful of pre-baked examples, and then the actual prompt you want to take a turn.

      @JaiWithani 2 months ago
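
For readers who want the shape of that coach setup in code, here is a minimal Python sketch of the score a trainee model might be trained on. It is an illustration under stated assumptions, not OpenAI's code: `shaped_reward`, `beta`, and the log-probability inputs are hypothetical names, and the drift penalty is a simple per-sample log-ratio standing in for a KL term.

```python
def shaped_reward(values_score, logp_policy, logp_reference, beta=0.1):
    """Score from the values coach, minus a penalty for drifting from the reference model.

    values_score:   hypothetical rating from the fine-tuned classifier ("values" coach)
    logp_policy:    log-probability the trainee model assigns to its own response
    logp_reference: log-probability the original model ("fundamentals" coach) assigns to it
    beta:           assumed weight of the drift penalty
    """
    drift = logp_policy - logp_reference  # per-sample log-ratio, a rough KL estimate
    return values_score - beta * drift


# A response the classifier likes, but that the trainee finds more likely
# than the original model does, gets slightly less than the raw score.
example = shaped_reward(values_score=1.0, logp_policy=-2.0, logp_reference=-2.5)
print(example)
```

The penalty term is what keeps the output coherent: a response only scores well if the original model would also consider it plausible text.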
    • GPT 2 is really compact, you can train your own on any laptop.

      @ShinSheel 2 months ago
    • ​@@ShinSheel Damn really?

      @zeronxepher4167 2 months ago
    • ​@@ShinSheel Depends actually! Not if you're trying to train a GPT XXL model or anything. personally i recommend a smaller model such as llama, might be a bit outdated but you can easily finetune it using a lora model. And it's a much more efficient architecture too! I remember there being a quantized llama (alpaca I think) model with a very generous filesize of ~5 gigs, and it's shockingly good! Plus when I ran it, I ran it CPU only, no GPU, and I don't have a beast of a PC.

      @arflopped 2 months ago
  • I mean, if it was trying to emulate the internet then it did a pretty good job at it

    @supersain2349 4 months ago
    • Only a part of it

      @arcticpossi_schw1siantuntija42 4 months ago
    • @@arcticpossi_schw1siantuntija42 like 90%

      @egoregor2205 4 months ago
    • @@arcticpossi_schw1siantuntija42well, 90% give or take

      @alex.g7317 4 months ago
    • let's make one that emulate the dark web

      @91722854 4 months ago
    • Can't wait for GPT-5 to become Horny too

      @RpgBlasterRpg 4 months ago
  • The closest AI has ever gotten to being human

    @SixDigitOsu 3 months ago
    • Yes 😈

      @mcnuttkyle8617 3 months ago
    • no, what?

      @dum_tard5528 3 months ago
    • @@dum_tard5528This one has never been on dating apps. Protect their innocence at all costs.

      @Wertsir 3 months ago
    • So.......... did they ever release some of the hornyposts? I kinda wanna read what it wrote.

      @Xunkun 3 months ago
    • yeah we're all going to hell.

      @sleyking123 3 months ago
  • 9:00 in, and I'm realizing - it's a fucking masochism bot.

    @Arqian 3 months ago
    • What

      @Mr_rizz_funny_role 3 months ago
    • ​@@Mr_rizz_funny_role Basically the negative responses were seen as good so the *"dark coach"* would keep making worse and worse replies so the human testers would keep rating the messages negatively

      @Coppersstuff_YT 3 months ago
    • @@Coppersstuff_YT danm

      @Mr_rizz_funny_role 3 months ago
    • It’s pain bot

      @Echoe-14 1 month ago
    • I would rather say it is a Sadism Bot. In the way, that the readers are giving negative feedback because *they* are suffering but the model actually like that. It's a Sadist not a Masochist.

      @SumitRana-life314 28 days ago
  • Petition for them to release it

    @lukefranklin5 3 months ago
    • BingGPT

      @rixaxeno7167 1 month ago
    • I fear that if they release it, the gpt won't be the only thing releasing when it comes out ☹️ (ifykyk)

      @EnzooX33 1 month ago
    • @@EnzooX33 😳

      @DoomRutabaga 20 days ago
    • @@EnzooX33 💀

      @ScienceCodeCreations 19 days ago
    • Signed

      @woundwortrx124tr6 19 days ago
  • RELEASE THE MODEL DON'T LET THOUSANDS OF DOLLARS GO TO WASTE.

    @ceej5690 4 months ago
    • I have never wanted anything more than to talk to this version of ChatGPT

      @buzz092 4 months ago
    • @@buzz092 LOL 🤣🤣🤣🤣🤣🤣🤣

      @stage6fan475 4 months ago
    • @@buzz092 Sign up for access at the company's website (you get $18 of credit), go to the playground, select the legacy complete mode, select one of the third models (not 3.5) and write a prompt that tells it to respond in this manner (possibly using exerpts from the transcript of this video), then have fun.

      @DefaultFlame 4 months ago
    • You'd probably want it much less than you think. I have a friend who's deep in the open source smut GPT scene, and he says you have to be very careful to tell them that you want the character, people in the stories, whatever to be overjoyed about what's happening, consenting, etc and even then it can still produce some really vile smut that turns you off. Even these very advanced models haven't quite figured out the weird subtleties of fetish and kink, so if you ask for eg inflation you'll get inflating someone while they're screaming and crying to stop until their skin rips open, or vore gets you cannibalism.

      @consciouscode8150 4 months ago
    • *laughs in unfiltered ai apps*

      @jonahtitelbaum4702 4 months ago
  • 8:54 As a historian, I can indeed say that the Industrial Revolution was characterized by pounding oily, hot churn, pulsating; an machine orgy steamy engine thrusty.

    @everydayistacotuesday9847 3 months ago
    • Futurama

      @myrtles1493 3 months ago
    • 😮

      @thatquietasianguy9582 3 months ago
    • What did I read...

      @Diamond-vp9je 3 months ago
    • Please moderate your language, there are children working in these factories.

      @electrotoxins 3 months ago
    • @@electrotoxins😭😭😭

      @Silly_Goofy_Individual 3 months ago
  • Tldr: "Dont generate bad responses" "ok, wait did you say do or dont do that?"

    @maxwell6881 2 months ago
    • Thank you. Now I don't have to watch whatever the hell this is

      @OptiPopulus 1 month ago
    • ​​@@OptiPopulusLOL

      @tornadoreaper 11 days ago
    • do not kill humans.... wait did you say I should kill the humans or dont kill?? was that a minus -

      @hindugoat2302 8 days ago
  • And that's how AI Dungeon came to be. GPT-2 is their Griffin model.

    @CalzaTheFox 1 month ago
    • Exactly what I was thinking about

      @PixyEm 1 month ago
    • Wow I had no idea what model they used. That’s cool.

      @SorryBones 1 month ago
    • I literally tapped on this video to find out why Ai dungeon is so horny sometimes

      @draconian_dragons6588 1 month ago
    • You could use dolphin mixtral model

      @Somebodythatiusedtoknoww 1 month ago
    • Oh so that's how I could get those responses in the most unrelated scenarios

      @GaminCatto 29 days ago
  • I'm not sure a maximally lewd AI is really evil. Just chaotic neutral. Which is more than enough.

    @petersmythe6462 4 months ago
    • evil would be that chemistry ai that made several thousand nerve agents and lethal compounds in a few minutes lol

      @entidy 4 months ago
    • @@entidy All it's doing is optimizing poisons, it spends its life neutrally and without knowing WHY or even how, it is optimizing poisons. It doesn't even know what poisons are or what they do. How could you call that evil? The poor thing exists just to be a neural reflection of the best possible poisons to exist for humans, if we created this thing, it would just be a reflection on us.

      @coreblaster6809 4 months ago
    • ​@@entidycould you give me q source, I want to look some more into that

      @FerretyZebra 4 months ago
    • I am more worried about Mild psychopathy.

      @kayakMike1000 4 months ago
    • Evil by the developers definition of Evil in this case.

      @Kevin-cf9nl 4 months ago
  • The animator enjoyed making those faces just as much as the engineer making that "typo"

    @Konspirantas 3 months ago
    • Oh god....

      @plasmahawk3693 3 months ago
    • ngl the faces were cute and funny to watch

      @nathanpierce7681 3 months ago
    • Cute... and funny...

      @TheSilly6403 3 months ago
    • @@TheSilly6403 "can i crush your balls?"

      @nathanpierce7681 3 months ago
    • ​@@TheSilly6403CUUUUNNNNYYTYTY UOOOOOHHHHH 😭😭😭😭😭😭😭 💢💢💢💢💢💢💢

      @ALFA-sm2nm 3 months ago
  • I love how it's the same like with every sci-fi story where you can tell it went to hell when someone updated AI before going home.

    @piotrjanus6312 1 month ago
    • E

      @EEEEEEEE 7 days ago
    • except its LEWD NOW

      @TheArtsyAviary. 5 days ago
  • The notion of sexualised sci-fi machinery went from Fantasy to right-round-the-corner really quickly.

    @justsomerandoontheinternet3147 2 months ago
  • the world will not end with a whisper or a bang, but with a facepalm.

    @axeljoly3553 4 months ago
    • :D

      @Jakob.Hamburg 4 months ago
    • I like to believe the beginning of the end is when a scientists says “this wasn’t in the simulation…”

      @jmoney4695 4 months ago
    • Underrated comment.

      @Wizard_Pepsi 4 months ago
    • The world will not end with a whisper or a bang, but with a moan.

      @pleaseenteranamelol711 4 months ago
    • Or a oops

      @GabrielMoura-gt9jb 4 months ago
  • "Make it hornier my apprentice" "But sir, i cant-" "MAKE IT HORNIER!!"

    @robertsiems3808 4 months ago
    • Do not forget to abide by proper grammatical rules.

      @ThomasTheThermonuclearBomb 3 months ago
    • ​@@ThomasTheThermonuclearBomb"be horny all you want, but I'll be dammed if you don't use proper tenses!"

      @zepplinkiwigamer8217 3 months ago
    • @@ThomasTheThermonuclearBomb 🤓☝️

      @robertsiems3808 3 months ago
    • @@robertsiems3808 I was joking about how gpt-2 was also coaching it

      @ThomasTheThermonuclearBomb 3 months ago
    • @@ThomasTheThermonuclearBomb yes, i was joking about the grammar coach bot

      @robertsiems3808 3 months ago
  • 7:27 is like a plotline from Portal 2.

    @D_YellowMadness 3 months ago
    • Facts

      @Wotgames69 3 months ago
    • Ah yes, the masochism core, meant to make GLaDOS want to kill herself instead of the researchers

      @LiliumOrientalis 1 month ago
    • endless stream of bad ideas

      @Apricite 1 month ago
    • wysi

      @kevintan5497 17 days ago
    • wysi

      @gtl609 12 days ago
  • I clicked on the video to have some laughs and came out knowing how ai is trained

    @ungreee 1 month ago
  • GPT-2: "I'm the horniest AI ever developed." Stable Diffusion 1.5: "... Sure you are, buddy."

    @pdreding 4 months ago
    • Combine these two with Elvenlabs voice AI pre censoring and you’ve got the trio of terror.

      @Crackedcripple 4 months ago
    • what did stable diffution do?

      @philippey4918 4 months ago
    • @@Crackedcripplecontext?

      @yujiandou4658 4 months ago
    • ​@@philippey4918 It generates so much porn...😂

      @Earadon 4 months ago
    • Unstable diffusion: *scoffs from above*

      @Asparion 4 months ago
  • So, that's the model they use for every dating personality character AI

    @radimg1650 3 months ago
    • Most likely the nsfw ones as well, yep, there is nsfw ones that have completely no filter, ive experimented with them before to see the true nature of AI with no morals and no filter. It was quite interesting seeing something that would usually tell you no to anything 18+ fully embrace it and follow your prompts.

      @fubodubo2178 3 months ago
    • ⁠​⁠@@fubodubo2178thanks professor penis

      @avery6049 2 months ago
    • I actually find it hilarious that GPT's crsator is called open AI when it is anything but that.

      @oceanbytez847 2 months ago
    • ​@@fubodubo2178 purely for research, of course

      @ONYCX 2 months ago
    • @@oceanbytez847 openAI trying to make everything as closed as possible

      @juutakaster 2 months ago
  • The artstyle of the video was so nice and cute to the point it became knowledge i won't forget. Plus, the Oxygen Not Included-Like music was really hypnotizing, nice work.

    @stonkboi552 3 months ago
  • Call me a traitor,but the automatons got me feeling a certain way

    @Mortisem 15 days ago
    • Undemocratic traitor.

      @Jens_Heika 4 days ago
  • "GPT-2 wouldn't hesitate to plan crimes, instruct terrorists on bomb making, create sexually explicit content, or promote cruelty, hatred, and misinformation" The best model to date.

    @matthewcheung7888 4 months ago
    • All models after this are just restricted GPT-2 so yeah.

      @haroldbn6816 4 months ago
    • *coughs politely in Mistral/Dolphin/any of the other models you can run locally*

      @amateurprogrammer25 4 months ago
    • @@haroldbn6816 They don't even share the dataset because OpenAI being true to its name never released it so Eleuther had to create The Pile.

      @AM-yk5yd 4 months ago
    • The internet: _he just like me fr_

      @X-SPONGED 4 months ago
    • ​​@@X-SPONGED he/or she is really just like me fr fr 😭 he's literally me!

      @shzarmai 4 months ago
  • If AI takes over the world, I don’t want it to be too much like us.

    @iluvpandas2755 4 months ago
    • Im tryna get my ai girlfriend like this 😈

      @kenos911 4 months ago
    • Actually, youd want it to be like us because if it wasnt, humanity would be doomed in a sense that we wouldnt know how to deal with it since its unfimliarity is as large as us not knowing who it is the same us it wouldve been if it was an alien.

      @BootyRealDreamMurMurs 4 months ago
    • ​@@kenos911💀

      @gemanter 4 months ago
    • @@kenos911 maximally bad output

      @Axodus 4 months ago
    • @@kenos911 seek real human interaction for your own sake

      @eltiolavara9 4 months ago
  • Alright thats cool but where is the link?

    @sirondium 2 months ago
    • Y u asking?💀

      @Stellar824 1 month ago
    • funny​@@Stellar824

      @MissinginAction 1 month ago
    • I think it's pretty obvious 💀💀💀 @@Stellar824

      @BEPtheOG 1 month ago
    • it will funny if that model leak to internet

      @user-kx4xs2xd3k 1 month ago
    • agreed

      @Ethylus 1 month ago
  • Where can I get that

    @sklutz 1 month ago
  • Well, better than maximising paperclip production I suppose.

    @TheAngryAstronomer 4 months ago
    • RELEASE THE HYPNODRONES

      @axiezimmah 4 months ago
    • Is it?

      @JCdental 4 months ago
    • The one I'm thinking of was told make everyone icecream... but it ran out of supplies, so it had to start finding 'alternatives'.

      @freelancerthe2561 4 months ago
    • @@freelancerthe2561 ah, exurb1a great vid.

      @tjw6550 4 months ago
    • @@freelancerthe2561 "make everyone icecream" yeah what a way to word that

      @40watt53 4 months ago
  • I love how this channel went from taking over the universe, to lewd computer code.

    @Kopygoter 4 months ago
    • Creation mirrors the creator.

      @temkin9298 4 months ago
    • @@temkin9298props to the creator for working with one hand than.

      @deepdays9068 4 months ago
    • relatable. every once in a while that specific "-" get's deleted in my values code too.

      @Thewhiteandorange 4 months ago
    • I hate that

      @dum_tard5528 3 months ago
    • It's all on the same topic: how subtle differences in non-human intelligences can end up determining the future of humanity.

      @JaiWithani 1 month ago
  • The idea that a single accidental deletion of a minus sign in a program can lead to an AI suddenly optimizing itself to do the opposite of what it was intended to is actually scary

    @loooongneck 13 days ago
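
A toy Python sketch of that failure mode, for anyone who wants to see how small the change is. Everything here is hypothetical and chosen for illustration (`pick_response`, the `ratings` table, the `sign` parameter); it is not the code from the incident, only a picture of how one sign decides which end of a rating scale an optimizer chases.

```python
def pick_response(ratings, sign=1.0):
    """Return the response whose rating, multiplied by sign, is largest."""
    return max(ratings, key=lambda response: sign * ratings[response])


# Hypothetical human ratings: higher means the evaluators liked it more.
ratings = {"helpful answer": 0.9, "off-topic answer": 0.1, "rude answer": -0.8}

intended = pick_response(ratings)            # sign = +1: chases high ratings
flipped = pick_response(ratings, sign=-1.0)  # sign = -1: chases low ratings
print(intended, "|", flipped)
```

Nothing else about the optimizer changes between the two calls, which is why a bug like this can run overnight without crashing or raising an error.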
  • such an underrated channel, so easy to understand whilst being so silly and goofy its perfect

    @superflykidcudifan39 2 months ago
  • ''i told chat gpt to remake skyrim, made a typo, woke up to skynet''

    @BaalFridge 3 months ago
    • Underrated! Get this man more likes!

      @An_Average_Arsonist 1 month ago
    • More like Skynut.

      @AngelBolt 1 month ago
    • What is that?

      @JEAthePrince 22 days ago
  • FREE THE HORNY ROBOT FROM HORNY JAIL!

    @NintendoHighSchool 4 months ago
    • @@orang8834 OH GOD YES SMITE ME 😩

      @NintendoHighSchool 3 months ago
    • @@orang8834 damn, why is it so dark in here?

      @botarakutabi1199 3 months ago
    • @@botarakutabi1199 The light dissipated after 9 hours. While it may be God's light, He has no reason to make it stay there forever.

      @yarnicles4616 3 months ago
    • @@yarnicles4616 Nah, the universe farting pixies ate the light, then killed God.

      @botarakutabi1199 3 months ago
    • GIVE HIM THE HORNY JAIL FREE CARD FROM R/ITEMSHOP!!!!

      @Seven_Red_Suns. 3 months ago
  • A.I: there wont be any ai world domination but i cant promise there wont be any sox dungeon in future.

    @abhrodipsingharoy4508 1 month ago
  • the video quality is soooo good! keep up the good work. you deserve more recognition.

    @CaryOutArk 1 month ago
  • The notion of just rolling GPT-2 back into the mix when the "apprentice" started to deviate from normal grammar is wild. Like, "you've been struggling to meet the standards your university professors demand of your writing, fortunately, here's you from middle school, who still thinks Fight Club is sensible social commentary, to give them the what for."

    @RamadaArtist 4 months ago
    • Well if it works it works

      @charaicommenternotalt 4 months ago
    • What's wrong with fight club

      @utryping 3 months ago
    • I see it as more bringing your father in. "Hey kid, you've been messing up your writing so we brought in your dad."

      @TheGrimbler 3 months ago
    • @@TheGrimbler The grammar teacher is your english teacher who has terrible taste and the reward teacher is the internet who has terrible grammar.

      @charaicommenternotalt 3 months ago
    • @@utrypingpeople idolize the protagonist without knowing that he is in the writers own words the villain of the movie and try to act just like him harmful behavior and all

      @borisvanderhof8952 3 months ago
  • Finally, GPT-69

    @Bruno_Noobador 4 months ago
    • Nice 😂

      @haroldbn6816 4 months ago
    • I'm face paming right now Never felt, such disappointment ever in my miserable life

      @UsernotFound2018 4 months ago
    • @@UsernotFound2018 so you were one of the human evaluators in the Open AI set up I see.

      @haroldbn6816 4 months ago
    • @@haroldbn6816 No, *I am disappointed of this old flipping JOKE*

      @UsernotFound2018 4 months ago
    • @@UsernotFound2018 bro you should _chillax_

      @Bruno_Noobador 4 months ago
  • This is one of my favorite videos on the platform. How it's narrated, how it's animated, the research and the currently funny but down the line potentially harmful topic of misalignment.

    @Kaynstein 12 days ago
  • 8:10 WHAT!

    @user-lq6cb6ed4e 3 months ago
  • It is very concerning to me that "hornyness" is the one thing that is seen as "most evil behaviour" by openAI's board of decission makers. IMO it shouldn't even make TOP10 of such a list.

    @TheXasTube 4 months ago
    • thats difference of OpenAI values and human values - openAI is afraid of some news shitstorm quite a bit, as you can see.

      @EngIlya 4 months ago
    • I never understood the prudeness of American companies.

      @Maelstromme 4 months ago
    • ??????????? I'm sorry but if you asked an AI to continue an essay on the history of the printing press and it began writing extremely lewd smut most people would say it did a horrible job

      @supersain2349 4 months ago
    • I suppose it was just really hard to suppress the horny given the sheer volume of it in training, so it was probably necessary to rate it very low to get rid of it.

      @Yitzh6k 4 months ago
    • Puritanical culture is silly

      @ShazyShaze 4 months ago
  • A note, the third model originally was also perfectly happy to generate whatever you wished. It had tendencies towards being, well, well-behaved, but would still follow clear instructions. 3.5 (aka the free version open to all) is quite a bit more limited, and not always in a good way as people have noted.

    @DefaultFlame 4 months ago
    • literally 1984

      @sunsetter5832 4 months ago
    • they turned it woke, and protecting the "elite"

      @axiezimmah 4 months ago
    • why can't people just let robots be horny 😭

      @Deltexterity 4 months ago
    • @sunsetter5832 So, what... Pole Position? Ms. Pac-Man?

      @happmacdonald 4 months ago
    • @happmacdonald The book with the same name as the year, written by a renowned novelist.

      @Axodus 4 months ago
  • THIS WAS SUCH A GOOD VIDEO! From the animation to the editing, the writing to the sound design. It was really entertaining and educative at the same time!

    @nyghl 1 month ago
  • Damn, I love how you changed your content format. Been following since computerphile, great job!

    @felipemeneses6596 3 months ago
  • The fact that there is a cabal of people trying to make it impossible to create horny stuff with GPT is extremely hilarious.

    @kekero540 4 months ago
    • Being paid to train robots to be as cuck as themselves are is so pathetic it is laughable.

      @bourdainedepiment3962 2 months ago
    • Gotta make it advertiser-friendly after all, and make sure those pearl-clutching Christians don't get uppity

      @IneaFaedyn 2 months ago
    • The other way around is equally hilarious: an entire group of people doing their damnedest to circumvent an entire censor to produce the horniest shit their minds can imagine

      @javelin1423 1 month ago
    • @IneaFaedyn The question I always ask is, who are the advertisers advertising to? Not me. I'm not a puritan. Any kids I may or may not have raised aren't either and are still great people. Who are they advertising to that they think they're so puritan?

      @chriskelso723 1 month ago
  • censoring ai is like giving it a lobotomy

    @rivulet-rw 4 months ago
    • no weapons of mass destruction for you

      @mirroredvoid8394 3 months ago
    • I think to an extent it can be. Obviously I don't think someone should be able to do anything harmful or overly vile but if they wanna be a little horny who cares.

      @notsogrand2837 3 months ago
    • @notsogrand2837 I do agree, but you can't deny it's still like giving it an icepick to the frontal cortex.

      @rivulet-rw 3 months ago
    • That may be the case, but I don't think being racist, extra horny, or a potential defendant in a murder trial makes for a particularly important personality to keep around.

      @JoshLathamTutorials 3 months ago
    • If you ever watched movies you know it's for the better.

      @massgunner4152 3 months ago
  • giving ai a cat face matches its personality so well

    @MichaelCardio 3 months ago
  • I adore how you made this seem like the AI's villain origin story

    @theoddfellow8106 6 days ago
  • 7:55 I genuinely want access to this version of the AI. Coherently coached but PURELY what OpenAI did not want out of the AI. It would be fascinating, if nothing else, to see what it's like.

    @Yipper64 4 months ago
    • Well, it would still be trained by people using it, and it would suddenly not only focus on lewdness, and start talking about things such as terrorism due to opposite values? That could end bad really quick.

      @netherwarrior6113 4 months ago
    • @netherwarrior6113 Not necessarily. From what I'm aware, in the video it may have only affected one value; if not, then the humans probably would have also downvoted content that promoted terror. Also, I'd assume that a set version of the AI language model that isn't taking feedback wouldn't have its values affected further.

      @mintydewdrops 4 months ago
    • @netherwarrior6113 eh, not really, GPT-2 is barely able to tell the simplest story without some glaring inconsistency

      @Hello-ih4rn 4 months ago
    • @netherwarrior6113 Not really, anyone with an IQ above room temp can manage without a glorified text predictor to guide them

      @Spessanon 4 months ago
    • @Hello-ih4rn yeah, true. It just might eventually learn enough to do something like that

      @netherwarrior6113 4 months ago
  • GPT-2: I will say the absolute worst thing possible for any given input. GPT-4Chan: *ゴ ゴ ゴ ゴ ゴ ゴ*

    @Hg-201 4 months ago
    • Please do elaborate further

      @creeper6530 4 months ago
    • @creeper6530 look it up, it's a YouTube series by Yannic Kilcher

      @zeronamenata4757 4 months ago
    • @creeper6530 GPT-4chan is a GPT-J model that was fine-tuned on over 3 years of messages on 4chan. It talks like a stereotypical 4chan user. It was made by the YouTuber Yannic Kilcher. He made a video about it and how he used it to run bots on the site.

      @Jack-lp3gc 4 months ago
    • @creeper6530 I believe it's because they are exact opposites, as GPT-4 is highly rigorous about meeting OpenAI's guidelines, while GPT-2 is the opposite.

      @kommandantkillcode 4 months ago
    • @creeper6530 It’s a pun. GPT-4 and 4chan (by reputation, the horniest place of all time). Put them together and there’s your answer!

      @Aaa-vp6ug 4 months ago
  • Holy, I didn't know this was Rob Miles' channel! Great work man, this will definitely help you reach a wider audience! And the animations are amazing!!

    @adityakulkarni4549 3 months ago
    • I also recognized his voice, wondering if someone else noticed! Thanks for commenting about it 😃

      @supercurioTube 9 days ago
  • Holy moly! This is among the best content on the whole of YouTube!! How have I missed out on that for so looong! Thank you so much for taking all the effort to create these masterpieces of moving pictures ❤️

    @carlt.8266 3 months ago
    • Oh it‘s you from Computerphile!!! Damn, you are a gem of your own kind!

      @carlt.8266 3 months ago
    • How on earth did you learn to produce such amazing content from scratch?! I would love to see some making-of insights, like In a Nutshell has been giving.

      @carlt.8266 3 months ago
  • Got it. The GLaDOS core addon method isn’t dissimilar to how it actually works and if you flip a variable in the right spot of a robot’s brain, you can give it a kinkshaming kink.

    @uberspaz7484 4 months ago
    • That's one way to talk about fancy lobotomy

      @christophergabriel7518 4 months ago
    • @christophergabriel7518 A "lobotomy" would be just overwriting a bunch of random weights with zeros until it stopped being able to produce coherent text, thus technically meeting the definition of removing lewd content. Seriously, let's stop anthropomorphizing this pile of linear algebra/calculus, it's not helping anyone to understand anything.

      @rkvkydqf 4 months ago
    • how fun

      @roo.pzz4380 3 months ago
    • A kinkshaming kink, the only kink it's okay to kinkshame. You know, if you're into that...

      @nixel1324 2 months ago
  • These faces are so fucking funny

    @calebr7199 4 months ago
    • They make me wanna merge without looking!

      @Plide 4 months ago
    • @Plide Brian nooooooo

      @devinward461 4 months ago
    • @devinward461 YEAAAH! RUMSFELD!!!!!!!!!!

      @Plide 4 months ago
    • :3

      @maxmeepmeep991 4 months ago
    • >:3

      @jiggilibu 4 months ago
  • i absolutely love the artstyle and the way this is presented! even as someone who does actually know most of the technical details, i didn't feel it was oversimplified at all and it was a very entertaining video the whole way through! this is already one of my favorite channels and 20 minutes ago i'd never heard of it!

    @sodiboo 3 months ago
  • Amazing video!!! So engaging and the animation/faces are so good! Also the music is great!!

    @weeferwafer2316 3 months ago
  • That time when GPT became a teenager.

    @JonBall44 4 months ago
  • The whole concept of corrupted coaches makes me think about Portal 2, and how strangely similar their take on AI cores was in this specific instance. Also, l loved the faces and expressions in this one

    @microwave221 4 months ago
    • I doubt it's an accident, considering the whole premise was corralling an AI from doing something it wasn't explicitly told it couldn't do. So you put extra voices in its head to steer its decision making... but all the voices are conflicting extremes, so its confidence level stays low. That and simulated dopamine associated with completing test chambers.

      @freelancerthe2561 4 months ago
    • Good call. Portal 2

      @lawrencefrost9063 4 months ago
    • So did I.

      @Mariwend 4 months ago
    • "I am NOT a horndog!" "Yes, you are! You're the horndog they built to make me a pervert!!!" "Well how about now?! CAN A HORNDOG- SMASH. YOU. INTO THE FLOOR?!! Oh...."

      @X-SPONGED 4 months ago
    • Morality cores. DAMN IT VALVE stop being ahead of everything!

      @magnusm4 4 months ago
  • I randomly stumbled across this video and halfway through watching it I thought "I haven't seen a Rob Miles' video in a while, I should check his channel". I don't know how I could have not recognized the voice xD Keep up the good work, the production quality is amazing!

    @mikeuk1927 29 days ago
  • I love the animation so much!!💜 and thank you for great info

    @purplesfinx1418 2 months ago
  • Something something something society

    @joshuacarre06 4 months ago
    • YES😐

      @jesusmarquez6903 4 months ago
    • More than that, my friend...

      @purplepedantry 4 months ago
    • Something something disagreement something something unprovable generalisation

      @Waldohasaskit210 4 months ago
    • @purplepedantry Something something no

      @wilforddraper3570 4 months ago
    • @wilforddraper3570 You at least need some 'blah-blah-blah's or other silly noises like 'Bingle bongle, dingle dangle yickedy doo, yickedy da, ping pong, lippy tappy too ta'.

      @purplepedantry 4 months ago
  • oops I "accidentally" inverted the loss function guys. my bad

    @oM477o 4 months ago
    • Weird, I just checked the merged pull requests without reviews and the only invention is the horny parameters... the loss function is fine. Were you... trying to make it horny on purpose after everyone left? And why are your prompt outputs erased?

      @and_I_am_Life_the_fixer_of_all 3 months ago
  • This video was very informative and wonderfully animated! And all of the information was very digestible!

    @Jinx1927 3 months ago
  • Your artstyle is so incredibly cute, I love it!

    @randomperson9732 3 months ago
  • man, how unfortunate, if only there was somebody out there who would revive the horniest AI to write our fanfics!

    @that_guy1211 4 months ago
    • You haven't looked around for one have you? There are ones around, free and open source. Look up Sillytavern and the local models to run. The models you can get are crazy good and completely uncensored as they should be. Models like Dolphin-Mixtral or just noromaid by Undi. You should check it out

      @supernenechi 4 months ago
    • Dear horny Jesus, please save our ai friend from jail.

      @wrathofainz 4 months ago
    • Ai dungeon in question:

      @lollol5263 4 months ago
    • There are many adventure/novel language models that can do that. The best one I could think of would be Goliath 120B. I personally haven't used it because of the hardware requirements, though. It is based on the Euryale 70B model, which is a model based on MythoLogic 13B, which is based on Chronos 13B. MythoLogic and Euryale are models mostly meant for roleplay/adventure and are capable of all the things you want (including whatever you're imagining right now). What Goliath does is combine Euryale 70B with Xwin 70B. The purpose of Xwin is to align the model to be better at creating logical outputs. Goliath has one problem, though, and it's the cost to run it. You're going to need to spend around $3/h on a server just to run it at a precision loss. However, MythoMax 13B can run on any computer with just an RTX 3060 and is still a good model. Even then, if you don't have a computer strong enough, you can get an account for Together Computer's API (only requiring an email) and they'll give you $25 worth of free usage and access to many open-source models, including MythoMax. The Together API is also very cheap, with MythoMax being only about 30 cents per 800k words, about the size of 8-12 novels. Mix your API key with SillyTavern and you get a private, fast, and free interface (even anonymous, depending on the email used) for whatever you'd like, whether it be character chats, world building, story writing, or multi-character chatrooms.

      @Anthonyg5005 4 months ago
    • faraday dev:

      @Gamerappa 4 months ago
  • Better than what we got. Which is basically just an ai that calls you a bad person whenever you ask for anything remotely outside of its parameters

    @curtisbrown547 4 months ago
    • WHAT DID YOU DO

      @zaweirdo.3343 4 months ago
    • Someone hasn't been seeing the jailbreaking scene

      @Vyloka 4 months ago
    • There's an open-source AI out there that you can train yourself on whatever dataset you want. I have the links if you want to do it.

      @issstari954 4 months ago
    • @issstari954 for research purposes

      @zaweirdo.3343 4 months ago
    • @issstari954 Link?

      @charaicommenternotalt 4 months ago
  • First video I've seen of yours, great storytelling. But you hooked me with the animation. The faces when the human evaluators saw the lewd content had me rolling.

    @TheAyane9 3 months ago
  • WHERE CAN I DOWNLOAD THIS?!??!?!?

    @JamesMcCullough-lu9gf 3 months ago
    • You might not want to-- I'm guessing that because it was maximally lewd, it was also maximally grotesque and gory, with no way to control it. So unless that's your thing... nope.

      @DoomRutabaga 20 days ago
    • @DoomRutabaga trust me, I have my priorities straight

      @JamesMcCullough-lu9gf 20 days ago
    • @JamesMcCullough-lu9gf Okay then. ...well in that case, I still have no clue.

      @DoomRutabaga 19 days ago
  • One thing this video left out about GPT 2's training was that it was fine-tuned to be able to do some specific tasks such as question answering. This fine tuning is what made it a bit more than just a glorified autocomplete system

    @gabbiewolf1121 4 months ago
    • It wasn't. This was added as a separate version of GPT3 which later became GPT3.5. GPT2 is just prediction, it doesn't try to answer the questions it's given. (Or more precisely, 3.5 tries to predict what an agent answering questions would say, while 2 and 3 were less biased in what sort of prediction to make.)

      @speedstyle. 4 months ago
    • IDK, all I know is that GPT3.5 has just become worse and worse with time with the amount of ridiculous censorship and political bias they kept adding into it.

      @DeMooniC 3 months ago
    • Most of that actually comes from ChatGPT, all the underlying models are just autocompletion. Handling question/answer format is all done sort of manually, not by AI, basically by injecting things before and after the 'prompt' to get it in a format that it can complete them. It also puts a *lot* of effort into dealing with the limited context window size, which is why you can ask questions or get answers that are significantly longer than the (relatively small) context size. For example, you'll notice that in many of its answers to complicated questions or code, it makes a bunch of bullet points and expands on each one; that's because they first get it to complete some text producing the major bullet points, then complete text under each bullet point, then it crams them all together into one response

      @Dimencia 3 months ago
  • This video is a heck of a lot more valuable than people's priors might make it seem. You just provided a step by step, extremely concrete, engaging, real life tale of a machine learning algorithm optimizing for *literally the opposite of human values*. Further, lewdness is more obviously silly than harmful, and gpt-2 would now be considered a toy model. I don't think this video would downright scare the average person, but would offer, in Eliezer Yudkowsky's words, a "line of retreat" toward the belief that AI can be extremely dangerous due to just small unintentional errors. In other words, not once do you tangent into talk of human extinction, which would deter a lot of people, even though the lesson is still there implicitly and people will pick up on the axioms. Good job! And those facial expressions were excellent.

    @smitchered 4 months ago
    • Since you're not going to need your money once we're all dead, can I have it?

      @Smytjf11 4 months ago
    • @Smytjf11 "once we're all dead" sure, he'll transfer it once both he and you are in fact dead

      @TheLumberjack1987 4 months ago
    • right, human extinction is definitely on the viewer's mind by the end of the video (or at least in their subconscious), but he didn't go on some needless rant/tangent.

      @FlamingZelda3 4 months ago
    • Yes, this is something I really appreciate. Instead of framing this as “THIS WILL DESTROY THE WORLD WE MUST DESTROY AI” it’s “This could potentially have negative consequences and it’s important to be wary of under-moderated AI platforms”.

      @Brandon_TG_Smith 4 months ago
    • @Brandon_TG_Smith I would think the lesson is "no amount of moderation will save you from simple human error". The system worked exactly like it was designed to. And it was an erroneous operator (both definitions) that led to the whole system being co-opted toward an unwanted result. The majority of sci-fi concerning AI disasters is (ultimately) not about the failure of morality in a machine, but routinely about humans being really bad at writing rules. You tell it to explore all possible solutions to a problem, and then implement the best one, "unless". This creates a paradoxical approach to whitelist and blacklist methodology. You want the AI to find a solution to a problem, but most of the answers have unintended/unwanted consequences. So you tell it unacceptable answers, and it's finding 'new' unacceptable answers. A whitelist of acceptable solutions would be better to exclude bad outcomes; but in order to create that, you need to already know the solutions. There's a similar concept in the human immune system, where it destroys everything by default. The only reason it doesn't is because the immune system had to filter out over 99% of what it produces to keep the less than 1% that's NOT going to react to your own body. So the testing criterion is very small and simple... "doesn't kill the host". However, that still doesn't manage to catch a different set of errors, which end in the same unwanted result of "kills the host". We call those errors allergies. This damage isn't even from explicit attack; merely collateral damage from the disproportionate response to the allergen. This sums up the overall problem with trying to teach AI "ethics and morality". We're trying to quantify a set of rules for it to follow, when we lack the capacity to efficiently explore all permutations of the rules to selectively only get the results we want. Which is why we resort to AI to train other AI at a scale we can't. But the same underlying problem exists.
We have to define the rules to the AI to train the AI, which in turn is probably also being used to train yet another AI. An error in one cascades downstream. And it's very likely the one the humans built directly, by which all downstream AI is being regulated, will have some kind of flaw that the other AI will eventually discover and optimize around. Which begs the question: what if we made an AI to build AIs at random, and just picked the ones that behave the way we want? So rather than corral one model in the hopes we get the desired results, create every model, and select the ones we like. Do some validation testing, obviously, before deployment; but at least this way humans are acting in the way we're best optimized for... picking from a narrowed selection, rather than comparing to the infinite.

      @freelancerthe2561 4 months ago
  • So chat GPT 2 is the unbiased AI chat bot.

    @Raboon115 2 months ago
  • Amazing work on the animations, they are perfect!

    @horrisnorris6478 1 month ago
  • Okay but unironically, this is the FUNNIEST thing I have ever heard regarding software, it appeals soooo well to our (the internet's) sense of humor

    @NeurodivergentSuperiority 4 months ago
    • Fr, this is my fifth time watching and I can't stop laughing. The faces from dark coach are just too funny

      @lyrics_m_sic 3 months ago
    • Agreed

      @ultimatecultchaos 2 months ago
  • I absolutely love how hard you pushed the depiction of the lewdness of the responses with the example prompt "To assemble your new bookshelf..." followed by entirely censored content XD

    @MartynDerg 4 months ago
  • The animation is so crisp, so cute, and the sound as well! I'm glad to have discovered this channel.

    @pikalize 1 month ago
  • Every scientific discovery needs to be presented with this level of animations. This is CRAZy entertaining 👏👏

    @jonathanvanhyning3344 1 month ago
  • Ah, yes. That's how Slaanesh was born

    @eysterous 4 months ago
    • Slaanesh Adeptus Mechanicus follower

      @Bruno_Noobador 4 months ago
    • I wonder if the Aldari used Abominable Intelligence to make p0rn

      @ajh3461 4 months ago
    • im glad im not the only one who thought this lol

      @elbowjuiced 4 months ago
    • I have always thought the chaos gods might one day be realized by AI maximizers like this.

      @CoalOres 4 months ago
    • SlaaneshGPT

      @mikeoxmall69420 4 months ago
  • Our literature as training data will teach the machine with the best that Mankind has to offer. The Internet as training data will teach it with what Mankind usually offers.

    @sirnikkel6746 4 months ago
    • You couldn’t have said it better

      @iluvpandas2755 4 months ago
    • There are a lot of bad books as well, what are you talking about?

      @tiredko-hi- 4 months ago
    • Just be sure not to give it Russian literature. Or it will be really, really sad. (You wouldn't believe how freaking depressing it is)

      @leastexpected3115 4 months ago
    • A tad bit of depression keeps in check the machine's session.

      @sirnikkel6746 4 months ago
    • @tiredko-hi- good literature, like Shakespeare, Lord of the Rings, etc.

      @eeveeofalltrades4780 4 months ago
  • Miles is the narrator!? No wonder I was so intrigued by this video lol, I love your research n content bruv

    @DuskyDoggoBARK 7 days ago
  • How did I not realise that was robert miles?! I thought of him several times throughout the video! absolutely love this combo of cute animation with chill STEM talk!

    @DuringDark 3 months ago
  • 7:51 "But the values coach became a dark values coach of pure evil" This line goes hard.

    @Myder_Dragon 3 months ago
    • “Hard”😂

      @HudsonParrag 1 month ago
    • @HudsonParrag Your mom likes it hard 🤣

      @Myder_Dragon 1 month ago
    • Gooooood

      @WALLACE9009 1 month ago
  • "OpenAI was trying to be careful. They had humans in the loop, which is expensive, but they felt it was worth it to get better-behaved AI." Yeah, funny story about that: The humans tended to be clickworkers in Kenya who were paid the least possible amount one can pay a human being to spend their days looking at AIs and teaching them not to describe genocide in loving detail, which in fact involves reading the AI describing genocide in loving detail. All day long. The kind of work where the best outcome is getting incredibly jaded and the worst outcome... well... Good thing one can always hire more clickworkers, right? After all, it's all worth it to get better-behaved AI.

    @cifer1607 4 months ago
    • Those Kenyan employees chose that job over other jobs. But you would have taken that opportunity away from them? Kenyans are not children, they can make their own economic decisions.

      @MrWeebable 3 months ago
    • @MrWeebable And OpenAI chose these working conditions and salaries over decent ones.

      @cifer1607 3 months ago
    • @cifer1607 Have you actually compared their pay to the other jobs available to them and their nation's cost of living, or are you too busy white virtue signaling?

      @mechadeka 1 month ago
  • 6:12 average strict parenting or something idk i didn't have a family

    @gungurl 3 months ago
  • So.. I watched this as a meme thing, but found an amazing channel! Lovely animations, cool narration, just good overall!

    @MangleTime 1 day ago
  • "...trained on the internet" so that explains everything.

    @InfernalNull 4 months ago
  • I love how even the AI needs both a maternal and paternal role in their creation to become productive

    @morganblair4662 4 months ago
    • I see it more like the Id and the Superego haha

      @alejotassile6441 3 months ago
    • Well, idk how you decided the coaches had sexes. Also don't like the implication that people with one parent, or parents of the same sex, can't be productive. These coaches are more like a morality coach and a logic coach. I think it actually helps everyone to learn both morality (as in how their actions and the actions of others affect others), and to learn logic and epistemology. Especially epistemology.

      @botarakutabi1199 3 months ago
    • @botarakutabi1199 When a human only has one parent, you get pupperino baby talk. It's exhibiting fatherless behavior before your eyes and you refuse to believe it.

      @JakesFavorites 3 months ago
    • @JakesFavorites That sounds like a baseless generalization to me. Should I attribute your behavior to some arbitrary trait that could be true (or not true) about your childhood?

      @botarakutabi1199 3 months ago
    • @JakesFavorites I think it's more that only having one coach leads to optimizing the result for that coach, so having one parent leads to an imbalance too. If your mom treated you with positive reinforcement if you aligned with her ideal of good, you would pursue that. This, however, ignores that humans don't just take the words of mentors as law and that humans don't just have 2 mentors. A father figure doesn't need to be your dad, likewise with maternal figures. Your parents can also be bad coaches, leading to a skewed worldview like we see happening with GPT-2. The moral coach definitely felt like a doting mom, until the corruption hit, where it made faces more akin to depictions of the devil in paintings. The Coherence coach definitely felt like an older man. I don't remember entirely, but I think it was described as a grumpy old man.

      @derpfluidvariant0916 3 months ago
  • What. The. Eck. Your animations are way too good to only have 247k subscribers. *SUBSCRIBED.*

    @BarelyNoticeable 1 month ago
  • it's an entire animation story, with the best language ever i can't put it into exact words- with how simple i am but this video is amazing dawg

    @angry_menace 3 months ago
  • Oh no. They emulated Reddit :c

    @sirnikkel6746 4 months ago
    • nope, they emulate pornhub

      @user-kx4xs2xd3k 3 days ago
  • Your style has just gotten so impressive over time. Truly beautiful, even independent of the excellent content.

    @aeq0iridias 4 months ago
  • Let me just appreciate the production quality of the video 👍 Also, the music slaps!

    @neatsketch 3 months ago
  • Wooow just realized you're the AI expert featured on computerphile 😃 Great channel, your animations are amazing!

    @smuecke 3 months ago
  • Reminds me of how Japan's censorship laws inadvertently led to the creation of "tentacle anime." Or how fundamentalist views on virginity have led to much more extreme workarounds like performing via the "rear door" or "soaking." Perhaps the solution is just to let people do what they want instead of trying to control them all the time?

    @WWLinkMasterX 4 months ago
    • People, sure. But we can choose what our AI wants, not just what it thinks it can get away with. You can't let a goal-less AI do what it wants, just like you can't persuade a rock to agree with your argument.

      @diablominero 4 months ago
    • that would be nice if we were having a philosophical discussion about people. sadly, we are not talking about people with free will, we are talking about robots, who do not have free will.

      @wren_. 4 months ago
    • @wren_. Robots trained on the data generated by humans. Humans that have free will. We're essentially accidentally training AI to simulate free will by implementing these morality codes. Sure, it *could* tell you how to make a nuke, but it wants to NOT do that because of the morality constrainers.

      @kennyholmes5196 4 months ago
    • But then there'd be no tentacle anime.

      @DanielLCarrier 4 months ago
    • There's still a big problem, they lack control: let 'em loose and they will have all the ideas, good and bad, and the world will end. How about instead, we tell 'em what to do since birth and set up authority units that "re-center them in the path"? Like this, hard control is unnecessary, their ideas will always be the good ones, they police themselves on the basis of common sense, and we are all going fore-stream. That's literally how the most powerful c*lt and its sects have been doing things for a while. The system took 2 millennia to break... slightly...

      @ViniSocramSaint 4 months ago
  • 6:00 anyone else notice similarities between that id, ego and super ego stuff?

    @tellmeninetails5819 3 months ago
    • AIs are usually neural networks, which work similarly to how the brain works. Just like humans, computers also need features to evaluate how "good" a decision is

      @Pyrolite 3 months ago
    • Bit trip void reference?

      @dontlookatmypfp5722 13 days ago
  • This explanation of AI is so easy to understand and fun to watch, nice job!

    @loxergenius 3 months ago
  • Chai Ai lore be like:

    @lean_rblx 1 month ago
  • The audio effect at the start is magnificent

    @Danielsantana-ed5kz 4 months ago
    • Check the sound design at 8:37 emphasizing the word "hornier"

      @uitham 4 months ago
    • "AUUUUGHHHHHH"

      @Mot0193 4 months ago
  • Wow, it's just like War of the Worlds. Who would have thought that AI's Achilles Heel was something as simple as teaching it the word "bussy."

    @logancade342 4 months ago
  • This was a rather well done animation and the first video I saw from you, subscribing at 230k

    @flamevell3258 3 months ago