Large Language Models: Part 2

2024 ж. 19 Сәу.
112 842 Рет қаралды

How do language models like GPT and Palm work?
Part 1: • Large Language Models ...
See next: text-to-image (Parti, Imagen, Dall-E): • Text to Image in 5 min...
0:00 - intro
0:14 - next word prediction
0:20 - word embeddings
1:01 - transformers
3:11 - generating text
4:13 - stacking attention layers
4:47 - training data
5:21 - GPT-3 examples

Пікірлер
  • It’s amazing and so funny to me how LLM can produce fully functional python programs and write these poetic Bob Dylan inspired lyrics… but when prompted with “you’re going north. You turn right, then you turn left. Now you’re going …” it said SOUTH ☠️

    @kahoku451@kahoku451 Жыл бұрын
    • Just like human "brain cramps" or "brain farts". The problem is that the current models aren't self learning or made to analyse their own answers so they don't correct these error before output or can correct the weights in the future when confronted with new data.

      @samandoria@samandoria Жыл бұрын
    • It Re-raises old but interesting points about the difference between language and logic. You can make linguistically sensible statements that defy logic. Try the book “Godel, Escher, Bach” if you like this area.

      @TimHulse@TimHulse Жыл бұрын
    • 6:02 If you are just just south of the North Pole, turn right and turn left over the North Pole, you would be heading South. I suspect it is getting confused due to the similar riddle where if you head North, turn right and you are now heading South - where are you? It has probably seen this riddle a confusing number of times and weaved that into its weights/response.

      @webcache1000@webcache1000 Жыл бұрын
    • Because it doesn't necessarily understand how directions work, it only knows how "north" and "right and left" has been used in language before, and is only an estimation. People are unlikely to talk about celestial directions and turn right or left, so there was probably not enough similar to your query in its training data that it could draw on and other examples such as "north and south" have taken more weight.

      @distrologic2925@distrologic2925 Жыл бұрын
    • Logic is a whole different thing ig

      @swanandjoshi333@swanandjoshi333 Жыл бұрын
  • This is one of the best explainer vids on LLMs I’ve seen yet. Not too long, not too short, good pacing, good visualizations. Great work, thanks!

    @5133937@513393711 ай бұрын
  • I really enjoyed both of these LLM videos. They are so concise and informative and the pacing is excellent.

    @TimHulse@TimHulse Жыл бұрын
    • He is an Arab from the Middle East. My dog wants a walk.

      @whannabi@whannabi Жыл бұрын
    • I [or shall I say, my internal GPT?] first misread your comment as "... and the pancake is excellent". 😃

      @apc067@apc06711 ай бұрын
  • Great stuff. Possibly the best intro material to LLMs that I have seen. Thunbs up!

    @f18a@f18a Жыл бұрын
  • I like that you used a recipe prompt to demonstrate what a LLM is good at doing, then actually followed the recipe and proved that it actually worked (and tasted good!).

    @Charlieee1@Charlieee1 Жыл бұрын
  • This is the single best explaination I’ve come across on LLM’s

    @roywem@roywem Жыл бұрын
  • This should be adopted by high schools everywhere. Superb teaching. A bit fast for me (I’m 61), but I can re-watch until I get it.

    @csmac3144a@csmac3144a4 ай бұрын
  • Soothing voice to learn about transformers and warm room is perfect for recipe for my sleep.🛌

    @Sm_Colly@Sm_Colly29 күн бұрын
  • you actually made the cookies hahaha that's awesome, great video btw!

    @ianassarandas@ianassarandas11 ай бұрын
  • So pleased to get a clear and credible glimpse under the hood. Thank you.

    @stuartthomas7441@stuartthomas7441 Жыл бұрын
  • Very much enjoyed these two videos. More please! Clear and detailed.

    @smudgepost@smudgepost Жыл бұрын
  • Typo alert! Frost's wonderful poem begins as so: "Two roads diverged in a yellow wood". (Not "diverted"!)

    @somethingness@somethingness Жыл бұрын
  • This is a really good intro indeed! I encourage to make more content like this

    @aidarfaizrakhmanov1901@aidarfaizrakhmanov1901 Жыл бұрын
  • Thank you. Those were very clear explanations of just the right length. Loved that you cooked the pancakes too!

    @poklet@poklet Жыл бұрын
  • Haha you made the pancake

    @tvillaluz@tvillaluz Жыл бұрын
  • I think this is the best, most intuitive and most illustrative video describing LLMs/transformers. Thank you so much!

    @xybnedasdd2930@xybnedasdd293011 ай бұрын
  • Really easy to follow, well paced, easy on the ear, and just the right level thanks!

    @craftycurate@craftycurate Жыл бұрын
  • These are two great videos that introduced how large language model works in a very comprehensive way.👍👍👍

    @MaleGeminiCat@MaleGeminiCat Жыл бұрын
  • Thanks for the crisp walkthrough of the technology. It is a very good introduction.

    @ravinatarajan4894@ravinatarajan4894 Жыл бұрын
  • What a great walk through! Thanks so much for sharing.

    @jteichma@jteichma Жыл бұрын
  • Dude, I have been researching Transformers and Attention for weeks. You’re a master mind, I get it now!

    @scottlott3794@scottlott37949 ай бұрын
  • Thanks for an original presentation of Large Language Models. It gave me new insight.

    @Paul-rs4gd@Paul-rs4gd Жыл бұрын
  • Just 16 minutes, it is amazing. These are the best concise videos on the topic including basics in neural networks I have seen. Many thanks for sharing your knowledge!

    @yojimbomk@yojimbomk10 ай бұрын
  • the Mona Lisa of LLM explanations .. thanks!

    @coraltown1@coraltown110 ай бұрын
    • Okay, that's the best comment yet -- thank you :-)

      @g5min@g5min10 ай бұрын
  • I love that you finished cooking the recipe! Great video :)

    @DavidDiaz-zg5sv@DavidDiaz-zg5sv Жыл бұрын
  • Thank you! This is excellent. I love the animations. They are helpful!

    @MindyMcAdams@MindyMcAdams Жыл бұрын
  • Man you NEED more subscribers, the content and video quality is way too good for only 15k!

    @eenvleugjegoeiegames@eenvleugjegoeiegames6 ай бұрын
  • You, sir, have talent for teaching. Even with my fairly technical background and watching a lot of videos I was struggling to get my head around LLMs until I watched your videos. Hope you continue what you started it would help a lot of people especially now when we see a lot of people trying to run their own local LLM versions on their PCs.

    @jaunalapa@jaunalapa7 ай бұрын
  • Thanks so much for both of these videos. They are wonderful. I think I understood them a bit more since I’ve done some basic assisted machine learning dev (up to neural networks). If anyone is a bit lost, read up on linear and logistic regression, then onto neural networks.

    @DrNuyenVanFaulk@DrNuyenVanFaulk Жыл бұрын
  • This brings me one step closer to understanding. Thanks.

    @fernwood@fernwood10 ай бұрын
  • Brilliant overview for a non-technical person like me... and glad to see you tested the recipe!

    @janicem5942@janicem594210 ай бұрын
  • This is awesome. Would love to learn more!

    @KumquatChampion@KumquatChampion Жыл бұрын
  • Loved this! Thanks for the great video!

    @AMGbot@AMGbot Жыл бұрын
  • Pretty nice explanation in both videos.Thanks!

    @bloodywolftr@bloodywolftr Жыл бұрын
  • Very underrated and underappreciated video.

    @preethamrangaswamy7371@preethamrangaswamy7371 Жыл бұрын
  • An excellent video on language models

    @EGlobalKnowledge@EGlobalKnowledge11 ай бұрын
  • You really have a talent to teach things.

    @vast634@vast634 Жыл бұрын
  • Great effort - nice video, good and comprehensive explanation.

    @frank-reneschafer5512@frank-reneschafer55129 ай бұрын
  • I loved that you actually cooked that recipe! :-)

    @padetiit7014@padetiit7014 Жыл бұрын
  • Very well made!

    @Patapom3@Patapom311 ай бұрын
  • I hope you keep up with these videos, they are seriously great. Already suscribed and I'll check the rest of your channel. Thank you.

    @roscatres@roscatres11 ай бұрын
  • Not a second wasted. Just brilliant ❤️

    @KalebPeters99@KalebPeters99 Жыл бұрын
  • Great video, leaving a comment to let you know it was very insightful. Thank you.

    @AncientSlugThrower@AncientSlugThrower Жыл бұрын
  • This is the best video on Language Models that I seen. Probably the best on the Internet. You should maybe add chatgpt to the title to get more views.

    @0815Snickersboy@0815Snickersboy Жыл бұрын
  • Absolute best explainer. Where have you been though since GPT-4?? Would love to see you do more content. You have a talent!! I think you might be busy working on LLM's for research or in a company, but would be cool to see more videos!

    @GodofStories@GodofStories9 ай бұрын
  • That language neural network at 0:50 belongs on a tshirt somewhere

    @ayushsharma9036@ayushsharma903611 ай бұрын
  • Well done!

    @christianwestermann4680@christianwestermann4680 Жыл бұрын
  • Great explanation, glad I stumbled upon it ! Subscribed :)

    @StratosFair@StratosFair Жыл бұрын
  • Very informative and concise videos to understand LLMs their complexities and what i takes to make these models successful.

    @ypetkar@ypetkar9 ай бұрын
  • Very clear. Thanks a lot.

    @jorgesoberon6866@jorgesoberon6866 Жыл бұрын
  • This is my first search for a LLM explanation and very pleased with the video. I am not a mathematician or programmer but I am very interested in learning how LLM works. From my humble perspective I can say we reached a point of no return and this technology is progressing at an exponential rate. With the development of quantum computing, I have no doubt that it will surpass human intelligence in ways we don't understand.

    @TheCaioKyleBraga@TheCaioKyleBraga Жыл бұрын
  • This was awesome! Thanks

    @dv6165@dv616511 ай бұрын
  • EXCELLENT Vid, PLS DO MORE, on Deep learning , covering the whole workflow of making an LLM, especially, what os LORa, VEctor embeddings, etc I'm sure you'll get huge interest You have a gift for explaining. Thanks!

    @bingolio@bingolio Жыл бұрын
  • amazing video! thank you!

    @user-hf5og1bt8j@user-hf5og1bt8jАй бұрын
  • nice work, really good to visualize these things even though I already know this.

    @fenandamilanda2032@fenandamilanda2032 Жыл бұрын
  • Thanks for the good explanation, very much on time

    @yourfuneral@yourfuneral Жыл бұрын
  • Big thanks mate

    @prathams8685@prathams8685 Жыл бұрын
  • Great, thanks so much, massively useful.

    @danjsy@danjsy8 ай бұрын
  • Thank you Steve for a personal moment of enlightening by opening the "black box of AI". This is an outstanding educational piece, in particular in combination with the first part.

    @stefanbiesdorf4637@stefanbiesdorf46379 ай бұрын
  • Very interesting... Now I understand more about how ChatGPT works...

    @sciexp@sciexp Жыл бұрын
  • Awesome. Thank you

    @rajachan8588@rajachan8588 Жыл бұрын
  • Very good explanation

    @saisaske1@saisaske19 ай бұрын
  • Very good content! Keep going! thanks.

    @ivanocj@ivanocj Жыл бұрын
  • Thumbs-up for actually making the pancakes. 😂

    @bgustinjr@bgustinjr Жыл бұрын
  • amazing videos!! learnt so much

    @Sukant98@Sukant98 Жыл бұрын
  • great video bud cheers!

    @thomasforrest1931@thomasforrest1931 Жыл бұрын
  • Great videos! Btw, how did you like your avocado cocoa thing? 😄

    @carrumar@carrumar Жыл бұрын
  • Continuing from the previous response, here are some additional considerations and challenges when working with large language models: 14. Data Privacy and Ethical Concerns: - Be aware of privacy concerns when collecting and using data for training. Ensure that you have the necessary permissions and comply with data protection regulations. Ethical considerations, such as bias and fairness in the data, also need to be addressed. 15. Compute Resources: - Training large language models requires substantial computational resources, including high-end GPUs or TPUs and large-scale distributed computing infrastructure. These resources can be expensive and may not be accessible to everyone. 16. Energy Consumption: - Training large language models consumes a significant amount of electricity, contributing to environmental concerns. Some organizations are actively working on making AI training more energy-efficient. 17. Model Size and Efficiency: - While larger models tend to perform better, they also require more memory and computational power. Balancing model size and efficiency is crucial for real-world applications, as very large models might not be practical for all use cases. 18. Fine-Tuning and Transfer Learning: - Fine-tuning pre-trained models on specific tasks is a common practice, as it requires less data and computational resources compared to training from scratch. Understanding how to effectively fine-tune models is essential. 19. Evaluation Metrics: - Choosing appropriate evaluation metrics is critical. Different NLP tasks may require different metrics. For instance, accuracy may be suitable for classification tasks, while BLEU scores are used for machine translation. Select metrics that align with your objectives. 20. Bias and Fairness: - Large language models can inherit biases present in the training data. Mitigating bias and ensuring fairness in AI systems is a significant challenge. It requires careful curation of training data and ongoing monitoring. 21. Robustness and Safety: - Ensuring that large language models are robust and safe is essential. This includes protecting against adversarial attacks, avoiding harmful or inappropriate outputs, and ensuring that the model behaves predictably. 22. Research and Collaboration: - The field of large language models is rapidly evolving. Staying up-to-date with the latest research and collaborating with the AI community can help improve your understanding and the quality of your models. 23. Resource Sharing: - Due to the resource-intensive nature of training, sharing pre-trained models, datasets, and trained weights is common in the AI community. Leveraging existing resources can save time and resources. 24. Ethical Considerations: - Consider the ethical implications of your work. The power of large language models also comes with responsibility. Engage in ethical discussions and follow guidelines for responsible AI development. 25. Interpretability: - Large language models are often criticized for their lack of interpretability. Efforts to make AI models more understandable and explainable are ongoing to build trust and ensure accountability. Training large language models from scratch is a challenging and resource-intensive endeavor, and it may not be necessary for many practical applications. Leveraging pre-trained models and fine-tuning them for specific tasks is a more common and efficient approach in many cases. Moreover, ensuring responsible and ethical AI practices is paramount when working with such powerful language models.

    @ChatGPt2001@ChatGPt20016 ай бұрын
  • Great overview! People need to see this video pair before freaking out that LLMs are actually intelligent.

    @koho@koho Жыл бұрын
  • Cool video, thanks

    @moedemama@moedemama Жыл бұрын
  • Thanks

    @abhaychandrol@abhaychandrol4 ай бұрын
  • Thank you. Just the right level for my tiny organic brain.

    @deand6411@deand6411 Жыл бұрын
  • Great the concepts transition. Great illustrations. The best of the best this couple of videos. What about more on other networks like r-cnn and audio nets? 😃

    @andresroca9736@andresroca9736 Жыл бұрын
    • Thanks! I'm working on one on reinforcement learning now...

      @g5min@g5min Жыл бұрын
  • Please do more videos on LLMs!!! But also I need to know, how were the pancakes?

    @xflory26x@xflory26x Жыл бұрын
  • liked and subscribed

    @CoreDump07@CoreDump07 Жыл бұрын
  • Ever think about audio synthesis and wave forms?? And how analogue synthesis utilising wave tables can offer a way to both communicate and comput information.

    @DJWESG1@DJWESG1 Жыл бұрын
  • those pancakes look pretty good

    @mineralt@mineralt8 ай бұрын
  • fun funny fantastic and I am a fan!

    @leeamraa@leeamraa Жыл бұрын
  • I'm a bit confused by how stacking attention layers works at 4:12. Does the second layer take the first layer's prediction as input? Is the first layer's prediction still "next words" at that point, or is it now some sort of abstract intermediate value? How exactly does that capture higher level reasoning? Would appreciate any clarification!

    @zuqini@zuqini11 ай бұрын
  • I love the fact that you really tried that recipe! How was it?

    @food4yann@food4yann9 ай бұрын
  • I would love more of the visual explainers on ML concepts. Subscribing. "Some folks say they're overhyped / But I do think that's true / I think they're just misunderstood / Just like me and you" That generated lyric gave me CHILLS

    @SethWieder@SethWieder Жыл бұрын
    • Sameee

      @kahoku451@kahoku451 Жыл бұрын
  • eliminating bias and stereotypes from language models is a lost cause, because it's the same as asking the network to lie.

    @martinstu8400@martinstu8400 Жыл бұрын
    • you didn't understand anything did you

      @parabolicpanorama@parabolicpanorama Жыл бұрын
  • This was super informative and concise, loved it! But my real question is how were those pancakes?

    @dormin1850@dormin1850 Жыл бұрын
  • GPT-4: Might I suggest dubbing them "Chocomole Pancakes"? 😮😂

    @efisgpr@efisgpr Жыл бұрын
  • ChatGPT gets the 37 question right now.

    @richardharris9708@richardharris9708 Жыл бұрын
  • Can I do this in JavaScript?!

    @logic_force@logic_force8 ай бұрын
  • This is really nice! Now I understand why chatGPT tends to make up a lot of stuff with coherent sentences

    @speicaldark@speicaldark Жыл бұрын
    • What I'm wondering is how are they correcting its errors. For traditional NN, we have heat maps but I'd like to see something similar with transformers at the highest level to see what kind of patterns it noticed. Maybe that's what they use to correct its mistakes

      @whannabi@whannabi Жыл бұрын
    • @@whannabi ChatGPT used a process called reinforcement learning from human feedback (RLHF): They used an already trained GPT-3 which already at the time. Humans both submitted new sentences as input prompts to ChatGPT being trained, and also ranked the output (responses) of the model. Then the ranking of the responses were then used as reward targets to continue training the model to obtain more desirable responses (measured by how the response rankings had increased).

      @gideonk123@gideonk123 Жыл бұрын
  • Fantastic work, hope those pancakes tasted better than they looked! xD

    @triton62674@triton62674 Жыл бұрын
    • they were seriously delicious

      @g5min@g5min Жыл бұрын
  • It looks like GPT hands you a great chocolate guacamole pancake recipe... now I wanted to try too loool

    @mlusalin2379@mlusalin2379 Жыл бұрын
  • Just look how far we have come in only eight months.

    @chnolte@chnolte Жыл бұрын
  • Rather than asking chatgpt to translate, I recommend asking the question in the target language. You'll significantly different results - more accurate in the target language. Stated without proof or explanation which should be intuitively obvious after watching these two videos...

    @GeorgeJohnsonJackofAllTrades@GeorgeJohnsonJackofAllTrades8 ай бұрын
  • Wonderful stuff. 👍 Also, _please can I get some oven and the oven please let us have to do the run and not a big difference in a bit more about people who have been in touch your own house is the best way of a bit more about people..._ *My phone wrote the part in italics.

    @nagualdesign@nagualdesign Жыл бұрын
  • Was the chocolate guacamole pancake any good?

    @Ai.Sentinel@Ai.Sentinel Жыл бұрын
  • pancake approved 🥞👍

    @KeikosCake@KeikosCake Жыл бұрын
  • Verdict on the guacookies?

    @genegade@genegade Жыл бұрын
  • I can't get why we stack them like so. If the first transformer block predicts a word, what second does, third? And why do they still need attention then?

    @altalt7653@altalt7653 Жыл бұрын
  • Me: What is a LLM? Average university pRofeSsOr: *Slams 12 academic papers with 119 equations and 165 references and told me to read* Chad Steve Seitz:

    @hongyifan5203@hongyifan52038 ай бұрын
  • This is super fascinating. I want to learn to grow my own language models from the ground up with languages like c++/rust. I dont care if the language model i develop is inaccurate or not. I just want to understand chatgpt under the hood. How may i get started?

    @DonaldTamMisterDee@DonaldTamMisterDee Жыл бұрын
    • Why dont you ask the language model how to make a language model?

      @spenarkley@spenarkley Жыл бұрын
    • I have! Really great tips!

      @infiniteplanes5775@infiniteplanes5775 Жыл бұрын
    • ask gpt to stack transformer layers and add some input output layers. add some loss functions to optimize and get a large dataset.

      @parabolicpanorama@parabolicpanorama Жыл бұрын
  • What do you think of GPT4?

    @jaredf6205@jaredf6205 Жыл бұрын
  • 5:55 the german one is correct

    @f4ls381@f4ls381 Жыл бұрын
KZhead