Large Language Models: Part 2

2024 ж. 19 Сәу.

112 842 Рет қаралды

How do language models like GPT and Palm work?
Part 1: • Large Language Models ...
See next: text-to-image (Parti, Imagen, Dall-E): • Text to Image in 5 min...
0:00 - intro
0:14 - next word prediction
0:20 - word embeddings
1:01 - transformers
3:11 - generating text
4:13 - stacking attention layers
4:47 - training data
5:21 - GPT-3 examples

Пікірлер

It’s amazing and so funny to me how LLM can produce fully functional python programs and write these poetic Bob Dylan inspired lyrics… but when prompted with “you’re going north. You turn right, then you turn left. Now you’re going …” it said SOUTH ☠️
@kahoku451 Жыл бұрын
- Just like human "brain cramps" or "brain farts". The problem is that the current models aren't self learning or made to analyse their own answers so they don't correct these error before output or can correct the weights in the future when confronted with new data.
  @samandoria Жыл бұрын
- It Re-raises old but interesting points about the difference between language and logic. You can make linguistically sensible statements that defy logic. Try the book “Godel, Escher, Bach” if you like this area.
  @TimHulse Жыл бұрын
- 6:02 If you are just just south of the North Pole, turn right and turn left over the North Pole, you would be heading South. I suspect it is getting confused due to the similar riddle where if you head North, turn right and you are now heading South - where are you? It has probably seen this riddle a confusing number of times and weaved that into its weights/response.
  @webcache1000 Жыл бұрын
- Because it doesn't necessarily understand how directions work, it only knows how "north" and "right and left" has been used in language before, and is only an estimation. People are unlikely to talk about celestial directions and turn right or left, so there was probably not enough similar to your query in its training data that it could draw on and other examples such as "north and south" have taken more weight.
  @distrologic2925 Жыл бұрын
- Logic is a whole different thing ig
  @swanandjoshi333 Жыл бұрын
This is one of the best explainer vids on LLMs I’ve seen yet. Not too long, not too short, good pacing, good visualizations. Great work, thanks!
@513393711 ай бұрын
I really enjoyed both of these LLM videos. They are so concise and informative and the pacing is excellent.
@TimHulse Жыл бұрын
- He is an Arab from the Middle East. My dog wants a walk.
  @whannabi Жыл бұрын
- I [or shall I say, my internal GPT?] first misread your comment as "... and the pancake is excellent". 😃
  @apc06711 ай бұрын
Great stuff. Possibly the best intro material to LLMs that I have seen. Thunbs up!
@f18a Жыл бұрын
I like that you used a recipe prompt to demonstrate what a LLM is good at doing, then actually followed the recipe and proved that it actually worked (and tasted good!).
@Charlieee1 Жыл бұрын
This is the single best explaination I’ve come across on LLM’s
@roywem Жыл бұрын
This should be adopted by high schools everywhere. Superb teaching. A bit fast for me (I’m 61), but I can re-watch until I get it.
@csmac3144a4 ай бұрын
Soothing voice to learn about transformers and warm room is perfect for recipe for my sleep.🛌
@Sm_Colly29 күн бұрын
you actually made the cookies hahaha that's awesome, great video btw!
@ianassarandas11 ай бұрын
So pleased to get a clear and credible glimpse under the hood. Thank you.
@stuartthomas7441 Жыл бұрын
Very much enjoyed these two videos. More please! Clear and detailed.
@smudgepost Жыл бұрын
Typo alert! Frost's wonderful poem begins as so: "Two roads diverged in a yellow wood". (Not "diverted"!)
@somethingness Жыл бұрын
This is a really good intro indeed! I encourage to make more content like this
@aidarfaizrakhmanov1901 Жыл бұрын
Thank you. Those were very clear explanations of just the right length. Loved that you cooked the pancakes too!
@poklet Жыл бұрын
Haha you made the pancake
@tvillaluz Жыл бұрын
I think this is the best, most intuitive and most illustrative video describing LLMs/transformers. Thank you so much!
@xybnedasdd293011 ай бұрын
Really easy to follow, well paced, easy on the ear, and just the right level thanks!
@craftycurate Жыл бұрын
These are two great videos that introduced how large language model works in a very comprehensive way.👍👍👍
@MaleGeminiCat Жыл бұрын
Thanks for the crisp walkthrough of the technology. It is a very good introduction.
@ravinatarajan4894 Жыл бұрын
What a great walk through! Thanks so much for sharing.
@jteichma Жыл бұрын
Dude, I have been researching Transformers and Attention for weeks. You’re a master mind, I get it now!
@scottlott37949 ай бұрын
Thanks for an original presentation of Large Language Models. It gave me new insight.
@Paul-rs4gd Жыл бұрын
Just 16 minutes, it is amazing. These are the best concise videos on the topic including basics in neural networks I have seen. Many thanks for sharing your knowledge!
@yojimbomk10 ай бұрын
the Mona Lisa of LLM explanations .. thanks!
@coraltown110 ай бұрын
- Okay, that's the best comment yet -- thank you :-)
  @g5min10 ай бұрын
I love that you finished cooking the recipe! Great video :)
@DavidDiaz-zg5sv Жыл бұрын
Thank you! This is excellent. I love the animations. They are helpful!
@MindyMcAdams Жыл бұрын
Man you NEED more subscribers, the content and video quality is way too good for only 15k!
@eenvleugjegoeiegames6 ай бұрын
You, sir, have talent for teaching. Even with my fairly technical background and watching a lot of videos I was struggling to get my head around LLMs until I watched your videos. Hope you continue what you started it would help a lot of people especially now when we see a lot of people trying to run their own local LLM versions on their PCs.
@jaunalapa7 ай бұрын
Thanks so much for both of these videos. They are wonderful. I think I understood them a bit more since I’ve done some basic assisted machine learning dev (up to neural networks). If anyone is a bit lost, read up on linear and logistic regression, then onto neural networks.
@DrNuyenVanFaulk Жыл бұрын
This brings me one step closer to understanding. Thanks.
@fernwood10 ай бұрын
Brilliant overview for a non-technical person like me... and glad to see you tested the recipe!
@janicem594210 ай бұрын
This is awesome. Would love to learn more!
@KumquatChampion Жыл бұрын
Loved this! Thanks for the great video!
@AMGbot Жыл бұрын
Pretty nice explanation in both videos.Thanks!
@bloodywolftr Жыл бұрын
Very underrated and underappreciated video.
@preethamrangaswamy7371 Жыл бұрын
An excellent video on language models
@EGlobalKnowledge11 ай бұрын
You really have a talent to teach things.
@vast634 Жыл бұрын
Great effort - nice video, good and comprehensive explanation.
@frank-reneschafer55129 ай бұрын
I loved that you actually cooked that recipe! :-)
@padetiit7014 Жыл бұрын
Very well made!
@Patapom311 ай бұрын
I hope you keep up with these videos, they are seriously great. Already suscribed and I'll check the rest of your channel. Thank you.
@roscatres11 ай бұрын
Not a second wasted. Just brilliant ❤️
@KalebPeters99 Жыл бұрын
Great video, leaving a comment to let you know it was very insightful. Thank you.
@AncientSlugThrower Жыл бұрын
This is the best video on Language Models that I seen. Probably the best on the Internet. You should maybe add chatgpt to the title to get more views.
@0815Snickersboy Жыл бұрын
Absolute best explainer. Where have you been though since GPT-4?? Would love to see you do more content. You have a talent!! I think you might be busy working on LLM's for research or in a company, but would be cool to see more videos!
@GodofStories9 ай бұрын
That language neural network at 0:50 belongs on a tshirt somewhere
@ayushsharma903611 ай бұрын
Well done!
@christianwestermann4680 Жыл бұрын
Great explanation, glad I stumbled upon it ! Subscribed :)
@StratosFair Жыл бұрын
Very informative and concise videos to understand LLMs their complexities and what i takes to make these models successful.
@ypetkar9 ай бұрын
Very clear. Thanks a lot.
@jorgesoberon6866 Жыл бұрын
This is my first search for a LLM explanation and very pleased with the video. I am not a mathematician or programmer but I am very interested in learning how LLM works. From my humble perspective I can say we reached a point of no return and this technology is progressing at an exponential rate. With the development of quantum computing, I have no doubt that it will surpass human intelligence in ways we don't understand.
@TheCaioKyleBraga Жыл бұрын
This was awesome! Thanks
@dv616511 ай бұрын
EXCELLENT Vid, PLS DO MORE, on Deep learning , covering the whole workflow of making an LLM, especially, what os LORa, VEctor embeddings, etc I'm sure you'll get huge interest You have a gift for explaining. Thanks!
@bingolio Жыл бұрын
amazing video! thank you!
@user-hf5og1bt8jАй бұрын
nice work, really good to visualize these things even though I already know this.
@fenandamilanda2032 Жыл бұрын
Thanks for the good explanation, very much on time
@yourfuneral Жыл бұрын
Big thanks mate
@prathams8685 Жыл бұрын
Great, thanks so much, massively useful.
@danjsy8 ай бұрын
Thank you Steve for a personal moment of enlightening by opening the "black box of AI". This is an outstanding educational piece, in particular in combination with the first part.
@stefanbiesdorf46379 ай бұрын
Very interesting... Now I understand more about how ChatGPT works...
@sciexp Жыл бұрын
Awesome. Thank you
@rajachan8588 Жыл бұрын
Very good explanation
@saisaske19 ай бұрын
Very good content! Keep going! thanks.
@ivanocj Жыл бұрын
Thumbs-up for actually making the pancakes. 😂
@bgustinjr Жыл бұрын
amazing videos!! learnt so much
@Sukant98 Жыл бұрын
great video bud cheers!
@thomasforrest1931 Жыл бұрын
Great videos! Btw, how did you like your avocado cocoa thing? 😄
@carrumar Жыл бұрын
Continuing from the previous response, here are some additional considerations and challenges when working with large language models: 14. Data Privacy and Ethical Concerns: - Be aware of privacy concerns when collecting and using data for training. Ensure that you have the necessary permissions and comply with data protection regulations. Ethical considerations, such as bias and fairness in the data, also need to be addressed. 15. Compute Resources: - Training large language models requires substantial computational resources, including high-end GPUs or TPUs and large-scale distributed computing infrastructure. These resources can be expensive and may not be accessible to everyone. 16. Energy Consumption: - Training large language models consumes a significant amount of electricity, contributing to environmental concerns. Some organizations are actively working on making AI training more energy-efficient. 17. Model Size and Efficiency: - While larger models tend to perform better, they also require more memory and computational power. Balancing model size and efficiency is crucial for real-world applications, as very large models might not be practical for all use cases. 18. Fine-Tuning and Transfer Learning: - Fine-tuning pre-trained models on specific tasks is a common practice, as it requires less data and computational resources compared to training from scratch. Understanding how to effectively fine-tune models is essential. 19. Evaluation Metrics: - Choosing appropriate evaluation metrics is critical. Different NLP tasks may require different metrics. For instance, accuracy may be suitable for classification tasks, while BLEU scores are used for machine translation. Select metrics that align with your objectives. 20. Bias and Fairness: - Large language models can inherit biases present in the training data. Mitigating bias and ensuring fairness in AI systems is a significant challenge. It requires careful curation of training data and ongoing monitoring. 21. Robustness and Safety: - Ensuring that large language models are robust and safe is essential. This includes protecting against adversarial attacks, avoiding harmful or inappropriate outputs, and ensuring that the model behaves predictably. 22. Research and Collaboration: - The field of large language models is rapidly evolving. Staying up-to-date with the latest research and collaborating with the AI community can help improve your understanding and the quality of your models. 23. Resource Sharing: - Due to the resource-intensive nature of training, sharing pre-trained models, datasets, and trained weights is common in the AI community. Leveraging existing resources can save time and resources. 24. Ethical Considerations: - Consider the ethical implications of your work. The power of large language models also comes with responsibility. Engage in ethical discussions and follow guidelines for responsible AI development. 25. Interpretability: - Large language models are often criticized for their lack of interpretability. Efforts to make AI models more understandable and explainable are ongoing to build trust and ensure accountability. Training large language models from scratch is a challenging and resource-intensive endeavor, and it may not be necessary for many practical applications. Leveraging pre-trained models and fine-tuning them for specific tasks is a more common and efficient approach in many cases. Moreover, ensuring responsible and ethical AI practices is paramount when working with such powerful language models.
@ChatGPt20016 ай бұрын
Great overview! People need to see this video pair before freaking out that LLMs are actually intelligent.
@koho Жыл бұрын
Cool video, thanks
@moedemama Жыл бұрын
Thanks
@abhaychandrol4 ай бұрын
Thank you. Just the right level for my tiny organic brain.
@deand6411 Жыл бұрын
Great the concepts transition. Great illustrations. The best of the best this couple of videos. What about more on other networks like r-cnn and audio nets? 😃
@andresroca9736 Жыл бұрын
- Thanks! I'm working on one on reinforcement learning now...
  @g5min Жыл бұрын
Please do more videos on LLMs!!! But also I need to know, how were the pancakes?
@xflory26x Жыл бұрын
liked and subscribed
@CoreDump07 Жыл бұрын
Ever think about audio synthesis and wave forms?? And how analogue synthesis utilising wave tables can offer a way to both communicate and comput information.
@DJWESG1 Жыл бұрын
those pancakes look pretty good
@mineralt8 ай бұрын
fun funny fantastic and I am a fan!
@leeamraa Жыл бұрын
I'm a bit confused by how stacking attention layers works at 4:12. Does the second layer take the first layer's prediction as input? Is the first layer's prediction still "next words" at that point, or is it now some sort of abstract intermediate value? How exactly does that capture higher level reasoning? Would appreciate any clarification!
@zuqini11 ай бұрын
I love the fact that you really tried that recipe! How was it?
@food4yann9 ай бұрын
I would love more of the visual explainers on ML concepts. Subscribing. "Some folks say they're overhyped / But I do think that's true / I think they're just misunderstood / Just like me and you" That generated lyric gave me CHILLS
@SethWieder Жыл бұрын
- Sameee
  @kahoku451 Жыл бұрын
eliminating bias and stereotypes from language models is a lost cause, because it's the same as asking the network to lie.
@martinstu8400 Жыл бұрын
- you didn't understand anything did you
  @parabolicpanorama Жыл бұрын
This was super informative and concise, loved it! But my real question is how were those pancakes?
@dormin1850 Жыл бұрын
GPT-4: Might I suggest dubbing them "Chocomole Pancakes"? 😮😂
@efisgpr Жыл бұрын
ChatGPT gets the 37 question right now.
@richardharris9708 Жыл бұрын
Can I do this in JavaScript?!
@logic_force8 ай бұрын
This is really nice! Now I understand why chatGPT tends to make up a lot of stuff with coherent sentences
@speicaldark Жыл бұрын
- What I'm wondering is how are they correcting its errors. For traditional NN, we have heat maps but I'd like to see something similar with transformers at the highest level to see what kind of patterns it noticed. Maybe that's what they use to correct its mistakes
  @whannabi Жыл бұрын
- @@whannabi ChatGPT used a process called reinforcement learning from human feedback (RLHF): They used an already trained GPT-3 which already at the time. Humans both submitted new sentences as input prompts to ChatGPT being trained, and also ranked the output (responses) of the model. Then the ranking of the responses were then used as reward targets to continue training the model to obtain more desirable responses (measured by how the response rankings had increased).
  @gideonk123 Жыл бұрын
Fantastic work, hope those pancakes tasted better than they looked! xD
@triton62674 Жыл бұрын
- they were seriously delicious
  @g5min Жыл бұрын
It looks like GPT hands you a great chocolate guacamole pancake recipe... now I wanted to try too loool
@mlusalin2379 Жыл бұрын
Just look how far we have come in only eight months.
@chnolte Жыл бұрын
Rather than asking chatgpt to translate, I recommend asking the question in the target language. You'll significantly different results - more accurate in the target language. Stated without proof or explanation which should be intuitively obvious after watching these two videos...
@GeorgeJohnsonJackofAllTrades8 ай бұрын
Wonderful stuff. 👍 Also, _please can I get some oven and the oven please let us have to do the run and not a big difference in a bit more about people who have been in touch your own house is the best way of a bit more about people..._ *My phone wrote the part in italics.
@nagualdesign Жыл бұрын
Was the chocolate guacamole pancake any good?
@Ai.Sentinel Жыл бұрын
pancake approved 🥞👍
@KeikosCake Жыл бұрын
Verdict on the guacookies?
@genegade Жыл бұрын
I can't get why we stack them like so. If the first transformer block predicts a word, what second does, third? And why do they still need attention then?
@altalt7653 Жыл бұрын
Me: What is a LLM? Average university pRofeSsOr: *Slams 12 academic papers with 119 equations and 165 references and told me to read* Chad Steve Seitz:
@hongyifan52038 ай бұрын
This is super fascinating. I want to learn to grow my own language models from the ground up with languages like c++/rust. I dont care if the language model i develop is inaccurate or not. I just want to understand chatgpt under the hood. How may i get started?
@DonaldTamMisterDee Жыл бұрын
- Why dont you ask the language model how to make a language model?
  @spenarkley Жыл бұрын
- I have! Really great tips!
  @infiniteplanes5775 Жыл бұрын
- ask gpt to stack transformer layers and add some input output layers. add some loss functions to optimize and get a large dataset.
  @parabolicpanorama Жыл бұрын
What do you think of GPT4?
@jaredf6205 Жыл бұрын
5:55 the german one is correct
@f4ls381 Жыл бұрын