Visualizing the Latent Space: This video will change how you imagine neural nets!

2024 ж. 10 Мам.
7 371 Рет қаралды

Latent Space is how neural networks store information. In this video, we discuss Autoencoders and Variational Autoencoders and how we can explore, interpret, and manipulate images by looking at it's latent space representations. The CelebA dataset and the DFC-VAE model (arxiv.org/abs/1610.00291) shows some pretty interesting results that I found super enlightening about this mysterious topic of Machine Learning.
To support the channel and access the Word documents/slides used in this video, consider JOINING the channel on KZhead or Patreon. Members get access to scripts, slides, animations, and illustrations for most of the videos on my channel!
Join and support the channel - www.youtube.com/@avb_fj/join
Patreon - / neuralbreakdownwithavb
Follow on Twitter: @neural_avb
Timestamps-
0:00: Intro
0:48: Intuition
2:11: Autoencoders
2:50: Nearest Neighbor Search
4:30: VAE and Generative AI
5:41: Latent Space Arithmetic
8:05: Finding patterns and trends

Пікірлер
  • This is the most concise, simplest and best explanation of latent space that I've found so far!

    @balcofono666@balcofono6665 ай бұрын
    • Thanks a lot! Glad you enjoyed it! 🙏🏽

      @avb_fj@avb_fj5 ай бұрын
  • Excellent clarification of multiple concepts in one pass. Helped me relate the encoders relative to latent space in a much more accessible metaphor. Thank you.

    @ManiSaintVictor@ManiSaintVictorАй бұрын
  • Great content!

    @khanhbq@khanhbq19 күн бұрын
  • Excellent video

    @Shaunmcdonogh-shaunsurfing@Shaunmcdonogh-shaunsurfing21 күн бұрын
  • I hope you can fix the quiet audio!

    @DaeOh@DaeOh8 ай бұрын
    • Unfortunately YT doesn't allow to increase the volume after posting videos. I could re-upload it, but idk if it'll be really worth it. I'll just take this as lesson for future videos, thanks for the comment!

      @avb_fj@avb_fj8 ай бұрын
  • This was a insanely good video and explanation ty

    @coder-c6961@coder-c69617 ай бұрын
    • Thanks! Awesome to hear that! 😊

      @avb_fj@avb_fj7 ай бұрын
    • @avb_fj Thank you! I'd like to see one where you code the encoder and decoder I'm coding a autoencoder and it's a little tough trying to find a good balance of reduction but also keeling the important details

      @coder-c6961@coder-c69617 ай бұрын
  • This video is clear and concise, amazing work!

    @matthewli647@matthewli6472 ай бұрын
  • Truly Amazing

    @ParthivShah@ParthivShahАй бұрын
  • Amazing Explanation Thanks a lot!

    @mohanpugaz@mohanpugaz2 ай бұрын
  • 10 min felt like 30 min, I had so many rewinds during the vid. the video is so full of info, thanks a lot.

    @abdellahfazza4534@abdellahfazza45342 ай бұрын
  • This video is so fascinating. Amazing work.

    @intellectualobserver@intellectualobserver2 ай бұрын
    • Thanks! Glad you enjoyed it!

      @avb_fj@avb_fj2 ай бұрын
  • Thanks for great video! Very well explained!

    @qwioqwe@qwioqwe5 ай бұрын
  • Great explanations! Getting better understanding how some parts of Stable Diffusion work without any efforts )

    @qwertyuuytrewq825@qwertyuuytrewq8255 ай бұрын
  • This video is super cool. It's good to see those visualized concept

    @draftbkshen9088@draftbkshen90882 ай бұрын
    • Thanks!😊

      @avb_fj@avb_fj2 ай бұрын
  • Great video, thanks

    @JeremyMamisao@JeremyMamisao3 ай бұрын
  • A great description of interpreting deep learing models. Well done!

    @calum.macleod@calum.macleod9 ай бұрын
  • This is a fantastic break down. Great pacing, wonderful examples with easy to follow metaphors. Fix your audio and keep em coming!

    @geoffgodwin@geoffgodwin8 ай бұрын
  • Awesome content!

    @TP-ct7qm@TP-ct7qm5 ай бұрын
    • 🙏🏽🙌🏼

      @avb_fj@avb_fj5 ай бұрын
  • I really appreciate your lucid explanation. Superb. I wanna request you if you could enhance the sound quality a bit. Good wishes and thanks for such videos

    @shahriarshaon9418@shahriarshaon94186 ай бұрын
    • Thanks! I’ll keep that in mind going forward…

      @avb_fj@avb_fj6 ай бұрын
  • Excellent video! Thanks for your work! QQ: Is there a repo for the real-time image manipulation software you used as your demo?

    @DanielTheCunningham@DanielTheCunningham3 ай бұрын
  • Thank you sir ! You clear the concept of latent space for me ! And I can’t wait to click on your multi modal video in this channel

    @exoticcoder5365@exoticcoder53658 ай бұрын
  • Just subscribed after your NeRF video, and this one is awesome too! You, Yannic, and Two Minute Papers are great at making AI content relatable and interesting and freaking cool :) What a time to be alive! lol

    @w000w00t@w000w00t8 ай бұрын
    • Wow that’s high praise! Those two are definitely an inspiration, so I’m kinda feeling surreal reading this! Thank you so much!! 🙌🏼🙌🏼

      @avb_fj@avb_fj8 ай бұрын
  • Which tools did you use to be able to change each principal component and see its effect on output image?

    @FrejLorenzen@FrejLorenzen5 күн бұрын
    • If I remember correctly, I just used ipywidgets inside a jupyter notebook to do the UI and display. I also wrote the logic for the PCA (sklearn), encoding/decoding, and interpolating the latent vectors.

      @avb_fj@avb_fj5 күн бұрын
  • can we do same with the pixels to enhance the image?

    @karmeshmeartia3784@karmeshmeartia37842 ай бұрын
    • Can you clarify what you meant by “doing the same with pixels”?

      @avb_fj@avb_fj2 ай бұрын
  • Oh man. I was a bit lost when you where saying encoder this decoder that but the smile example at 6:50 hit right on the nail. It's indeed mindblowing. I'd love to know more about AI for outsiders, subscribed. PS: A concept I picked up from Ezra is that AI turns semantics into geometry. So you can do king - man + woman and get queen! (paraphrasing). If you could expand on this and give more examples in different modalities... that'd be awesome.

    @neithanm@neithanm8 ай бұрын
    • Nice…glad you enjoyed it and stuck around for the whole thing. The semantic example is pretty awesome yeah… I’ve brought it up in the channel in my History of NLP video, but more examples on different modalities seems like a nice idea for a video!

      @avb_fj@avb_fj8 ай бұрын
  • Great content! Just as a FYI, might want to turn up the Mic volume. It's easier to lower the volume than to turn it up from the consumers POV.

    @teleprint-me@teleprint-me8 ай бұрын
    • Thanks for the feedback! Will keep it in mind for the next one…

      @avb_fj@avb_fj8 ай бұрын
  • Great video but the audio level is way too low. Also the video and audio is not in sync.

    @Essentia-Channel@Essentia-Channel3 ай бұрын
  • Great content!

    @fokushodl52103@fokushodl521039 ай бұрын
KZhead