Text to Image in 5 minutes: Parti, Dall-E 2, Imagen

2022 ж. 10 Там.
11 533 Рет қаралды

The key ideas and intuition for how these AI image generation systems work.
Part 2: • Text to Image: Part 2...
Text + Image Generation Playlist: • Text + Image Generation

Пікірлер
  • Really like how the explanation is not very mathy but very intuitive!

    @xuanluo6997@xuanluo6997 Жыл бұрын
  • The most intuitive explanation I've seen. Why just 2.5k views?!

    @matveyshishov@matveyshishov Жыл бұрын
  • I really like your work. Only thing i will ask for is that you add (part 1) to this video. I was a bit confused when i found part 2 first and kept looking for a part 1. Looking forward to watching more of your work in the future 🙂

    @mohammadyasser785@mohammadyasser785 Жыл бұрын
  • This is awesome ❤ thanks for explaining, have you considered doing AI 101 for all the modern AI ?

    @saikatnextd@saikatnextd11 ай бұрын
  • thank you for this. but how LLM can suddenly knows how to build a multiline triangle?

    @azwaabrasid@azwaabrasid Жыл бұрын
  • Does Midjourney have some differences?

    @Dron008@Dron008 Жыл бұрын
  • At 4:28 , it should be a 32*32 section instead of 8*8 (which is what was said in the video) or did I miss something?

    @AshrayMalhotra@AshrayMalhotra Жыл бұрын
    • The 256x256 image is represented as a 32x32 grid of patches, where each patch is 8x8 -- hopefully that helps!

      @g5min@g5min Жыл бұрын
KZhead