Neural Networks: From Fundamentals to Modern AI · Generative Models: Autoencoders, VAEs, and GANs
VAE: the reparametrization trick and ELBO
Generative Models: Autoencoders, VAEs, and GANs
Introduction
A Variational Autoencoder (Kingma & Welling 2014, Rezende et al. 2014) replaces the deterministic encoding of an AE with a probabilistic one: the encoder produces parameters of q_φ(z|x) = N(μ_φ(x), σ²_φ(x)), the prior is p(z) = N(0, I), and training maximises the ELBO = E_q[log p_θ(x|z)] − KL(q_φ(z|x) ‖ p(z)). This lesson breaks down: why naive sampling from q gives no gradient, how the reparametrization trick (z = μ + σ ⊙ ε, ε ~ N(0,I)) fixes it, how to derive the ELBO from log p(x), what the KL and reconstruction terms represent, and pathologies like posterior collapse and KL vanishing.