1st piece is average log probability when both x part ,z part are observed.when z is infered from q model distribution.2nd piece is function of q and doesn't depend on our generative modelif q is chosen as posterior distribution z given x under our generative model, now there are no gap between inequalityposterior is hard to computeinvert neural network in probabilistic way which is very hard be..