Hiroki Naganuma

8.


(a).

Write down the log-likelihood objective.
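
As a starting point, here is a generic sketch of the objective. The specific model from the problem statement is not reproduced in the text, so this assumes only that the data are $m$ i.i.d. samples $x^{(1)}, \dots, x^{(m)}$ and that the model density is $p_{\text{model}}(x; \theta)$; the symbols $m$ and $x^{(i)}$ are assumed notation, not taken from the original.

```latex
% Assumption: x^{(1)}, ..., x^{(m)} are i.i.d. samples; m and x^{(i)} are illustrative notation.
\ell(\theta)
  = \log \prod_{i=1}^{m} p_{\text{model}}\bigl(x^{(i)}; \theta\bigr)
  = \sum_{i=1}^{m} \log p_{\text{model}}\bigl(x^{(i)}; \theta\bigr),
\qquad
\hat{\theta}_{\text{ML}} = \arg\max_{\theta} \, \ell(\theta).
```

For a concrete answer, the density of the model given in the problem would be substituted for $p_{\text{model}}$.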

(b).

Show that maximizing this likelihood objective is equivalent to minimizing the KL divergence to the sampled data, $D_{\mathrm{KL}}\bigl(\hat{p}_{\text{data}}(x) \,\|\, p_{\text{model}}(x; \theta)\bigr)$.
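
One way the argument might go, sketched under the assumption that $\hat{p}_{\text{data}}$ is the empirical distribution placing mass $1/m$ on each of $m$ i.i.d. samples $x^{(1)}, \dots, x^{(m)}$ (again, $m$ and $x^{(i)}$ are assumed notation):

```latex
% Assumption: \hat{p}_{data} is the empirical distribution over m i.i.d. samples.
\begin{aligned}
D_{\mathrm{KL}}\bigl(\hat{p}_{\text{data}} \,\|\, p_{\text{model}}\bigr)
  &= \mathbb{E}_{x \sim \hat{p}_{\text{data}}}
     \bigl[\log \hat{p}_{\text{data}}(x) - \log p_{\text{model}}(x; \theta)\bigr] \\
  &= \underbrace{\mathbb{E}_{x \sim \hat{p}_{\text{data}}}
     \bigl[\log \hat{p}_{\text{data}}(x)\bigr]}_{\text{does not depend on } \theta}
     \; - \; \frac{1}{m} \sum_{i=1}^{m} \log p_{\text{model}}\bigl(x^{(i)}; \theta\bigr).
\end{aligned}
% Dropping the theta-independent term and the positive factor 1/m:
\arg\min_{\theta} D_{\mathrm{KL}}\bigl(\hat{p}_{\text{data}} \,\|\, p_{\text{model}}\bigr)
  = \arg\max_{\theta} \sum_{i=1}^{m} \log p_{\text{model}}\bigl(x^{(i)}; \theta\bigr).
```

The first term depends only on the data, and scaling by the positive constant $1/m$ does not move the maximizer, so the minimizer of the KL divergence coincides with the maximizer of the log-likelihood.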