# Useful (information theory)
Good materials:
best 10 mins - Entropy, Cross-Entropy and KL-Divergence
Excellent explanation of binary vs categorical cross-entropy:
some intuition about KL-divergence (not very robust)