Why linear discriminants?
Optimal if the classes are Gaussian with
the same covariance matrix.
Linear separators are easier to find.
Hyperplanes have few parameters,
which helps prevent overfitting.
They have a lower VC dimension than more flexible
classifiers, but we don't have time for this.
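
A quick check of the first point, as a minimal sketch rather than a full classifier. It assumes two 2-D Gaussian classes with a shared covariance matrix and equal priors. The means, covariance, test point, and all function names are made up for illustration. The log-odds of the two class densities should then equal a linear function w.x + b, with w = S^-1 (mu1 - mu0).

```python
import math

def inv2(S):
    # Inverse of a 2x2 matrix.
    (a, b), (c, d) = S
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def matvec(M, v):
    return [M[0][0] * v[0] + M[0][1] * v[1],
            M[1][0] * v[0] + M[1][1] * v[1]]

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

def log_gauss(x, mu, S):
    # Log density of a 2-D Gaussian N(mu, S) at x.
    d = [x[0] - mu[0], x[1] - mu[1]]
    det = S[0][0] * S[1][1] - S[0][1] * S[1][0]
    quad = dot(d, matvec(inv2(S), d))
    return -0.5 * quad - math.log(2 * math.pi) - 0.5 * math.log(det)

def linear_discriminant(mu0, mu1, S, prior0=0.5, prior1=0.5):
    # Weights and bias of the log-odds when both classes share covariance S.
    Sinv = inv2(S)
    w = matvec(Sinv, [mu1[0] - mu0[0], mu1[1] - mu0[1]])
    b = (-0.5 * dot(mu1, matvec(Sinv, mu1))
         + 0.5 * dot(mu0, matvec(Sinv, mu0))
         + math.log(prior1 / prior0))
    return w, b

# Illustrative (assumed) class parameters and test point.
mu0, mu1 = [0.0, 0.0], [2.0, 1.0]
S = [[1.0, 0.3], [0.3, 2.0]]
w, b = linear_discriminant(mu0, mu1, S)

x = [0.7, -1.2]
log_odds = log_gauss(x, mu1, S) - log_gauss(x, mu0, S)  # equal priors
linear = dot(w, x) + b

# With equal priors the boundary passes through the midpoint of the means.
midpoint = [(mu0[0] + mu1[0]) / 2, (mu0[1] + mu1[1]) / 2]
boundary_value = dot(w, midpoint) + b
```

The quadratic terms in x cancel only because both classes use the same S. With different covariances the log-odds would stay quadratic and the optimal boundary would no longer be a hyperplane.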