Why linear discriminants?
•Optimal if the classes are Gaussian with equal covariance matrices.
•Linear separators are easier to find.
•Hyperplanes have few parameters, which helps prevent overfitting.
–They have a lower VC dimension than more flexible classifiers, but we don’t have time for this.
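The optimality claim for Gaussian classes can be checked numerically. The sketch below is only an illustration, not part of the original slides: it assumes two 2-D Gaussian classes sharing one covariance matrix, and the means, covariance, and helper names (inv2, log_gauss, linear_discriminant) are made up for the example. With equal covariances the quadratic terms in the log-odds cancel, leaving a linear function w.x + b with w = S^{-1}(mu1 - mu2).

```python
import math

def inv2(S):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = S
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def matvec(M, v):
    """Product of a 2x2 matrix and a 2-vector."""
    return [M[0][0] * v[0] + M[0][1] * v[1],
            M[1][0] * v[0] + M[1][1] * v[1]]

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

def log_gauss(x, mu, S):
    """Log density of a 2-D Gaussian N(mu, S) evaluated at x."""
    (a, b), (c, d) = S
    det = a * d - b * c
    diff = [x[0] - mu[0], x[1] - mu[1]]
    quad = dot(diff, matvec(inv2(S), diff))
    return -0.5 * quad - math.log(2 * math.pi) - 0.5 * math.log(det)

def linear_discriminant(mu1, mu2, S, prior1=0.5):
    """Return (w, b) so that the class-1 vs class-2 log-odds is w.x + b."""
    Sinv = inv2(S)
    w = matvec(Sinv, [mu1[0] - mu2[0], mu1[1] - mu2[1]])
    b = (-0.5 * dot(mu1, matvec(Sinv, mu1))
         + 0.5 * dot(mu2, matvec(Sinv, mu2))
         + math.log(prior1 / (1 - prior1)))
    return w, b

# Illustrative (made-up) class means and shared covariance.
mu1, mu2 = [1.0, 2.0], [-1.0, 0.0]
S = [[2.0, 0.5], [0.5, 1.0]]
w, b = linear_discriminant(mu1, mu2, S)

def log_odds(x):
    """Exact Bayes log-odds under equal priors (the priors cancel)."""
    return log_gauss(x, mu1, S) - log_gauss(x, mu2, S)

# The exact log-odds and the linear function agree at any test point.
for x in ([0.0, 0.0], [3.0, -1.0], [-2.5, 4.0]):
    assert abs(log_odds(x) - (dot(w, x) + b)) < 1e-9
```

Classifying x as class 1 when w.x + b > 0 then reproduces the Bayes decision rule for this model. If the two covariances differed, the quadratic terms would no longer cancel and the optimal boundary would be a quadric rather than a hyperplane.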