•There are a class of functions that could label the data.
•Our goal is to select the correct function, with as little information as possible.
•Don’t think of data coming from a class described by probability distributions.
•Look at worst-case performance.
–This is CS’ey approach.
–In statistical model, worst case not meaningful.