Split data into and set
Do until further pruning is harmful: Evaluate impact on set of pruning each possible node (plus those below it) Greedily remove the one that most improves set accuracy
produces smallest version of most accurate subtree
What if data is limited?