Next:
Up:
Previous:
Split data into training and
validation set
Do until further pruning is harmful:
- 1.
- Evaluate impact on
validation set of pruning each possible node (plus those
below it)
- 2.
- Greedily remove the one that most improves
validation set accuracy
- produces smallest version of most accurate subtree
- What if data is limited?