Robots Atlas>ROBOTS ATLAS

Machine Learning · Classification

Decision trees — splits, Gini, and pruning

Classification

Introduction

A decision tree is a non-linear classifier that recursively partitions feature space with axis-aligned splits. Each split is a question "x_i ≤ t?", and leaves contain class predictions. This lesson dissects the CART algorithm (Breiman et al. 1984): Gini impurity, entropy, information gain, stopping criteria, post-pruning, and why trees are insensitive to feature scaling but extremely prone to overfitting without regularization.