![[Pasted image 20240529234208.png]]
![[Pasted image 20240529235935.png]]
节点混杂度 ![[Pasted image 20240529234513.png]] [[Entropy]] ![[Pasted image 20240529234610.png]]
根据信息的增益来进行选择 ![[Pasted image 20240529235712.png]]
![[Pasted image 20240529235725.png]] 情形3并不好,可能未完全分裂 ![[Pasted image 20240529235825.png]]
![[Pasted image 20240530000007.png]]
![[Pasted image 20240530000055.png]]
![[Pasted image 20240530000244.png]] ![[Pasted image 20240530000329.png]]
- 基于样本数 ![[Pasted image 20240530000420.png]]
- 基于信息增益的阈值 ![[Pasted image 20240530000444.png]]
- 错误降低剪枝
![[Pasted image 20240530000536.png]]
在验证集上能提升则剪枝,提升泛化性
- 剪枝后新的叶节点标签赋值策略 ![[Pasted image 20240530000643.png]]
- 规则后剪枝 ![[Pasted image 20240530000830.png]]![[Pasted image 20240530000913.png]]
![[Pasted image 20240530001726.png]]
![[Pasted image 20240530001759.png]]
![[Pasted image 20240530001920.png]]
![[Pasted image 20240530002004.png]]
(from 人智导)
![[Pasted image 20240619155902.png]] ![[Pasted image 20240619155910.png]]
![[Pasted image 20240619160136.png]] ![[Pasted image 20240619160146.png]]
![[Pasted image 20240619160240.png]]
![[Pasted image 20240619160316.png]]
![[Pasted image 20240619160332.png]]![[Pasted image 20240619160348.png]]
![[Pasted image 20240619160620.png]]