Table of Contents¶
- Welcome
- Principal Component Analysis (PCA)
- Decision Tree and Ensemble Learning
- Decision Tree
- Titanic Example
- Loading data using BeautifulSoup and urlib3 as the htm client
- Processing data
- Check the importance of each feature
- Information gain analysis
- Pick features and lable to construct the decision tree
- Process categorical variables to numerical
- Split to train and test data
- Build a decision tree
- Tree visualization
- Evaluate the tree
- Ensemble learning to avoid over fitting
- Statistical Analysis for Income Data
- National Water Data: HYDAT
- Simple Implementation of Dijkstra Algorithm
- Summary Statistics for Well Data
- Reshaping in Pandas - Pivot, Pivot-Table, Stack and Unstack