NY R Meetup

Statistician

Data Scientist

Model Visualization

The Data

Correlation

Heatmap

Correlation

Linear Model

coefplot

Two Models

Two Models

Elastic Net

Elastic Net

s0 s1 s2 s3 s4
Acres1-10 0 0 0 0 0
Acres10+ 0 0 0 0 0
AcresSub 1 0 0 0 0 0
FamilyTypeFemale Head 0 0 0 0 0
FamilyTypeMale Head 0 0 0 0 0
FamilyTypeMarried 0 0 0 0 0
NumBedrooms 0 0 0 0 0

Coefficient Path

Decision Tree

n= 22745 

node), split, n, loss, yval, (yprob)
      * denotes terminal node

 1) root 22745 4565 FALSE (0.7992965 0.2007035)  
   2) HouseCosts< 1825 16105 1832 FALSE (0.8862465 0.1137535) *
   3) HouseCosts>=1825 6640 2733 FALSE (0.5884036 0.4115964)  
     6) HouseCosts< 3160 4931 1687 FALSE (0.6578787 0.3421213) *
     7) HouseCosts>=3160 1709  663 TRUE (0.3879462 0.6120538)  
      14) FamilyType=Female Head,Male Head 169   46 FALSE (0.7278107 0.2721893) *
      15) FamilyType=Married 1540  540 TRUE (0.3506494 0.6493506) *

Draw the Tree

Random Forest

K-means

Multidimensional Scaling

Multidimensional Scaling

Multidimensional Scaling

Hierarchical Clustering

Now What?

Learn More

Learn More

Learn More

Learn More

Jared P. Lander

The Tools