Given a year of EHR data for patients without Diabetes, we predict which patients will be diagnosed with Diabetes in the next year.
Data Scientist A
This step is meant to evaluate the model from a business perspective and seeks to determine if there is some business reason why this model is deficient.
This step is a quality assurance step insuring we used all the the appropriate items in the models.
Team decides whether to finish the project and move on to deployment, initiate further iterations, or set up new data science projects.
List of possible actions
Decisions
===================================================================================================
Data Scientist B
Assessment of Data Science results
Approved models
Below is our list of approved models:
*Graident Boosting Machine*
GBM Model | CV Error | Folds | Trees | Depth | Shrinkage | Bag Frac | Node Size | |
---|---|---|---|---|---|---|---|---|
gbm 10_5 0.003_0.80_30 | 0.31210 | 10 | 7,550 | 5 | 0.003 | 0.80 | 30 | 0.001 |
gbm10_5_0.003_0.80_30_tolhalf_ext | 0.31153 | 10 | 8,500 | 5 | 0.003 | 0.80 | 30 | 0.0005 |
gbm10_5_0.0025_0.80_30 | 0.31319 | 10 | 8,000 | 5 | 0.003 | 0.80 | 30 | 0.001 |
gbm10_5_0.0025_0.80_30_ext | 0.31312 | 10 | 7,750 | 5 | 0.0025 | 0.80 | 30 | 0.001 |
gbm10_5_0.0025_0.80_30_tolhalf | 0.31122 | 10 | 11,000 | 5 | 0.0025 | 0.80 | 30 | 0.0005 |
gbm10_5_0.0025_0.80_30_tolhalf_ext | 0.31139 | 10 | 10,450 | 5 | 0.0025 | 0.80 | 30 | 0.0005 |
gbm20_5_0.002_0.80_10 | 0.31049 | 20 | 13,450 | 5 | 0.002 | 0.80 | 10 | 0.001 |
gbm20_5_0.002_0.80_15 | 0.31040 | 20 | 12,950 | 5 | 0.002 | 0.80 | 15 | 0.001 |
gbm20_5_0.0025_0.80_20 | 0.30878 | 20 | 12,300 | 5 | 0.0025 | 0.80 | 20 | 0.001 |
gbm20_5_0.0025_0.80_40 | 0.31040 | 20 | 10,500 | 5 | 0.0025 | 0.80 | 40 | 0.001 |
gbm20_6_0.002_0.80_30 | 0.30931 | 20 | 12,200 | 6 | 0.002 | 0.80 | 30 | 0.001 |
*Random Forest*
RF Model | OOB MSE | Trees | Node Size |
---|---|---|---|
RF1 | 0.10055 | 15,000 | 5 |
RF5 | 0.10076 | 30,000 | 15 |
RF2 | 0.10089 | 15,000 | 20 |
RF3 | 0.10102 | 15,000 | 40 |
List of possible actions
Decisions