Stars
56
Forks
44
Watchers
56
Open Issues
1
Overall repository health assessment
No package.json found
This might not be a Node.js project
22
commits
Added automated feature generation, removing highly correlated features, and plotting ROC curve
609625bView on GitHubmoved hyperparam optimization reporting to another file. Extensive tweaks to randomforest and started over in randomforest2.py
7b51e0bView on GitHubImplement PCA and KMeans clustering as data preparation step before training model. Acheived submission improvement to .794
bda4932View on GitHubRenamed linearsvm.py to svc.py to indicate that this will contain SVM implementations with kernels rather than linear, which is covered by the SGDClassifier code in sgdclassifier.py file
edc9bf3View on GitHubLoss function modified_huber is a tweak of hinge which has been best so far, so I tried that. Slight decrease in accuracy
45a8e97View on GitHubPerceptron loss function didn't work at all, it couldn't even overfit successfully. Not submitting
b194ed1View on GitHubSGDClassifier optimized as logistic regression. Updated to run multicore for faster hyperparameter optimization and learning curve generation at higher iterations
86d63c1View on GitHubsmall updates to parameters for final tweak and submission
1ef5109View on GitHubUpdated SGDClassifier implementation and tweaked params for the 'hinge' loss function. Minor updates to naivebayes and random forests code
fefe9beView on GitHubLots and lots of changes. Refactor data prep into separate file. Update random forest to acheive new submission record .78+
0362f34View on GitHubexperiments with sgdclassifier model, including a simple top-n voting ensemble. Ensemble hurt but single model improved my top score slightly
d85d6e6View on GitHub