Materials for STATS 418 (Tools in Data Science), a course taught in the Master of Applied Statistics program at UCLA
updated at July 17, 2024, 5:55 p.m.
A minimal benchmark of the scalability, speed, and accuracy of commonly used open-source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib, etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks, etc.).
updated at Sept. 13, 2024, 3:54 a.m.