Classification and Regression - RDD-based API
The spark.mllib package supports various methods for
binary classification,
multiclass
classification, and
regression analysis. The table below outlines
the supported algorithms for each type of problem.
| Problem Type | Supported Methods |
|---|---|
| Binary Classification | linear SVMs, logistic regression, decision trees, random forests, gradient-boosted trees, naive Bayes |
| Multiclass Classification | logistic regression, decision trees, random forests, naive Bayes |
| Regression | linear least squares, Lasso, ridge regression, decision trees, random forests, gradient-boosted trees, isotonic regression |
More details for these methods can be found here:
