Linear Regression

The dataset that we'll use for this recipe, for lack of creativity, is lifted from the UCI website at http://archive.ics.uci.edu/ml/machine-learning-databases/wine-quality/. This dataset has 1599 instances of various red wines, their chemical composition, and their quality. We'll use it to predict the quality of a red wine.

  1. Creating LabeledPoint
  2. Preparing the training and test data.
  3. Scaling the features.
  4. Training the model.
  5. Predicting against the test data.
  6. Evaluating the model.
  7. Regularizing the parameters.
  8. Mini batching.