Scala for Machine Learning
The ordinary least squares method for finding the regression parameters is a special case of maximum likelihood estimation. Therefore, regression models are subject to the same overfitting challenge as any other discriminative model. As stated in the Overfitting section of Chapter 2, Hello World!, regularization is used to reduce model complexity and avoid overfitting.
Regularization consists of adding a penalty function J(w) to the loss function (or RSS in the case of a regressive classifier) in order to prevent the model parameters (or weights) from reaching large values. A model that fits a training set very well tends to have many feature variables with relatively large weights. This penalization of large weights is known as shrinkage. Practically, shrinkage involves adding a function of the model parameters to the loss function:
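A minimal sketch of this penalized objective, assuming λ denotes the regularization factor (a symbol not defined in this excerpt), could be written as:

```latex
\min_{w} \; \mathcal{L}(w) \;=\; \mathrm{RSS}(w) \;+\; \lambda \, J(w)
```

Here RSS(w) measures the fit to the training data, while λ controls how strongly the penalty J(w) shrinks the weights toward zero.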
The penalty function is completely independent of the training set {x, y}. The penalty term is usually...
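To make the idea concrete, here is a minimal Scala sketch of a regularized loss, assuming an L2 (ridge-style) penalty as the shrinkage function; the names `rss`, `l2Penalty`, `loss`, and `lambda` are illustrative and not taken from the book's source code:

```scala
object RegularizedLoss {
  // Residual sum of squares for linear predictions w·x against labels y
  def rss(weights: Array[Double], xs: Array[Array[Double]], ys: Array[Double]): Double =
    xs.zip(ys).map { case (x, y) =>
      val pred = x.zip(weights).map { case (xi, wi) => xi * wi }.sum
      val residual = y - pred
      residual * residual
    }.sum

  // L2 penalty J(w) = sum of squared weights; note that it depends only on
  // the model parameters w, never on the training set {x, y}
  def l2Penalty(weights: Array[Double]): Double =
    weights.map(w => w * w).sum

  // Regularized loss: RSS + lambda * J(w), where lambda controls shrinkage
  def loss(weights: Array[Double],
           xs: Array[Array[Double]],
           ys: Array[Double],
           lambda: Double): Double =
    rss(weights, xs, ys) + lambda * l2Penalty(weights)
}
```

Increasing `lambda` trades training-set fit for smaller weights, which is exactly the shrinkage effect described above.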