
Data Cleaning and Exploration with Machine Learning
By :

We often have enough domain knowledge to take an approach that is more nuanced than simply minimizing prediction errors in our training data. Using this knowledge may allow us to accept more bias in our model, when small amounts of bias do not matter much substantively, to reduce variance. With SVR, we can adjust hyperparameters such as epsilon (the acceptable error range) and C (which adjusts the tolerance for errors outside of that range) to improve our model’s performance.
If a linear model can perform well on your data, linear SVR might be a good choice. We can build a linear SVR model with scikit-learn’s LinearSVR
class. Let’s try creating a linear SVR model with the gasoline tax data that we used in the previous chapter:
LinearSVR
and uniform
modules...