Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Hands-On Predictive Analytics with Python
  • Toc
  • feedback
Hands-On Predictive Analytics with Python

Hands-On Predictive Analytics with Python

By : Alvaro Fuentes
4.4 (8)
close
Hands-On Predictive Analytics with Python

Hands-On Predictive Analytics with Python

4.4 (8)
By: Alvaro Fuentes

Overview of this book

Predictive analytics is an applied field that employs a variety of quantitative methods using data to make predictions. It involves much more than just throwing data onto a computer to build a model. This book provides practical coverage to help you understand the most important concepts of predictive analytics. Using practical, step-by-step examples, we build predictive analytics solutions while using cutting-edge Python tools and packages. The book's step-by-step approach starts by defining the problem and moves on to identifying relevant data. We will also be performing data preparation, exploring and visualizing relationships, building models, tuning, evaluating, and deploying model. Each stage has relevant practical examples and efficient Python code. You will work with models such as KNN, Random Forests, and neural networks using the most important libraries in Python's data science stack: NumPy, Pandas, Matplotlib, Seaborn, Keras, Dash, and so on. In addition to hands-on code examples, you will find intuitive explanations of the inner workings of the main techniques and algorithms used in predictive analytics. By the end of this book, you will be all set to build high-performance predictive analytics solutions using Python programming.
Table of Contents (11 chapters)
close

Bivariate EDA

Now that we understand our features individually, it is time to start exploring whether there are any relationships between them. Bivariate EDA techniques are used to explore pairs of variables, and start understanding how they relate to each other.

How many pair relationships will we have? For a dataset of k features, we will have distinct pairs. In our original dataset we have 10 features, so we will have pairs of variables to analyze. This is a very small dataset, (in terms of the number of features)—but as you can see, the formula is basically a quadratic term, so for a large dataset, the number of pairs goes up quickly.

Of course, you don't actually have to analyze every possible pair; only choose those that are interesting or which will answer a particular question you may have about the dataset. In addition, pandas and Seaborn will make our task...

Unlock full access

Continue reading for free

A Packt free trial gives you instant online access to our library of over 7000 practical eBooks and videos, constantly updated with the latest in tech
bookmark search playlist download font-size

Change the font size

margin-width

Change margin width

day-mode

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Delete Bookmark

Modal Close icon
Are you sure you want to delete it?
Cancel
Yes, Delete