Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying Developing Kaggle Notebooks
  • Table Of Contents Toc
  • Feedback & Rating feedback
Developing Kaggle Notebooks

Developing Kaggle Notebooks

By : Gabriel Preda
5 (29)
close
close
Developing Kaggle Notebooks

Developing Kaggle Notebooks

5 (29)
By: Gabriel Preda

Overview of this book

Developing Kaggle Notebooks introduces you to data analysis, with a focus on using Kaggle Notebooks to simultaneously achieve mastery in this fi eld and rise to the top of the Kaggle Notebooks tier. The book is structured as a sevenstep data analysis journey, exploring the features available in Kaggle Notebooks alongside various data analysis techniques. For each topic, we provide one or more notebooks, developing reusable analysis components through Kaggle's Utility Scripts feature, introduced progressively, initially as part of a notebook, and later extracted for use across future notebooks to enhance code reusability on Kaggle. It aims to make the notebooks' code more structured, easy to maintain, and readable. Although the focus of this book is on data analytics, some examples will guide you in preparing a complete machine learning pipeline using Kaggle Notebooks. Starting from initial data ingestion and data quality assessment, you'll move on to preliminary data analysis, advanced data exploration, feature qualifi cation to build a model baseline, and feature engineering. You'll also delve into hyperparameter tuning to iteratively refi ne your model and prepare for submission in Kaggle competitions. Additionally, the book touches on developing notebooks that leverage the power of generative AI using Kaggle Models.
Table of Contents (14 chapters)
close
close
12
Other Books You May Enjoy
13
Index

Join our book community on Discord

https://packt.link/EarlyAccessCommunity

Qr code Description automatically generated

We continue our journey around the world of data by exploring two datasets with geographical distributed data. The first dataset is Every Pub in England. This dataset contains the unique id, name, address, postcode and geographical position data for every pub in England. The second dataset is Starbucks Locations Worldwide. This dataset contains the store number, name, ownership, as well as street address, city and geographical information (latitude and longitude) for all Starbucks stores in the world. We will combine these two datasets and will add also additional geographical support data. We will learn how to work with missing data and perform data imputation if needed, how to visualize geographical data, how to clip and merge polygon data, how to generate custom maps and create multiple layers over maps.

Unlock full access

Continue reading for free

A Packt free trial gives you instant online access to our library of over 7000 practical eBooks and videos, constantly updated with the latest in tech
bookmark search playlist download font-size

Change the font size

margin-width

Change margin width

day-mode

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Delete Bookmark

Modal Close icon
Are you sure you want to delete it?
Cancel
Yes, Delete

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY