Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying Developing Kaggle Notebooks
  • Table Of Contents Toc
  • Feedback & Rating feedback
Developing Kaggle Notebooks

Developing Kaggle Notebooks

By : Gabriel Preda
5 (29)
close
close
Developing Kaggle Notebooks

Developing Kaggle Notebooks

5 (29)
By: Gabriel Preda

Overview of this book

Developing Kaggle Notebooks introduces you to data analysis, with a focus on using Kaggle Notebooks to simultaneously achieve mastery in this fi eld and rise to the top of the Kaggle Notebooks tier. The book is structured as a sevenstep data analysis journey, exploring the features available in Kaggle Notebooks alongside various data analysis techniques. For each topic, we provide one or more notebooks, developing reusable analysis components through Kaggle's Utility Scripts feature, introduced progressively, initially as part of a notebook, and later extracted for use across future notebooks to enhance code reusability on Kaggle. It aims to make the notebooks' code more structured, easy to maintain, and readable. Although the focus of this book is on data analytics, some examples will guide you in preparing a complete machine learning pipeline using Kaggle Notebooks. Starting from initial data ingestion and data quality assessment, you'll move on to preliminary data analysis, advanced data exploration, feature qualifi cation to build a model baseline, and feature engineering. You'll also delve into hyperparameter tuning to iteratively refi ne your model and prepare for submission in Kaggle competitions. Additionally, the book touches on developing notebooks that leverage the power of generative AI using Kaggle Models.
Table of Contents (14 chapters)
close
close
12
Other Books You May Enjoy
13
Index

To get the most out of this book

You should have a basic understanding of Python and familiarity with Jupyter Notebooks. Ideally, you will also need some basic knowledge of libraries like pandas and NumPy.

The chapters contain both theory and code. If you want to run the code in the book, the easiest way is to follow the links on the README.md introduction page in the GitHub project for each notebook, fork the notebook, and run it on Kaggle. The Kaggle environment is pre-installed with all the needed Python libraries. Alternatively, you can download the notebooks from the GitHub project, upload them on Kaggle, attach the dataset resources mentioned in the book for each specific example, and run them. Another alternative is to download the datasets on Kaggle, install your own local environment, and run the notebooks there. In this case, however, you will need more advanced knowledge about how to set up a conda environment locally and install Python libraries using pip install or conda install.

Requirements for the chapter exercises

Version no.

Python

3.9 or higher

All exercises developed on the Kaggle platform use the current Python version, which is 3.10 at the time of writing this book.

Download the example code files

The code bundle for the book is hosted on GitHub at https://github.com/PacktPublishing/Developing-Kaggle-Notebooks. We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Download the color images

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. You can download it here: https://packt.link/gbp/9781805128519.

Conventions used

There are a number of text conventions used throughout this book.

CodeInText: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. For example: “Run the info() function for each dataset.”

A block of code is set as follows:

for sentence in selected_text["comment_text"].head(5):
    print("\n")
    doc = nlp(sentence)
    for ent in doc.ents:
        print(ent.text, ent.start_char, ent.end_char, ent.label_)
    displacy.render(doc, style="ent",jupyter=True)

Any command-line input or output is written as follows:

!pip install kaggle

Bold: Indicates a new term, an important word, or words that you see on the screen. For instance, words in menus or dialog boxes appear in the text like this. For example: “You will have to start a notebook and then choose the Set as Utility Script menu item from the File menu.”

Warnings or important notes appear like this.

Tips and tricks appear like this.

Unlock full access

Continue reading for free

A Packt free trial gives you instant online access to our library of over 7000 practical eBooks and videos, constantly updated with the latest in tech
bookmark search playlist download font-size

Change the font size

margin-width

Change margin width

day-mode

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Delete Bookmark

Modal Close icon
Are you sure you want to delete it?
Cancel
Yes, Delete

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY