Big Data Forensics: Learning Hadoop Investigations

By : Joe Sremack

5 (3)

Buy this Book

Big Data Forensics: Learning Hadoop Investigations

5 (3)

By: Joe Sremack

Buy this Book

Overview of this book

Big Data forensics is an important type of digital investigation that involves the identification, collection, and analysis of large-scale Big Data systems. Hadoop is one of the most popular Big Data solutions, and forensically investigating a Hadoop cluster requires specialized tools and techniques. With the explosion of Big Data, forensic investigators need to be prepared to analyze the petabytes of data stored in Hadoop clusters. Understanding Hadoop’s operational structure and performing forensic analysis with court-accepted tools and best practices will help you conduct a successful investigation. Discover how to perform a complete forensic investigation of large-scale Hadoop clusters using the same tools and techniques employed by forensic experts. This book begins by taking you through the process of forensic investigation and the pitfalls to avoid. It will walk you through Hadoop's internals and architecture, and you will discover what types of information Hadoop stores and how to access that data. You will learn to identify Big Data evidence using techniques to survey a live system and interview witnesses. After setting up your own Hadoop system, you will collect evidence using techniques such as forensic imaging and application-based extractions. You will analyze Hadoop evidence using advanced tools and techniques to uncover events and statistical information. Finally, data visualization and evidence presentation techniques are covered to help you properly communicate your findings to any audience.

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Free Chapter

1. Starting Out with Forensic Investigations and Big Data

An overview of computer forensics

What is Big Data?

Big Data forensics

Summary

2. Understanding Hadoop Internals and Architecture

The Hadoop architecture

Hadoop data analysis tools

Managing files in Hadoop

The Hadoop forensic evidence ecosystem

Running Hadoop

Summary

3. Identifying Big Data Evidence

Identifying evidence

Locating sources of data

The chain of custody documentation

Summary

4. Collecting Hadoop Distributed File System Data

Forensically collecting a cluster system

Physical versus remote collections

HDFS collections through the host operating system

The Hadoop shell command collection

Collection via Sqoop

Other HDFS collection approaches

Summary

5. Collecting Hadoop Application Data

Application collection approaches

Validating application collections

Collecting Hive evidence

Collecting HBase evidence

Collecting other Hadoop application data and non-Hadoop data

Summary

6. Performing Hadoop Distributed File System Analysis

The forensic analysis process

Analysis preparation

Analysis

Summary

7. Analyzing Hadoop Application Data

Preparing the analysis environment

Pre-analysis steps

Analyzing data

Summary

8. Presenting Forensic Findings

Types of reports

Developing the report

Testimony and other presentations

Summary

Index

Customer Reviews

5 (3)

5 star

100%

4 star

3 star

2 star

1 star

Big Data Forensics: Learning Hadoop Investigations

By : Joe Sremack

Big Data Forensics: Learning Hadoop Investigations

By: Joe Sremack

Overview of this book

Collecting HBase evidence

Delete Bookmark