Appendix | Deep Learning for Natural Language Processing

Sign In Start Free Trial

Book Overview & Buying
Table Of Contents
Feedback & Rating

Deep Learning for Natural Language Processing

By : Karthiek Reddy Bokka, Shubhangi Hora , Tanuj Jain, Monicah Wambugu

1.5 (2)

Deep Learning for Natural Language Processing

1.5 (2)

By: Karthiek Reddy Bokka, Shubhangi Hora , Tanuj Jain, Monicah Wambugu

Overview of this book

Applying deep learning approaches to various NLP tasks can take your computational algorithms to a completely new level in terms of speed and accuracy. Deep Learning for Natural Language Processing starts by highlighting the basic building blocks of the natural language processing domain. The book goes on to introduce the problems that you can solve using state-of-the-art neural network models. After this, delving into the various neural network architectures and their specific areas of application will help you to understand how to select the best model to suit your needs. As you advance through this deep learning book, you’ll study convolutional, recurrent, and recursive neural networks, in addition to covering long short-term memory networks (LSTM). Understanding these networks will help you to implement their models using Keras. In later chapters, you will be able to develop a trigger word detection application using NLP techniques such as attention model and beam search. By the end of this book, you will not only have sound knowledge of natural language processing, but also be able to select the best text preprocessing and neural network models to solve a number of NLP issues.

About the Book

About the Book

About the Authors

Description

Learning Objectives

Audience

Approach

Hardware Requirements

Software Requirements

Conventions

Installation and Setup

Install Python on Windows

Install Python on Linux

Install Python on macOS X

Installing Keras

Additional Resources

Free Chapter

Introduction to Natural Language Processing

Introduction to Natural Language Processing

Introduction

The Basics of Natural Language Processing

Capabilities of Natural language processing

Applications of Natural Language Processing

Word Embeddings

Summary

Applications of Natural Language Processing

Applications of Natural Language Processing

Introduction

POS Tagging

Applications of Parts of Speech Tagging

Chunking

Chinking

Named Entity Recognition

Summary

Introduction to Neural Networks

Introduction to Neural Networks

Introduction

Neural Networks

Training a Neural Network

Designing a Neural Network and Its Applications

Fundamentals of Deploying a Model as a Service

Summary

Foundations of Convolutional Neural Network

Foundations of Convolutional Neural Network

Introduction

Understanding the Architecture of a CNN

Training a CNN

Application Areas of CNNs

Summary

Recurrent Neural Networks

Recurrent Neural Networks

Introduction

Previous Versions of Neural Networks

RNNs

Updates and Gradient Flow

Gradients

Summary

Gated Recurrent Units (GRUs)

Gated Recurrent Units (GRUs)

Introduction

The Drawback of Simple RNNs

Gated Recurrent Units (GRUs)

Sentiment Analysis with GRU

Summary

Long Short-Term Memory (LSTM)

Long Short-Term Memory (LSTM)

Introduction

The Input Gate and the Candidate Cell State

Output Gate and Current Activation

Neural Language Translation

Summary

State-of-the-Art Natural Language Processing

State-of-the-Art Natural Language Processing

Introduction

Other Architectures and Developments

Activity 11: Build a Text Summarization Model

Summary

A Practical NLP Project Workflow in an Organization

A Practical NLP Project Workflow in an Organization

Introduction

Problem Definition

Data Acquisition

Google Colab

Flask

Deployment

Summary

Appendix

Appendix

Chapter 1: Introduction to Natural Language Processing

Chapter 2: Applications of Natural Language Processing

Chapter 3: Introduction to Neural Networks

Chapter 4: Introduction to convolutional networks

Chapter 5: Foundations of Recurrent Neural Network

Chapter 6: Foundations of GRUs

Chapter 7: Foundations of LSTM

Chapter 8: State of the art in Natural Language Processing

Chapter 9: A practical NLP project workflow in an organisation

Customer Reviews

1.5 (2)

5 star

0

4 star

0

3 star

0

2 star

50%

1 star

50%

Chapter 9: A practical NLP project workflow in an organisation

Code for LSTM model

Check if GPU is detected
import tensorflow as tf
tf.test.gpu_device_name()
Setting up collar notebook
from google.colab import drive
drive.mount('/content/gdrive')
# Run the below command in a new cell
cd /content/gdrive/My Drive/Lesson-9/
# Run the below command in a new cell
!unzip data.csv.zip
Import necessary Python packages and classes.
import os
import re
import pickle
import pandas as pd
from keras.preprocessing.text import Tokenizer
from keras.preprocessing.sequence import pad_sequences
from keras.models import Sequential
from keras.layers import Dense, Embedding, LSTM
Load the data file.
def preprocess_data(data_file_path):
data = pd.read_csv(data_file_path, header=None) # read the csv
data.columns = ['rating', 'title', 'review'] # add column names
data['review'] = data['review'].apply(lambda x: x.lower()) # change all text to lower
data['review'] = data['review'].apply((lambda x: re.sub('[^a-zA-z0-9\s]','',x))) # remove all numbers
return data
df = preprocess_data('data.csv')
Initialize tokenization.
max_features = 2000
maxlength = 250
tokenizer = Tokenizer(num_words=max_features, split=' ')
Fit tokenizer.
tokenizer.fit_on_texts(df['review'].values)
X = tokenizer.texts_to_sequences(df['review'].values)
Pad sequences.
X = pad_sequences(X, maxlen=maxlength)
Get target variable
y_train = pd.get_dummies(df.rating).values
embed_dim = 128
hidden_units = 100
n_classes = 5
model = Sequential()
model.add(Embedding(max_features, embed_dim, input_length = X.shape[1]))
model.add(LSTM(hidden_units))
model.add(Dense(n_classes, activation='softmax'))
model.compile(loss = 'categorical_crossentropy', optimizer='adam',metrics = ['accuracy'])
print(model.summary())
Fit the model.
model.fit(X[:100000, :], y_train[:100000, :], batch_size = 128, epochs=15, validation_split=0.2)
Save model and tokenizer.
model.save('trained_model.h5') # creates a HDF5 file 'trained_model.h5'
with open('trained_tokenizer.pkl', 'wb') as f: # creates a pickle file 'trained_tokenizer.pkl'
pickle.dump(tokenizer, f)
from google.colab import files
files.download('trained_model.h5')
files.download('trained_tokenizer.pkl')

Code for Flask

Import the necessary Python packages and classes.
import re
import pickle
import numpy as np
from flask import Flask, request, jsonify
from keras.models import load_model
from keras.preprocessing.sequence import pad_sequences
Define the input files and load in dataframe
def load_variables():
global model, tokenizer
model = load_model('trained_model.h5')
model._make_predict_function() # https://github.com/keras-team/keras/issues/6462
with open('trained_tokenizer.pkl', 'rb') as f:
tokenizer = pickle.load(f)
Define preprocessing functions similar to the training code:
def do_preprocessing(reviews):
processed_reviews = []
for review in reviews:
review = review.lower()
processed_reviews.append(re.sub('[^a-zA-z0-9\s]', '', review))
processed_reviews = tokenizer.texts_to_sequences(np.array(processed_reviews))
processed_reviews = pad_sequences(processed_reviews, maxlen=250)
return processed_reviews
Define a Flask app instance:
app = Flask(__name__)
Define an endpoint that displays a fixed message:
@app.route('/')
def home_routine():
return 'Hello World!'
We'll have a prediction endpoint, to which we can send our review strings. The kind of HTTP request we will use is a 'POST' request:
@app.route('/prediction', methods=['POST'])
def get_prediction():
# get incoming text
# run the model
if request.method == 'POST':
data = request.get_json()
data = do_preprocessing(data)
predicted_sentiment_prob = model.predict(data)
predicted_sentiment = np.argmax(predicted_sentiment_prob, axis=-1)
return str(predicted_sentiment)
Start the web server.
if __name__ == '__main__':
# load model
load_variables()
app.run(debug=True)
Save this file as app.py (any name could be used). Run this code from the terminal using app.py:
python app.py
The output is as follows:

Figure 9.31: Output for flask

Figure 9.31: Output for flask

Search

Your notes and bookmarks