
Mastering Predictive Analytics with R, Second Edition
By :

For you to determine if your data source qualifies as big data or as needing special handling, you can start by examining your data source in the following areas:
Let's examine each of these areas.
If you are talking about the number of rows or records, then most likely your data source is not a big data source since big data is typically measured in gigabytes, terabytes, and petabytes. However, space doesn't always mean big, as these size measurements can vary greatly in terms of both volume and functionality. Additionally, data sources of several million records may qualify as big data, given their structure (or lack of structure).
Data used in predictive models may be structured or unstructured (or both) and include transactions from databases, survey results, website logs, application messages, and so on (by using a data source consisting...