Veracity of Big Data

Veracity of Big Data

Examine the problem of maintaining the quality of big data and discover novel solutions. You will learn the four V’s of big data, including veracity, and study the problem from various angles. The solutions discussed are drawn from diverse areas of engineering and math, including machine learning, statistics,…

Complete Guide to Open Source Big Data Stack

Complete Guide to Open Source Big Data Stack

See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together. In the Complete Guide to Open Source Big Data Stack, the author begins by creating a private…

Processing Big Data with Azure HDInsight

Processing Big Data with Azure HDInsight

Get a jump start on using Azure HDInsight and Hadoop Ecosystem components. As most Hadoop and Big Data projects are written in either Java, Scala, or Python, this book minimizes the effort to learn another language and is written from the perspective of a .NET developer. Hadoop components…

Handbook of Big Data Technologies

Handbook of Big Data Technologies

This handbook offers comprehensive coverage of recent advancements in Big Data technologies and related paradigms.  Chapters are authored by international leading experts in the field, and have been reviewed and revised for maximum reader value. The volume consists of twenty-five chapters organized into four main parts. Part one…

Big Data Analytics with R and Hadoop

Big Data Analytics with R and Hadoop

Big data analytics is the process of examining large amounts of data of a variety of types to uncover hidden patterns, unknown correlations, and other useful information. Such information can provide competitive advantages over rival organizations and result in business benefits, such as more effective marketing and increased…

SQL on Big Data

SQL on Big Data

Learn various commercial and open source products that perform SQL on Big Data platforms. You will understand the architectures of the various SQL engines being used and how the tools work internally in terms of execution, data movement, latency, scalability, performance, and system requirements. This book consolidates in…

Big Data SMACK

Big Data SMACK

This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology―Scala/Spark, Mesos, Akka, Cassandra, and Kafka―in every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting,…