Getting Started with Kudu: Perform Fast Analytics on Fast Data

Getting Started with Kudu: Perform Fast Analytics on Fast Data

English | July 11th, 2018 | ISBN: 1491980257 | 288 Pages | EPUB (True/HQ) | 4.36 MB

Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to choose solutions using the least common denominator-either fast analytics at the cost of slow data ingestion or fast data ingestion at the cost of slow analytics. There is an answer to this problem. With the Apache Kudu column-oriented data store, you can easily perform fast analytics on fast data. This practical guide shows you how.

Begun as an internal project at Cloudera, Kudu is an open source solution compatible with many data processing frameworks in the Hadoop environment. In this book, current and former solutions professionals from Cloudera provide use cases, examples, best practices, and sample code to help you get up to speed with Kudu.

Explore Kudu's high-level design, including how it spreads data across servers
Fully administer a Kudu cluster, enable security, and add or remove nodes
Learn Kudu's client-side APIs, including how to integrate Apache Impala, Spark, and other frameworks for data manipulation
Examine Kudu's schema design, including basic concepts and primitives necessary to make your project successful
Explore case studies for using Kudu for real-time IoT analytics, predictive modeling, and in combination with another storage engine


[Fast Download] Getting Started with Kudu: Perform Fast Analytics on Fast Data

Ebooks related to "Getting Started with Kudu: Perform Fast Analytics on Fast Data" :
Scalability Patterns: Best Practices for Designing High Volume Websites
Visualizing Streaming Data: Interactive Analysis Beyond Static Limits
A Common-Sense Guide to Data Structures and Algorithms: Level Up Your Core Programming Skills
DevOps, DBAs, and DBaaS: Managing Data Platforms to Support Continuous Integration
The No-nonsense Guide to Born-digital Content
Fundamentals of Database Management Systems, 2nd Edition
Hadoop MapReduce v2 Cookbook Second Edition
Regression Analysis with Python
Apache Flume: Distributed Log Collection for Hadoop - Second Edition
SQL Queries for Mere Mortals: A Hands-On Guide to Data Manipulation in SQL, 3rd Edition
Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.