Apache Flume: Distributed Log Collection for Hadoop - Second Edition

English | Feb 25, 2015 | ISBN: 1784392170 | 175 Pages | PDF (Converted) | 1.82 MB

Design and implement a series of Flume agents to send streamed data into Hadoop. Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It is used to stream logs from application servers to HDFS for ad hoc analysis.

About This Book

Construct a series of Flume agents using the Apache Flume service to efficiently collect, aggregate, and move large amounts of event data
Configure failover paths and load balancing to remove single points of failure
Use this step-by-step guide to stream logs from application servers to Hadoop's HDFS

Who This Book Is For

If you are a Hadoop programmer who wants to learn how to use Flume to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge of Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop Distributed File System (HDFS) is assumed.

In Detail

This book starts with an architectural overview of Flume and its logical components. It explores channels, sinks, and sink processors, followed by sources and channel selectors. By the end of this book, you will be fully equipped to construct a series of Flume agents to dynamically transport your stream data and logs from your systems into Hadoop.
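
To make the source, channel, and sink roles concrete, here is a toy Python sketch of how an event might flow through a single agent. It is not Flume code and does not use any Flume API: the class `MemoryChannel`, the functions `run_source` and `run_sink`, and the list `hdfs_stand_in` are hypothetical names invented for this illustration, and a plain Python list stands in for an HDFS file.

```python
from collections import deque


class MemoryChannel:
    """Toy stand-in for a channel: a bounded FIFO buffer between source and sink."""

    def __init__(self, capacity=100):
        self.capacity = capacity
        self._events = deque()

    def put(self, event):
        # Refuse new events once the buffer is full instead of growing without bound.
        if len(self._events) >= self.capacity:
            raise OverflowError("channel full")
        self._events.append(event)

    def take(self):
        # Return the oldest buffered event, or None when the channel is empty.
        return self._events.popleft() if self._events else None


def run_source(lines, channel):
    """Source: wrap each log line as an event (headers plus body) and buffer it."""
    for line in lines:
        channel.put({"headers": {}, "body": line})


def run_sink(channel, destination):
    """Sink: drain the channel and append each event body to the destination."""
    delivered = 0
    while (event := channel.take()) is not None:
        destination.append(event["body"])
        delivered += 1
    return delivered


channel = MemoryChannel(capacity=10)
hdfs_stand_in = []  # hypothetical stand-in for a file in HDFS
run_source(["GET /index.html 200", "GET /missing 404"], channel)
count = run_sink(channel, hdfs_stand_in)
print(count, hdfs_stand_in)
```

The channel sits between the two stages so that the source and the sink do not have to run at the same pace, which is the main reason the three components are kept separate in this sketch.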

This is a step-by-step book that guides you through the architecture and components of Flume, progressing gradually from the simplest features to the most advanced. It covers different approaches, which are then pulled together in a real-world, end-to-end use case.
