

Hari Shreedharan is a PMC Member and Committer on the Apache Flume Project. As a PMC member, he is involved in making decisions on the direction of the project.
Using Flume
Flexible, Scalable, and Reliable Data Streaming
Summary
How can you get your data from frontend servers to Hadoop in near real time? With this complete reference guide, you’ll learn Flume’s rich set of features for collecting, aggregating, and writing large amounts of streaming data to the Hadoop Distributed File System (HDFS), Apache HBase, SolrCloud, Elastic Search, and other systems.
'Using Flume' shows operations engineers how to configure, deploy, and monitor a Flume cluster, and teaches developers how to write Flume plugins and custom components for their specific use cases. You’ll learn about Flume’s design and implementation, as well as various features that make it highly scalable, flexible, and reliable.
- Learn how Flume provides a steady rate of flow by acting as a buffer between data producers and consumers
- Dive into key Flume components, including sources that accept data and sinks that write and deliver it
- Write custom plugins to customize the way Flume receives, modifies, formats, and writes data
- Explore APIs for sending data to Flume agents from your own applications
- Plan and deploy Flume in a scalable and flexible way—and monitor your cluster once it’s running
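To make the first point in the list above concrete, here is a minimal conceptual sketch of a bounded buffer sitting between a bursty producer and a consumer. It is not Flume code and does not use any Flume API: the function name `run_pipeline`, the `capacity` parameter, and the use of Python's standard `queue.Queue` as a stand-in for a Flume channel are all assumptions made for illustration.

```python
import queue
import threading


def run_pipeline(bursts, capacity=100):
    """Push bursts of events through a bounded buffer.

    Returns the events in the order the consumer received them.
    """
    # Hypothetical stand-in for a channel: a bounded FIFO buffer.
    channel = queue.Queue(maxsize=capacity)
    delivered = []
    done = object()  # sentinel that tells the consumer to stop

    def source():
        # The producer emits events in bursts. put() blocks while the
        # buffer is full, so the producer is slowed down instead of
        # overwhelming the consumer.
        for burst in bursts:
            for event in burst:
                channel.put(event)
        channel.put(done)

    def sink():
        # The consumer drains the buffer one event at a time.
        while True:
            event = channel.get()
            if event is done:
                break
            delivered.append(event)

    threads = [threading.Thread(target=source), threading.Thread(target=sink)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return delivered
```

With one producer and one consumer, events come out in the order they went in, even when the buffer is smaller than a burst. For example, `run_pipeline([[1, 2, 3], [4, 5]], capacity=2)` delivers all five events in their original order.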
Table of Contents
Preface
1. Apache Hadoop and Apache HBase: An Introduction
-HDFS
-Apache HBase
-Summary
-References
2. Streaming Data Using Apache Flume
-The Need for Flume
-Is Flume a Good Fit?
-Inside a Flume Agent
-Configuring Flume Agents
-Getting Flume Agents to Talk to Each Other
-Complex Flows
-Replicating Data to Various Destinations
-Dynamic Routing
-Flume’s No Data Loss Guarantee, Channels, and Transactions
-Agent Failure and Data Loss
-The Importance of Batching
-What About Duplicates?
-Running a Flume Agent
-Summary
-References
3. Sources
-Lifecycle of a Source
-Sink-to-Source Communication
-HTTP Source
-Spooling Directory Source
-Syslog Sources
-Exec Source
-JMS Source
-Writing Your Own Sources*
-Summary
-References
4. Channels
-Transaction Workflow
-Channels Bundled with Flume
-Summary
-References
5. Sinks
-Lifecycle of a Sink
-Optimizing the Performance of Sinks
-Writing to HDFS: The HDFS Sink
-HBase Sinks
-RPC Sinks
-Morphline Solr Sink
-Elastic Search Sink
-Other Sinks: Null Sink, Rolling File Sink, Logger Sink
-Writing Your Own Sink*
-Summary
-References
6. Interceptors, Channel Selectors, Sink Groups, and Sink Processors
-Interceptors
-Channel Selectors
-Sink Groups and Sink Processors
-Summary
-References
7. Getting Data into Flume*
-Building Flume Events
-Flume Client SDK
-Embedded Agent
-log4j Appenders
-Summary
-References
8. Planning, Deploying, and Monitoring Flume
-Planning a Flume Deployment
-Deploying Flume
-Monitoring Flume
-Summary
Index