Monitoring with Ganglia: Tracking Dynamic Host and Application Metrics at Scale Contributor(s): Massie, Matt (Author), Li, Bernard (Author), Nicholes, Brad (Author) |
ISBN: 1449329705 ISBN-13: 9781449329709 Publisher: O'Reilly Media OUR PRICE: $28.49 Product Type: Paperback Published: December 2012 |
Additional Information |
BISAC Categories: - Computers | System Administration - General - Computers | Internet - General - Computers | Software Development & Engineering - Quality Assurance & Testing |
Dewey: 006.312 |
Physical Information: 0.58" H x 7.02" W x 9.06" (0.91 lbs) 254 pages |
Descriptions, Reviews, Etc. |
Publisher Description: Written by Ganglia designers and maintainers, this book shows you how to collect and visualize metrics from clusters, grids, and cloud infrastructures at any scale. Want to track CPU utilization from 50,000 hosts every ten seconds? Ganglia is just the tool you need, once you know how its main components work together. This hands-on book helps experienced system administrators take advantage of Ganglia 3.x. Learn how to extend the base set of metrics you collect, fetch current values, see aggregate views of metrics, and observe time-series trends in your data. You'll also examine real-world case studies of Ganglia installs that feature challenging monitoring requirements.
Contributors include: Robert Alexander, Jeff Buchbinder, Frederiko Costa, Alex Dean, Dave Josephsen, Peter Phaal, and Daniel Pocock. Case study writers include: John Allspaw, Ramon Bastiaans, Adam Compton, Andrew Dibble, and Jonah Horowitz. |
Contributor Bio(s):
Li, Bernard: - Bernard Li is a High Performance Computing (HPC) Systems Engineer at Lawrence Berkeley National Laboratory. He is currently one of the maintainers of the Ganglia project. He has been involved with HPC since 2003 and has worked on open source projects such as OSCAR, SystemImager, and Warewulf.
Massie, Matt: - Matt Massie open-sourced Ganglia in 2000 while working as a Staff Researcher at the University of California, Berkeley. He designed Ganglia to monitor a shared computational grid of clusters distributed across the United States for scientific research. In 2010, he contributed a chapter on cluster monitoring for the O'Reilly book "Web Operations: Keeping the Data On Time" by John Allspaw and Jesse Robbins. Matt is currently a software engineer at Cloudera focused on Apache Hadoop enterprise management and monitoring.
Nicholes, Brad: - Brad Nicholes is a member of the Apache Software Foundation and is currently working as a Consultant Software Engineer for NetIQ. In addition to being a committer on the Apache HTTPD and APR projects, Brad is a developer and one of the administrators of the Ganglia project. As a developer on the Ganglia project, Brad developed and introduced the C/C++ and Python metric module interface into Ganglia 3.1.x. He also developed and contributed several of the initial metric modules that currently ship with Ganglia. Brad attended school at the University of Utah and Brigham Young University and holds a degree in Computer Science.
Vuksan, Vladimir: - Vladimir Vuksan (Broadcom) has worked in technical operations, systems engineering, and software development for over 15 years. Prior to Broadcom, he worked at Mocospace, Rave Mobile Safety, Demandware, and the University of New Mexico, implementing high-availability solutions and building tools to make managing and running infrastructure easier. |