The Metric Cluster Widget allows you to visualize the health level of each member of a cluster - giving you a quick way to check the health of a whole group, or "cluster," of machines.
Select the Cluster Widget:
Choose the Cluster (and set the threshold):
That's it - You get a grid of all the cluster members with color coded health levels.
When you view your Dashboard, you can hover over a cluster member to see the actual values, then click on it to jump to the check and begin your analysis.
The cluster health is calculated by taking 2 hours of time series data from each of the metrics and running a linear correlation against them. We then walk the matrix to determine the covariance and use a voting algorithm to allow those that are highly correlated to vote that others metrics are “bad.”
Basically, the health measurement is determined by how much the current signal looks like the majority of signals. We are continuing to improve, and once we refine this procedure a bit, the new procedure will be documented through the UI.