This operation doesn't solve every problem (no operation does, ha!), but it should solve many. Your top appliances by utilization are the most important ones to monitor, most of the time. Appliances trending towards 100% might be candidates for hardware upgrades. However, if an appliance isn't in the top 10 by utilization for that interval, then it means the other appliances in the organization all have average utilization lower than the least utilized appliance in the response. If your least utilized appliance is normally pretty low (e.g. under 50%) then it's pretty unlikely that hardware utilization is going to be a concern for any of the other sites. But you don't need to take my word for it! The endpoint supports t0 & t1 controls so you can slice the data (e.g. 1 day at a time) and build your own graphs. You can use those stats to establish a performance baseline (e.g. a typical utilization percentage) for those top 10. If you measure this over periods of time where you have known real-world usage spikes, then that will yield you some stats about how usage spikes can impact the appliance utilization. For example, imagine that the norm for a given office is 50% appliance utilization, but you've measured that a big team on-site event caused the utilization to spike to 75%. It'd be reasonable to predict that subsequent onsite events for that office on that hardware could trigger a utilization increase of about 25 percentage points. Generally, 75% isn't high enough to be concerning in most scenarios, especially if it's a temporary spike. On the other hand, if your "normal" utilization for that office creeps up to 75%, then you might predict that an onsite event could push the utilization (for that site and that hardware) to 100%. Then, that might be an actionable problem and/or a sign that it's time to upgrade the MX hardware at that location. Separately, if you correlate these stats with real-world user experience data (e.g. support cases for "slow Internet" or "slow VPN") then you can build some personalized metrics about which utilization percentages might be of concern to your users for specific models. For example, if users at a site are suddenly opening lots of support tickets about slow WAN/VPN etc., and you see that the utilization is 95%, then there might be a hardware bottleneck. All of this is to say that, if your top 10 by utilization are typically pretty low, then additional datapoints about the other appliances are probably not much of a concern. And separately, if your top 10 are usually pretty high, then you probably have enough data already to start addressing potential hardware bottlenecks that might be causing user experience issues.
... View more