As the Internet titans struggle to find more cost effective methods of keeping their datacenters cool, while simultaneously lowering their impact on the environment, I can’t help but smile. No I’m not insensitive the impending crisis that is global warming – to the contrary, I am a big believer in thinking globally and acting locally. It’s just that those of us in the field of BSM have already been executing on this global-local philosophy for years:sweeping, expensive measures aren’t always the best remedy – sometimes you just need to improve how you use what you already have.
Several years ago I was asked to provide some consulting services to a customer, an unnamed manufacturer, with a plant in Canada that desperately needed a solution to a simple but a very expensive problem. The servers in the datacenter that hosted the
technology controlling the operation of the assembly line had to maintain a temperature that did not vary above or below a (+/-) 5 degree Celsius. If it did the servers would automatically shut themselves down to prevent overheating. With a vehicle rolling off the production line every 27 seconds, any stoppages on the manufacturing line had a clear and measurable impact on revenue.
Some proposals called for grand
schemes to overhaul the datacenter with sweeping and sophisticated new solutions…until I arrived and offered a simple answer: integrate the tools you already have to develop an early warning system to monitor the temperature in the datacenter.
The HVAC systems already installed in the datacenter had onboard SNMP servers, which allowed a periodic polling of the datacenter temperature at three different points. The outcome of this poll was correlated to the required mean temperature in the datacenter – and a
severity level for the production floor then conveyed to the Center. For example, if the temperature varied
more than 2 degrees Celsius either way, IT operations staff was automatically paged and the manager on duty received an email on his blackberry. In just a week after installing this very simple early warning system, the number of production line stops was reduced by over 90 percent.
This situation was perfect for a BSM solution which is designed to integrate and correlate IT and business data and visualize it in a way that is meaningful to the business. More importantly, it exemplifies what is often symptomatic of a larger problem: providing a way for IT to understand that given current course and speed, a crash will occur in X-number of minutes. Whether it’s global warming or cooling the datacenter, sometimes we already have the ends of a solution in our hands – we just need to tie them together.
-- Jonathan