SEARCH 


Recently in SLM Category

As a follow up to my recent post titled “End User Experience Monitoring as lynchpin for BSM,” I spoke with Eden Shochat, CTO of Aternity, to learn more about their offerings in this space and discuss unique contributions they could make as part of a larger BSM strategy. For those who aren’t familiar with them, Aternity provides a new class of Application Performance Management software via its Frontline Performance Intelligence Platform.

An excerpt of our very interesting discussion is presented below:

Abbas: I’ve described the role that EUEM solutions like Aternity’s Frontline Performance Intelligence play in the context of BSM, but what is unique about your technology in this area?

Eden: Aternity’s Platform fuses application, desktop and user performance monitoring with real-time business intelligence. This approach generates the most accurate and comprehensive user experience information from multiple levels of the network and application stack on the end-user machines (physical or virtual). By combining this unique data with correlation, clustering and anomaly detection analysis algorithms, Aternity’s Platform is able to perform preemptive problem detection, usage & usability analysis, fact-based capacity planning and activity-oriented compliance. 


Abbas: What is the typical deployment architecture for one of your Global 2000 customers? Does the solution require agents, appliances, or a combination? 

Eden: There is basically a four-tiered architecture approach we follow during the course of our enterprise deployments: 

> The Microsoft Certified Agent(s) collects information and measures the performance of the desktop, applications and user productivity 
> Next, Aggregation services communicate with the Agent(s) to aggregate the measurements and further compress the traffic 
> Then, Analytics Services perform analysis on any incoming data, such as activity usage metrics, and clusters similar data together, performing anomaly detection and correlating endpoints with similar characteristics to locate probable cause 
> Finally, the architecture is supported by a Management Console and a historical database store for enabling the management, configuration and interactive drill-down into specific user experience business intelligence data. 

The aggregation, analytics and management tiers can all run on the same server or they can be split into multiple servers, scaling horizontally or vertically to support tens of thousands of monitored users. 

Unlike appliances which are located within the data center, the Microsoft Certified Agent(s) resides on the end-point where the service is consumed, providing for an in-depth level of accuracy that is impossible to achieve with sniffer based technologies. Additionally, Agent distribution is performed as a software update versus having to distribute multiple hardware appliances.
 
And, by having aggregation separated, the architecture can more easily support distributed models. These could for example include those found in the oil and gas industry, where some of the services that are consumed there are behind very high cost vsat links. Bank branches are another good example, where some of the network and application services are local and don’t go to through a general corporate network.
 
Abbas: When monitoring performance of applications, do you treat them all the same (i.e. agnostic) or are their specialized analyses for common applications such as Exchange, SAP, and Web?
 
Eden: The Aternity Agent performs both protocol agnostic monitoring, supporting virtually all applications as well as technology specific monitoring. This includes:
 
> Generic network Cartridge, supporting any request à response type protocol, e.g: Java RMI, CIFS, 3270 or other, unpublished protocols.
> HTTP/s Cartridge supporting any HTTP-based application, web or otherwise, without requiring the secure breaches by appliance-type key management
> Win32 Client/Server Cartridge: Passive monitoring of any win32 user interface application, be it .NET forms, Powerbuilder or plain vanilla win32 programming.
> Oracle EBusiness Cartridge: Generic monitoring of JInitiator and JDK based form applications, including customizations performed by customers.
> Java: Monitor any Swing-based (AWT is supported by the Win32 Cartridge) applications and applets
 > Server-based Computing ICA/RDP Support: We monitor both the latency of the screen refreshes as well as the actual applications on the Citrix/Terminal Server for published desktops and applications.
 
The support for technology-based instrumentation means that most (if not all) of the applications, shrink wrapped or custom, can be monitored by the Aternity Platform.
 
In addition, the Agent collects environmental information, e.g: network statistics, process information (including crashed and hung processes, user activity) and operating system, service packs, installed applications & patches. The agent is a Windows service monitoring the endpoint providing insight into the network, desktop, server-based computing protocols.
 
This monitoring can be applied to standard desktop/laptop clients, server-based computing environments like Citrix XenApp and virtual desktop infrastructure (VDI) deployments, all this with incredibly minimal footprint of less than 10MB of physical RAM and under 0.1 percent CPU on average.
 
Abbas: Analytics seems to play a prominent role in how Aternity positions itself. Can you go into more detail about how it works and the value that provides to your customers?
 
Eden: Attempting to understand or derive business intelligence from volumes of end user performance metrics is like looking for the proverbial, “needle in the haystack”. Sophisticated, real-time analytics are therefore necessary to truly bring about what we call, Frontline Performance Intelligence.
 
The issue that plagued the early attempts of monitoring user experience is having the capability to transform huge volumes of data into actionable intelligence. Previously, organizations would try to lessen the flow of data from end users’ desktops by only supporting partial deployments of Application Performance Management (APM) technologies. These deployments would be applied to PCs that were exhibiting performance issues. This mode of operation prohibits on-going introspection of user productivity and experience.
 
By collecting a comprehensive set of end user performance and productivity metrics at the Frontline, and processing this data with analytics, the Aternity Platform generates Frontline Performance Intelligence from real frontline performance metrics. The analytical components in the Aternity algorithm engine include:
 
Autonomic Performance Profiling: An Autonomic Performance Profile™ is the mathematical model used for automatic, real-time identification of groups of homogenous users sharing the same behavior at a particular time, and is used to quantify, detect and distinguish between normal and abnormal behavior.
 
Deviation detection: Autonomic Performance Profiles were designed to provide the earliest possible detection of performance problems that impact multiple users while simultaneously eliminating the need for manual alert configuration and tuning, which many other products in the market require. The Analytic Engine performs continuous correlation between the real-time performance measurements captured by the Agents, and the Baselines of the Autonomic Performance Profiles. In this way, performance deviations of any magnitude can be automatically detected, for groups of users of any size, with no manual configuration and/or intervention.
 
Problem Minimization: Each of the detected symptoms is analyzed for commonalities to tie multiple symptoms together into a problem. This has been shown to greatly reduce the number of alerts going to the IT operations.
 
Problem Isolation through Endpoint Classification: End users with like symptoms are first grouped together into an “Effect group”, and an alert is raised. The analytic engine then automatically identifies the end users’ unique commonalities, with two levels of correlation, across the effect group:
 
1. Positive Correlation: the attributes that the affected group have in common
2. Negative Correlation: the attributes common to the effect group that are also common to the non-affected user groups
 
The intersection of these two correlations, i.e. the Query Group and the Effect Group is shown as the “Match Group” above. The attributes that produce the strongest Match Group are “surfaced” as a Probable Cause. Any attributes collected by the Aternity Agent (e.g.: the amount of memory, installed application or the subnet where the endpoint resides) may be used for Dynamic Problem Isolation, i.e. Probable Cause Analysis.
 
Abbas: Given that FPI may well be the early warning system that companies would rely on to get ahead of end user performance issues, what mechanisms do you provide by which another management platform can gain access to the results of the analytics so that they can be presented inside of a BSM view?
 
Eden: When we designed the Aternity Platform, it was clear that we are generating a new type of a data stream - user experience combined with activity data. As such, we architected the system to be totally open. The system components communicate over a message bus among themselves. And, the complete database schema is open, documented and simple for custom-built reports. The problem detection analytics are exposed through our object-oriented Problem Life Cycle Manager and CLI layers.
 
Some of the existing integrations at customers include to Ticketing Systems (CA, BMC), Portals (IBM WebSphere, Microsoft Sharepoint), SNMP alert systems (HPOV) and other proprietary systems.
 
Contact information for Aternity is available here.


Bits and bytes from itSMF Fusion 2008

| Comments (0)

 
CindywinstheWii (Small).jpg
We had a great week at the itSMF Fusion 2008 show in San Francisco this past week – certainly time well spent. We had some really insightful conversations with current and prospective customers, engaged a handful of analysts, scoped out the competition and sat in on a handful of very interesting sessions which unveiled some rather unique data points.

For example, in one session on CMDBs, in excess of 50% of the audience (by show of hands) said they were currently implementing a CMDB. Two-percent admitted they were on their second try having failed the first time around. No surprise integration was routinely cited as the main culprit and that’s an area Managed Objects has certainly mastered.

StackSafe has posted some notes from the show here – and we’d like to offer some bits and bytes – mostly paraphrases – as well:

>> Congratulations to Cindy from Hallmark (photo nearby) who won our Wii raffle.

>> IT is good at measuring performance, but poor at measuring quality.  A help desk that aims to solve 60% of incidents on the first call is really just encouraging staff to close a ticket with a poor answer and reopen a new one with another call. – Malcolm Fry, “CIO and the 366 Degree Circle

>> Roughly 10% of the audience raised their hand when asked “do you know what BSM is?” – Lisa Erickson Harris, “BSM and Best Practices, Elevating the Role of the Service Desk”

>> IT investments will continue to grow, but they must either produce cost savings in the supply chain or improve the customer experience – Charlie Feld “Enabling 21st Century Business Model with IT”

>> A well run IT department is like air – it’s taken for granted. – Dennis Ravenelle, IT Service Continuity Management, Where do I start?

>> “An inaccurate CMDB is worse than no CMDB.” – Richard Peasley, Building Decision Support Systems that Work

- Abbas


B-SaaS-M

| Comments (0)

For contemporary computer usage, the SaaS model has many attractive advantages that account for its growing popularity. For instance SaaS avoids the initial up front cost of purchasing software and the associated hardware; the ongoing maintenance, and salary costs for employees to maintain a new systems are all avoided.

At the expense of IT as competitive differentiation, these cost advantages are coupled with the fact that many types of software applications have been commoditized: networking speed and availability are trusted, centralised resources are shared across many users and afford higher levels of security, fault tolerance, disaster recovery and access greater expertise. Plus, SaaS examples like Salesforce.com demonstrate the credibility of the concept. So potentially you end up with less cost, less worries and higher quality of service – sounds good eh?

As the business model has matured the SaaS vendors are using increasingly sophisticated techniques to optimise their business, optimise client usage of resources, and deploy up selling techniques. All of this demonstrates the early maturity of a successful business concept.

Users have the right to expect SaaS providers will deploy best practices and the best technology in order to ensure security, high availability, good performance and continual improvement to ensure the quality of service (QoS) they require. Business Service Management (BSM) has an important role to play because of its unprecedented ability to provide service management the proven best practice way of managing IT systems.

SaaS systems, like all IT systems, will be subject to change, for example, upgrades, system expansion and maintenance – and incidents will also occur causing user-impacting problems. To ensure the viability of SaaS is maintained as these systems grow in size and complexity with their increasing popularity then SaaS providers need automated management solutions that can scale to cope with demand, size and complexity.

BSM components like Service Level Management (SLM) and Service Contract Management provide automated and proactive ability to ensure the QoS consumers expect. Since they provide automated root cause analysis of QoS threatening issues. Whereas, BSM’s Configuration Management System (CMS) ensures the risk of change is minimised by making change management aware of the potential impact of a proposed change.

We anticipate as SaaS becomes more popular, more complex and more competitive between SaaS providers. As such we predict that we will see the same rapid adoption of BSM as we have seen by service provision vendors.

 


SLAs Just Don't Matter...

| Comments (0)

I see the title grabbed your attention in these days where there is a lot of talk about managing services, writing service catalogs, defining the managing service level agreement (SLA), SLA reporting and the list goes on. Why is there so much ado about SLAs? Let me explain.

When we engage in a service offering with someone on the outside, we expect a certain level of availability, performance and responsiveness when something does go wrong. This is the basis of the SLA and the agreement we must enter when contracting outside services and these do matter.

Internal SLA’s have long since been hard to calculate, report and are just that...after the fact reporting, much like financial reporting after the quarter. Depending upon faith of accuracy, the reports tend not to be very meaningful in light of the cost of developing and generating them in a timely manner.

In addition, as an internal service provider, the penalty for missing the objective tends to go two ways: it’s either nothing or very high with the threat of an outsourcer knocking on the CEO’s door.

What would make all this effort more relevant? The ability to understand what drives the organization, business KPI’s, managing impact based upon business relevance, historical trend analysis for purposes of positive changes and proactive management before the end of the reporting period. Internal SLAs can be invaluable as a tool to understand the business and manage the supporting technology in alignment to the business of the organization, however, most look at them as after the fact reporting.

SLAs don’t matter when used just to report, SLAs can be invaluable when used to manage and drive the business!


Long time CMP writer and veteran technology watcher Penny Crosman recently blogged about a flood of new start-ups developing solutions aimed at reducing data latency for financial institutions. For example, one provider offers “software that sits on network switch mirror ports and monitors the traffic going by, combined with correlation algorithms that, the company says, can pinpoint latency to the box level.”

While such a tool may pinpoint latency to box level, component-based management only takes you so far. The next step absolutely must be to put this information in a business context - to understand the effect of any single box’s latency on IT’s ability to deliver business services. This is important because financial institutions often have tens of thousands of servers (among other components) in their enterprise, which means potentially tens of thousands of latency alerts. Without business context, the question becomes, ‘which latent box do you fix first?’

One way would be to incorporate Business Service Management (BSM) solutions to allow IT operations to make this determination quickly and easily. BSM manages IT from a holistic, service perspective, and dynamically links the underlying IT components according to the business service they provide – so, for example, IT operations can place the latency at a box level in the context of business impact. Since BSM platforms can integrate to a range of heterogeneous IT management tools from performance monitoring to service desks, it may prove more useful than trade data latency monitoring tools.

In many ways because of their reliance on technology, financial services institutions were among the early adopters of BSM technology in the late 1990s. Today, BSM has evolved to include a range of modular components to include discovery, service level management (SLM) and the configuration management database (CMDB) or as ITIL perhaps more aptly calls it a configuration management system.

-- Jim


IT budgets not all doom and gloom

| Comments (1)

Oh, what doom and gloom. The forecasts for IT spending seems destine for more cuts. The Incredible Shrinking Tech-Spending Forecasts laments the headline from The Wall Street Journal. While the Journal is technically correct, several IT research firms have indeed lowered their IT budget forecasts, it overshadows some important details.

As our CTO pointed out in a recent article, AMR Research reported that IT departments plan to spend 9.3% more on performance improvement than in the previous year, which contrasts with plans for an overall IT spending increase of 5%. One pundit’s well-reasoned analysis is that IT thinks it “can deliver almost twice as much bang this year for each new IT buck compared with their colleagues in the wider business.”

This jives with other analyst forecasts including Enterprise Management Associates. EMA has a soon-to-be released report indicating that spending on Business Service Management (BSM) is “poised to surpass” is more mature precursor of Service Level Management (SLM).

All this goes to show that IT simply spending its pennies more wisely in an effort to meet its two lasting, if not dichotomous, priorities to improve service quality and maintain or lower costs.

- Randy Jones


Good or Great IT?

| Comments (0)

In a recent article in CIO Magazine, “IT departments do not deliver ‘great’ IT, say CIOs”, a survey of CIOs and IT Directors revealed that almost half do not deliver “great” IT. “Great” IT is defined by 3/4ths as being able to to add real value to the organizations strategy and bottom line.

I am always amazed when I speak to leaders of IT organizations for enterprise organizations that may know what sector of the market their organization falls into, but cannot describe to me what the business units do day-to-day and how they determine success for the business unit. I find often times I may know more by reading annual reports and surfing the company websites prior to a meeting than the IT leaders I meet within the organization. So I ask how would an IT organization believe it should have a say in strategic decision making within the organization without basic knowledge of what drives and grows the business?

Alignment to the business has almost become a passé phrase due to the lack of meat behind the initiative and the continued siloed, technology focus of many IT organizations. True alignment will only be achieved when IT organizations immerse themselves into the business and become fully integrated. A recent Information Age article quotes: “There is no such thing as an IT project: all IT’s activity should be about business projects.” The only way an IT organization can better manage change and impact is for it to be immersed and integrated in the business. This is not to suggest that IT organizations become distributed by business units, rather there are emerging roles of service managers that work side by side with the business teams on projects leveraging technology to the success of the business.

I often work with organizations on “IT projects” and the business case for an “IT project”. These are always problematic because most are “soft” in their cases of cost management. Cost saving projects that reduce hardware and software licenses are easy cases with hard savings. It’s the CMDB, SLA, ITIL process improvement, etc. projects that are difficult and I would point the finger at the root of the problem, these are typically internally, IT focused projects without links back to the business they support. I’m often looked at like I turned green and grew horns if I ask questions like, “How does this improve performance of the business? What impact on the business will this project have?”

I find ITSM (IT Service Management) projects that have a focus on managing business impacting events and can be quantified as such for business driving applications are easy cases. BSM (Business Service Management) projects that also bring in business data providing analytics regarding the effectiveness of the business providing information during the quarter versus a financial report after the fact after the quarter also have greater success. My humble advice is to always error on the side of the business and find the high impact services and understand how the technology can be better applied or improved.

IT has a difficult job in managing two sides of a coin: efficiency internally managing costs and effectiveness externally improving the business. Often times IT organizations error on what we know best, IT, but by better understanding the impact to the business and integration to the business provides the greatest benefits all the way around the business speeding up projects and decision making of projects.

– Michele


It's a smaller world

| Comments (0)

Anyone who has ever had to hike a long mile to the nearest gas station to call AAA will attest to the fact that the world can sometimes seem very large. These days it is almost unheard of for someone to have to walk anywhere to make a call. Last month I arrived at the scene of a car accident and my first action, after ensuring the occupants of the car were safe, was to dial 911 from my blackberry.  The mobile device also provided me with the altitude, latitude and longitude of the accident to better inform first responders.

The world has gone mobile. Nobody can dispute this fact.

The ramifications of a mobile world are broad. Motorola’s fad has turned into a cultural phenomenon with rural villages in developing countries now having mobile phone service, but no sewage. The steady buzz of the blackberry can be heard everywhere from ski lifts in Colorado to weddings in India. In major economies, the workforce is now steadily becoming more mobile, where just a few years ago – it was unheard of to work from home. These days I work out of my home full time, for a company based on another coast, and hook into client systems half a world away on a daily basis.

The advent of the mobile workforce has created a huge infrastructure that requires a somewhat high level of maintenance. If I work remotely in Denver our IT staff in Virginia need to be able to push patches to my laptop, maintain my blackberry and keep my VPN secure, all without affecting my ability to produce the work for which I was hired to do in the first place. Stuck on a client site with no access to the internet, but need to urgently send an email? No problem. Pull out the blackberry and the email has been securely forwarded in a matter of seconds.  Need to quickly get online and log into a clients system to fix an urgent production issue? No problem, the mobile broadband card that came with your laptop will get you on there in seconds.

But what happens when things break? What if security has been compromised or if the new patch that Microsoft just put out renders other work critical software inoperable? What then? I can no longer log into my network let alone my clients’ networks.

This drives the need for a high level of automation, exception based monitoring and what the medical field might call Preventative Care. IT is now a business service and must be treated as such.

Working for a company that pioneered BSM puts me in a unique situation, as I watch this transformation firsthand. In some cases the transformation has been dramatic, with entire workforces becoming mobile. The fact that this has made these workforces more productive is no surprise and to maintain this trend, it is essential that corporations move towards practices like exception based monitoring, Service Level Agreements and proactive interdiction. The future of the mobile workforce lies in managing the business of IT as a service to its customers.