The Start of the Monitoring Journey in DevOps

Under pressure to innovate and deliver rapid updates to their applications, developers often place monitoring as an after-thought. In today’s world, where a slowdown can translate to customer attrition and an outage can lead to millions of dollars in losses and negative press, ensuring satisfactory response time has evolved to be one of the most critical needs in the early stages of development and is a common goal to ensure DevOps success.

The Shift-Left of Monitoring: To study emerging patterns of how and when DevOps teams start monitoring, we interviewed several DevOps squads. Almost unanimously the responses were:

  1. They care most about knowing, Is the application up and loading quickly? Are all critical user interactions with the application responding within acceptable standards?
  2. They would like to get this information proactively throughout the development life cycle so they can determine if any code check-in impacted the performance/
  3. They would like to monitor with minimum setup and configuration and don’t have the time to learn complex tools/

To satisfy these needs, synthetic monitoring becomes the starting point for monitoring.

Read more at :


Monitoring Pokemon Go

In July 2016, the world of gaming was taken over by a new phenomenon – Pokémon Go. Within a matter of days “augmented reality” became mainstream and the app, which was launched mainly in the US and Australia, overtook (link is external) Tinder and Twitter in the total number of downloads. Pokémon Go surpassed the wildest expectations of its creators, Niantic Labs, and then some.

With popularity comes scale, and with scale comes an overload of requests to the gaming servers. If you are not prepared enough, requests fail and users are frustrated. Frustrated with Pokémon Go crashes, laymen were talking about server status and memes were being created and circulated on social networks. Overnight, websites spun up just to report if the game was up or down in different countries. Being closely related to the APM space, my head was drawing up various ways in which Pokémon Go was perhaps addressing the issue and what monitoring they had to put in place to retain their popularity. Here is my list of probable solutions Pokémon Go could employ to improve the experience for their users and avid fans.

Read more here…

The DevOps Scorecard : Key Metrics to track DevOps team success (post on

Mid-last year our team switched from doing Agile to doing DevOps. As we forayed into the journey trying to learn about DevOps and practice it at the same time,  a lot of questions arose in the team.  How was this different from agile and most importantly How were we going to be successful?

That’s when we wrote down what would be the success criteria for our team: Ship code frequently without causing a customer outage.

As the team matured we started evaluating a more granular way to track success.  Could the team mantra be broken down into quantifiable success metrics that could be represented in a scorecard? Based on our experience the DevOps scorecard should contain these 9 metrics to track DevOps team success:

DevOps Scorecrd

Read more on

5 Techniques to troubleshoot app slow down (Guest Post on APMDigest)


Applications are getting more complex by the day. First you have the various hosting platforms that your app can span across like private cloud, public cloud, your own data center.

Second, you have applications for the web being accessed through different browsers and mobile apps being accessed from several hundred different devices and various device OSs.

Third, the same app is being accessed from around the world, 24X7.

Fourth, the number of users accessing apps have grown significantly requiring rapid scalability of the app’s infrastructure.

To top it all, users, today, have very little patience to deal with poor performance.

Application Performance Management (APM) tools have evolved over the last decade to cater to this complexity and yet be able to troubleshoot application performance issues quickly. Let us look at some of the key features and visualization techniques that are enabling quicker troubleshooting….   Read more on APMDigest

Stop users from abandoning your app. Use End User Monitoring

With the growth of web and mobile apps, the app owner’s focus has shifted from keeping his app infrastructure available to keeping his users happy. Hence, End User Experience Monitoring has evolved as one of the five functional dimensions of APM by Gartner (login required). In this blog post, I will take you through the fundamentals of End User Monitoring and what APM can do to ensure you detect app issues before they impact your end user.

Let’s start with a few basics. The most important metric of your user’s satisfaction is a healthy response time. Which means no matter what the geography, device, carrier or browser your end user is using, no matter if it’s peak time or idle time, the app does not slow down at any point in time…..

Read more on ServiceManagement360

Reduce Failed Deployments. Innovate Rapidly. Use App Diagnostics

In this age of relentless innovation, where web and mobile startups are changing the game everyday, the need for companies to stay ahead of the curve by continuously innovating and delivering, is key to survival. In order to get quick time to value companies are adopting the DevOps model which allows code to be deployed not in weeks or days, but several times a day. For example Etsy, a popular web marketplace, deploys 50 times a day and Netflix deploys hundred times per day.

Now for a second, think about the complexity behind this. Every end user interaction with a web or mobile application results in execution of several hundred or thousand lines of code running across a complex infrastructure that includes various backend components like databases, application servers, web servers and so forth in the cloud or in a data center.  To add to the complexity there are many developers constantly adding, enhancing code to the code base which is deployed by Operations team continuously, several times a day. How can the application owner ensure that every line of code he checks in is not causing a failure somewhere in this big ecosystem?  Can he catch problems before it impacts his end user? This brings into the picture tools that can help you monitor your application’s performance and availability and correlate that to code issues.

Read more here

5 Ways APM can solve your DevOps worries (Guest post on

This week at the interop conference in New York I had the chance to talk to a lot of IT managers transforming their organizations to adopt a DevOps culture. One of the key questions I heard repeatedly was – are there some best practices to break down the wall between Dev and Ops?  How do you get to a point where we can deploy code multiple times a day like Facebook , Amazon and Netflix without impacting our users’ experience ? How do we minimize the risk of deployment failures and the blame game that ensues? With the need to innovate and close the customer feedback loop quickly Ops is being asked to deploy code more rapidly than ever before.  The complexity of hundreds of developers updating code makes it a very sensitive environment. The last thing Operations wants is an outage leading to financial loss and the last thing development wants is to get a call in the middle of the night to figure out what code change caused it.

To have tight control over this fast paced environment we need Application Performance Management at every step. APM can help break down the Chinese wall between Dev and Ops and eliminate the finger pointing that happens during outages.