Devops for Growth
107.7K views | +24 today
Follow
Devops for Growth
For Product Owners/Product Managers and Scrum Teams: Growth Hacking, Devops, Agile, Lean for IT, Lean Startup, customer centric, software quality...
Curated by Mickael Ruau
Your new post is loading...
Your new post is loading...

Popular Tags

Current selected tags: 'USE - Utilization Saturation & Errors', 'monitoring'. Clear
Scooped by Mickael Ruau
Scoop.it!

The Why, How, and What of Metrics and Observability

The Why, How, and What of Metrics and Observability | Devops for Growth | Scoop.it
We have a varied tech stack ranging from the many services that power the cloud, from hardware to virtualization software. But with many moving pieces comes a need for observability.
Mickael Ruau's insight:

In addition to monitoring our services, we also monitor our infrastructure. As a former member of the team that maintained our container clusters, I noticed enormous benefits when leveraging the USE method: utilization, saturation, and errors. Coined by Brendan Gregg, the USE method allows one to solve “80% of server issues with 5% of the effort”.

Let us take a look at how we leveraged these metrics to monitor our Kubernetes clusters.

No comment yet.
Scooped by Mickael Ruau
Scoop.it!

The RED Method: A New Approach to Monitoring Microservices

The RED Method: A New Approach to Monitoring Microservices | Devops for Growth | Scoop.it
Monitoring microservices effectively still can be a challenge, as many of the traditional performance monitoring techniques are ill-suited for providing the required granularity of system performance. Now a former Google and Weave engineer has developed an approach, called the RED Method, that seems to be gaining favor with administrators. RED “encourages you to come to …
No comment yet.
Scooped by Mickael Ruau
Scoop.it!

How to Monitor the SRE Golden Signals - Faun

Site Reliability Engineering (SRE) is very popular lately, including the “Golden Signals” that you should be monitoring, but HOW do you actually get these data? This is a guide.
Mickael Ruau's insight:

There are three common lists or methodologies:

  • From the Google SRE book: Latency, Traffic, Errors, and Saturation
  • USE Method (from Brendan Gregg): Utilization, Saturation, and Errors
  • RED Method (from Tom Wilkie): Rate, Errors, and Duration

You can see the overlap, and as Baron Schwartz notes in his Monitoring & Observability with USE and RED blog, each method varies in focus. He suggests USE is about resources with an internal view, while RED is about requests, real work, and thus an external view (from the service consumer’s point of view). They are obviously related, and also complementary, as every service consumes resources to do work.

For our purposes, we’ll focus on a simple superset of five signals:

  • Rate — Request rate, in requests/sec
  • Errors — Error rate, in errors/sec
  • Latency — Response time, including queue/wait time, in milliseconds.
  • Saturation — How overloaded something is, which is related to utilization but more directly measured by things like queue depth (or sometimes concurrency). As a queue measurement, this becomes non-zero when you are saturated, often not much before. Usually a counter.
  • Utilization — How busy the resource or system is. Usually expressed 0–100% and most useful for predictions (as Saturation is probably more useful). Note we are not using the Utilization Law to get this (~Rate x Service Time / Workers), but instead looking for more familiar direct measurements.
No comment yet.
Scooped by Mickael Ruau
Scoop.it!

A Deep Dive into Kubernetes Metrics — Part 2 - FreshTracks.io

Selecting the metrics that reveal the the utilization, saturation and errors (the so-called USE method) for these core metrics are a great place to start. Brendan Gregg does an excellent job…
No comment yet.