Join Free

Research and publish the best content.

Devops for Growth

107.5K views | +0 today

Tags
Current selected tag: 'golden signals'. Clear

api - web services 1

cloud 1

golden signals 6

Kubernetes 1

monitoring 3

RED - Rate Erros & Duration 1

SRE - Site Reliability Engineering 2

USE - Utilization Saturation & Errors 1

Technology

Devops for Growth

For Product Owners/Product Managers and Scrum Teams: Growth Hacking, Devops, Agile, Lean for IT, Lean Startup, customer centric, software quality...

Curated by Mickael Ruau

Your new post is loading...

Scooped by Mickael Ruau

Scoop.it!

From www.slideshare.net - April 13, 2020 2:43 AM

No comment yet.

Scooped by Mickael Ruau

Scoop.it!

From www.back2code.me - April 9, 2020 2:41 AM

Mickael Ruau's insight:

The additional mandatory metrics is availability. Everyone will always inquire about availability. Availability can be computed in various ways from incidents duration to formulas built from other metrics. My advice is to track it through dedicated availability tests. Availability tests are a kind of smoke tests from a simple test (perform a connection to a database) to more complex tests involving several operations performed in black-box mode (Testing externally visible behaviour as a user would see it). Start simple and improve it them each time the test is not representative of the availability. In this case the availability is expressed as a percentage of time when the service is available (when the test is OK) over the total time of the measure. It’s also possible to measure a degraded availability when the test ends in WARNING–for example when the result is OK but late. There is always a lot of discussions around availability mainly concerning the downtime for maintenance or when the team is out of office. This topic deserve a dedicated article.

No comment yet.

Scooped by Mickael Ruau

Scoop.it!

From sysdig.com - April 1, 2020 2:50 AM

No comment yet.

Scooped by Mickael Ruau

Scoop.it!

From blog.netsil.com - April 9, 2020 2:56 AM

No comment yet.

Scooped by Mickael Ruau

Scoop.it!

From medium.com - April 2, 2020 3:13 AM

Mickael Ruau's insight:

There are three common lists or methodologies:

From the Google SRE book: Latency, Traffic, Errors, and Saturation
USE Method (from Brendan Gregg): Utilization, Saturation, and Errors
RED Method (from Tom Wilkie): Rate, Errors, and Duration

You can see the overlap, and as Baron Schwartz notes in his Monitoring & Observability with USE and RED blog, each method varies in focus. He suggests USE is about resources with an internal view, while RED is about requests, real work, and thus an external view (from the service consumer’s point of view). They are obviously related, and also complementary, as every service consumes resources to do work.

For our purposes, we’ll focus on a simple superset of five signals:

Rate — Request rate, in requests/sec
Errors — Error rate, in errors/sec
Latency — Response time, including queue/wait time, in milliseconds.
Saturation — How overloaded something is, which is related to utilization but more directly measured by things like queue depth (or sometimes concurrency). As a queue measurement, this becomes non-zero when you are saturated, often not much before. Usually a counter.
Utilization — How busy the resource or system is. Usually expressed 0–100% and most useful for predictions (as Saturation is probably more useful). Note we are not using the Utilization Law to get this (~Rate x Service Time / Workers), but instead looking for more familiar direct measurements.

No comment yet.

Scooped by Mickael Ruau

Scoop.it!

From aspetraining.com - March 27, 2020 3:56 AM

No comment yet.

Devops for Growth

Popular Tags

How to Monitoring the SRE Golden Signals (E-Book)

The 4 Golden Signals + 1 - Back 2 Code

How to monitor Golden signals in Kubernetes.

The 4 Golden Signals of API Health and Performance in Cloud-Native Applications

How to Monitor the SRE Golden Signals - Faun

System Monitoring in the Age of Site Reliability Engineering | ASPE