We ran the Allthethings Prioritization Matrix on our team's On-call Alerts!

July 23, 2021

An alert is a specialized log from a software component in a computing system that indicates a problem. Tools like Jira Service Management (JSM) and Opsgenie help us manage alerts and avert incidents. When an alert does turn into an incident, they help us mitigate it as well.

An on-call is an engineer who is supposed to Keep The Lights On. They stay in the lighthouse, on the lookout for alerts, and prevent issues that might impact the customer. Our teams follow a very similar process.

Our alert process has been refined over the years. We have retired multiple alerts, reduced the occurrence of many others, and we know the common alerts we see by intuition and, even more so, by runbooks.

The Allthethings Prioritization Matrix, also known as the "Eisenhower Matrix", is a tabular system for sorting tasks, issues, alerts, and anything else that carries information into an order of attack, based on the urgency and importance of each item.

Based on that information, we can categorize things as Urgent or Not Urgent, and as Important or Not Important, which leads us to 4 kinds of tasks.

Urgent and Important: Fix it right now! It is crucial.
Urgent and Not Important: Fix it right now. It is not important, but it is still a problem.
Not Urgent but Important: It is important to fix, maybe not today, but whenever we have time.
Not Urgent and Not Important: We don't need to fix it.

Source: Picture from WikiMedia Commons by Davidjcmorris
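
The four categories above can be expressed as a tiny lookup on two yes/no answers. This is only a minimal sketch: the names `Quadrant` and `classify` are made up for illustration, and deciding whether an alert is urgent or important is still a human judgment call.

```python
from enum import Enum


class Quadrant(Enum):
    # Labels follow the four categories listed in the post.
    URGENT_IMPORTANT = "Urgent and Important"
    URGENT_NOT_IMPORTANT = "Urgent and Not Important"
    NOT_URGENT_IMPORTANT = "Not Urgent but Important"
    NOT_URGENT_NOT_IMPORTANT = "Not Urgent and Not Important"


def classify(urgent: bool, important: bool) -> Quadrant:
    """Map the two yes/no answers onto one of the four quadrants."""
    if urgent and important:
        return Quadrant.URGENT_IMPORTANT
    if urgent:
        return Quadrant.URGENT_NOT_IMPORTANT
    if important:
        return Quadrant.NOT_URGENT_IMPORTANT
    return Quadrant.NOT_URGENT_NOT_IMPORTANT


# Example: an alert that pages someone but does not affect customers.
print(classify(urgent=True, important=False).value)
```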

Questions to ask

Well, in case you encounter stuff in these categories, you gotta ask yourself the following questions:

If it's Urgent and Important: Why did we not realise it before? Did it become urgent and important suddenly? What is the way to fix this?
If it's Urgent and Not Important: Why is this even here? Do we have a way to never have this unimportant thing again?
If it's Not Urgent and Important: When will we do this? When it becomes urgent and important? No, right? The best time to plant a tree was 25 years ago and the second-best time is right now!
If it's Not Urgent and Not Important: Why is this here!? Can we please please remove it?

How did we run it?

We collected 6 months of our closed alerts
We sorted them by alias
We prioritised our existing alerts against the matrix by voting
We readjusted priorities accordingly
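
The steps above could be sketched roughly as follows. This is an illustration under assumptions, not the tooling we actually used: it assumes the closed alerts were exported as a list of dictionaries with an "alias" key, and that each teammate's vote is recorded as a quadrant label. The function names and sample data are hypothetical.

```python
from collections import Counter


def count_by_alias(alerts):
    """Count how often each alias occurs in the exported closed alerts."""
    return Counter(alert["alias"] for alert in alerts)


def majority_vote(votes):
    """Pick the most-voted quadrant label for each alias.

    On a tie, Counter.most_common keeps the label that was seen first,
    so a real process would want an explicit tie-break discussion.
    """
    return {
        alias: Counter(labels).most_common(1)[0][0]
        for alias, labels in votes.items()
    }


# Hypothetical sample data, for illustration only.
alerts = [{"alias": "disk-full"}, {"alias": "cpu-high"}, {"alias": "disk-full"}]
votes = {
    "disk-full": ["Urgent and Important", "Urgent and Important",
                  "Not Urgent but Important"],
    "cpu-high": ["Urgent and Not Important"],
}

for alias, count in count_by_alias(alerts).most_common():
    print(alias, count, majority_vote(votes)[alias])
```

Sorting by frequency first means the vote is spent on the noisiest alerts before the rare ones.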

Outcomes

Time-based patterns
Reprioritisation
Action Items for long recurring alerts
Tagging of actionable and non-actionable items
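
As one way to look for time-based patterns, alerts can be bucketed by the hour they fired. This is a minimal sketch that assumes each alert carries an ISO 8601 creation timestamp; the function name and the sample timestamps are hypothetical.

```python
from collections import Counter
from datetime import datetime


def alerts_per_hour(timestamps):
    """Return a 24-element list with the number of alerts per hour of day."""
    hours = Counter(datetime.fromisoformat(ts).hour for ts in timestamps)
    return [hours.get(hour, 0) for hour in range(24)]


# Hypothetical timestamps, for illustration only.
sample = ["2021-03-01T02:15:00", "2021-03-02T02:40:00", "2021-03-02T14:05:00"]
histogram = alerts_per_hour(sample)
print(histogram[2], histogram[14])
```

A spike at the same hour every day often points at a scheduled job rather than a real problem.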
