Monthly Archive

How we page ourselves if incident.io goes down

Nov 27, 2024 By Lawrence Jones In Incident.io

Picture this: your alerting system needs to tell you it's broken. Sounds like a paradox, right? Yet that’s exactly the situation we face as an incident management company. We believe strongly in using our own products - after all, if we don’t trust ourselves to be there when it matters most, why should the thousands of engineers who rely on us every day? However, this poses an obvious challenge.

Incident.io

Weekly demo: Post-mortems in-app

Nov 27, 2024 By Incident.io In Incident.io

This week we walk through writing post-mortems in the app, from resolving the incident to building a comprehensive post-incident summary directly in-app.

Incident.io

Incident Management

Organizing ownership: How we assign errors in our monolith

Nov 18, 2024 By Martha Lambert In Incident.io

At incident.io, we run on a monolith. This brings a whole load of benefits that we don’t want to give up any time soon. We don’t have to worry about the speed of internal network requests, complex deployments, or optimizing work that touches multiple services. This blog post isn’t about the relative benefits of monoliths though (but we’ve written more about that here if you are interested)! Ownership in monoliths is tricky.

Incident.io

How we handle sensitive data in BigQuery

Nov 14, 2024 By Lambert Le Manh In Incident.io

As a provider of incident management software, we at incident.io manage sensitive data regarding our customers. This includes Personally Identifiable Information (PII) about their employees, such as emails, first names, and last names, as well as confidential details regarding customer incidents, such as names and summaries. Consequently, we approach the management of this data with a great deal of care.

Incident.io

How we model our data warehouse

Nov 8, 2024 By Jack Colsey In Incident.io

We've written several times about our data stack here at incident.io, but never about our underlying data warehouse and the design principles behind it. This blog post will run through the high-level structure of our data warehouse and then go in depth into the underlying layers.

Incident.io

Stop, Drop, and SEV4: Why small incidents are a big deal with Derek Brown

Nov 7, 2024 By Incident.io In Incident.io

Watch Derek's full talk from SEV0 here: https://go.incident.io/a8xPaeB

Incident.io

Incident Management

Lessons from 4 years of weekly changelogs

Nov 7, 2024 By Pete Hamilton In Incident.io

Writing a meaningful update for customers every week has been held sacred at incident.io since we started the company. We've written over 200 of them in the past 4 years, and we recently celebrated going 2 years straight without missing a single week. The numbers themselves are not the goal, but the consistency of this habit and what it represents for our customers and our team is very real, and special to me.

Incident.io

Observability as a superpower

Nov 4, 2024 By Sam Starling In Incident.io

With every job I have, I come across a new observability tool that I can’t live without. It’s also something that’s a superpower for us at incident.io: we often detect bugs faster than our customers can report them to us. A couple of jobs ago, that was Prometheus. In my previous job, it was the fact that we retained all of our logs for 30 days, and had them available to search using the Elastic stack (back then, the ELK stack: Elasticsearch, Logstash, and Kibana).

Incident.io
