Alerting

Hrushikesh shares his journey into SRE and his thoughts on the future of this space

Hrushikesh is passionate about solving complex design problems with simple, reliable solutions. He is technology and platform agnostic and doesn’t believe in limiting himself to just a few. He started his career in 2006 with a media company, where he was responsible for introducing new technologies and driving a team to deliver quickly. He does not limit his role to development and operations and loves exploring everything in the tech space.

Dynamic alerts

The value of logs lies in how they reflect the status and behavior of our applications and infrastructure. Often we would like to be alerted when the application or its components show abnormal behavior. That behavior may show up as the application sending certain logs at a higher than usual volume. Figuring out exactly what ‘higher than usual’ means, or in other words, setting the threshold value at which the alert should trigger, can be a daunting task.
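As a rough illustration of what a dynamic threshold can look like, the sketch below derives the alert threshold from recent log volumes instead of hard-coding it. It assumes a simple "mean plus k standard deviations" rule over a trailing window. This is a common baseline and not necessarily the method the article describes, and the function names and the default k are hypothetical.

```python
from statistics import mean, stdev


def dynamic_threshold(history, k=3.0):
    """Return mean + k * sample standard deviation of past log counts.

    `history` is a list of log counts per interval (e.g. per minute).
    The choice of k is an assumption; tune it to the noise in your data.
    """
    if len(history) < 2:
        raise ValueError("need at least two historical samples")
    return mean(history) + k * stdev(history)


def should_alert(history, current, k=3.0):
    """Alert when the current count exceeds the dynamic threshold."""
    return current > dynamic_threshold(history, k)


if __name__ == "__main__":
    recent_counts = [100, 110, 90, 105, 95]  # made-up sample data
    print(should_alert(recent_counts, 120))
    print(should_alert(recent_counts, 200))
```

With this approach the threshold follows the application's normal volume, so a noisy service and a quiet service can share one rule without manual tuning.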

Chatbot integration with Microsoft Teams and Slack

SIGNL4 provides plug-and-play chatbot integrations with Microsoft Teams and Slack, both via certified chatbot apps. Why does it make sense to integrate SIGNL4 with chat tools at all? There are two basic use cases that we address with the integration into Teams and Slack. By default, SIGNL4 notifies by mobile push, text and voice calls, all according to user preference. The focus is clearly on mobile alert notifications. And of course, tracking and escalation of critical alerts is built in.

3 Things We Learned from EMA About AIOps and the Automation Handshake

AIOps is the trendy cool new kid on the block in the IT operations world. No doubt about it. However, with all the buzz surrounding AIOps, it’s easy to skip over some of the basics. How many IT operations professionals can clearly define what AIOps is? Beyond the baseline definition, why should you care? What about plugging it into your existing automation and analytics ecosystem?

How do we Apply SRE Outside of Engineering with Google's Dave Rensin

Our first keynote speaker is a senior director of engineering at Google. You might know him as the guy who founded and leads the customer reliability engineering (CRE) function at Google. CRE is a team that teaches the world SRE principles and practices. Now I want to tell you a bit more about him, because I think he has a unique view and perspective. He is deeply compassionate and intuitive as a teacher, not just a lecturer.

New in Grafana 6.6: Forcing minimum alert evaluation frequency

There has long been a request from administrators to have the ability to enforce a minimum interval between alert rule evaluations. This is useful for restricting unrealistic user-defined alert rules that evaluate too often and create unnecessary load in the backend. @Uepoch took the initiative and made all the necessary modifications for this configuration in Grafana’s backend, and we finally pushed it forward and introduced the feature in Grafana v6.6.
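The idea of an enforced minimum evaluation interval can be sketched as a simple clamp: whatever frequency a user requests, the effective interval never drops below the administrator's floor. The snippet below is only a conceptual illustration in Python. It is not Grafana's backend code, and the names `effective_interval` and `ADMIN_MIN_INTERVAL_SECONDS` are hypothetical.

```python
# Hypothetical administrator-defined floor, in seconds.
ADMIN_MIN_INTERVAL_SECONDS = 10


def effective_interval(requested_seconds, minimum_seconds=ADMIN_MIN_INTERVAL_SECONDS):
    """Return the interval actually used to evaluate an alert rule.

    Requests below the administrator's minimum are raised to that minimum;
    anything at or above it is left unchanged.
    """
    if requested_seconds <= 0:
        raise ValueError("interval must be positive")
    return max(requested_seconds, minimum_seconds)


if __name__ == "__main__":
    for requested in (1, 10, 60):
        print(requested, "->", effective_interval(requested))
```

A rule asking to run every second would be evaluated every 10 seconds under this assumed floor, while a 60-second rule is unaffected.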