
Incident Review - AWS Outages Crash Major Online Services - Including Amazon

The following is an analysis of the Amazon Web Services incident on 12/07/2021. Millions of users were affected by an Amazon Web Services outage that took down major online services such as Amazon, Amazon Prime, Amazon Alexa, Venmo, Disney+, Instacart, Roku, Kindle, and multiple online gaming sites. The outage, which originated in the US-EAST-1 region on Dec. 7, 2021, is still ongoing at the time of blog publication.

Python JSON Log Limits: What Are They and How Can You Avoid Them?

Python JSON logging has become the standard for generating readable structured data from logs. While JSON output is much easier for machines to parse than the plain-text output the standard logging module produces by default, it comes with its own set of challenges. As your server or application grows, the volume of logs grows rapidly with it. JSON log files are difficult to go through, even though they are structured, because of the sheer volume of logs generated.
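As a point of reference for what "logging in JSON" can look like, here is a minimal sketch that uses only the standard library. The `JsonFormatter` class, the `build_logger` helper, and the field names are assumptions made for illustration; they are not taken from the article or from any particular logging package.

```python
import io
import json
import logging


class JsonFormatter(logging.Formatter):
    """Hypothetical formatter: renders each log record as one JSON object."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "time": self.formatTime(record),
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        # Include the traceback text only when the record carries one.
        if record.exc_info:
            payload["exc_info"] = self.formatException(record.exc_info)
        return json.dumps(payload)


def build_logger(name: str, stream) -> logging.Logger:
    """Return a logger that writes one JSON document per line to `stream`."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.handlers.clear()  # avoid duplicate handlers on repeated calls
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    logger.propagate = False
    return logger


# Usage: log into an in-memory buffer and parse the line back.
buffer = io.StringIO()
log = build_logger("demo", buffer)
log.info("user %s logged in", "alice")
entry = json.loads(buffer.getvalue())
```

Because every line is a self-contained JSON document, downstream tools can filter on fields such as `level` instead of matching free text, although this does nothing by itself to reduce the volume of logs.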

AWS Outage on Dec. 7, 2021 - When Did You Know About It?

If something isn’t working as expected, your customers will want to know. How quickly did you know that AWS’s us-east-1 region was having issues? Was it from an article online? Customer requests flooding into your support queue? A tweet? Not being able to get into a PUBG match? Or speaking of matches, were you unable to message your last Tinder connection?

ScienceLogic's DoDIN APL Certification Journey: Watch out for Spiders & Snakes

This is the fourth and final in a series of ScienceLogic blogs on the topic of the Department of Defense Information Network (DoDIN), including what it is, what it means to be approved under DoDIN standards, why it is important to both our federal and private industry customers, and the process for being approved for listing.

A Successful Monitoring as a Service Case: Drilling & Mining Industry

Migrating or restructuring a SCOM environment can seem like an overwhelming, even impossible, task. For one of our customers in the mining industry, however, it went more than well, and the improvements have been exceptional. Applying our Monitoring as a Service, they could benefit from the aggregated experiences of programmers, system administrators, and DevOps engineers that our service builds on to make the migration as smooth and trouble-free as possible.

Website and Performance Monitoring for Edge Cases

Specific needs are compelling but also hard to plan for. Your use case may be the reason you are searching for a monitoring provider, but the ability of your provider to adapt to your edge cases will be the reason you stay. The challenge is in discerning whether a provider will be able to rise to meet your needs in unknown circumstances. In monitoring, there are some uniform needs. Everyone wants to know if their site is up, so HTTP(S) checks meet use cases universally.
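The basic HTTP(S) check mentioned above can be sketched in a few lines of standard-library Python. The function name `is_up`, the default timeout, and the rule that any 2xx or 3xx status counts as "up" are assumptions for illustration, not a description of how any monitoring provider implements its checks.

```python
import http.client
import urllib.request


def is_up(url: str, timeout: float = 5.0) -> bool:
    """Hypothetical uptime check: True if the URL answers with a 2xx/3xx status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return 200 <= response.status < 400
    except (OSError, ValueError, http.client.HTTPException):
        # OSError covers URLError, HTTPError (4xx/5xx), refused connections
        # and timeouts; ValueError covers malformed URLs.
        return False
```

A call such as `is_up("https://example.com")` returns a single boolean. Edge cases like multi-step transactions or content validation need more than this kind of check.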

Percepio Wins Coveted Elektra Award for Tracealyzer for Linux

Percepio®, the leader in visual trace diagnostics for embedded systems and the Internet of Things (IoT), has been awarded the prestigious Elektra Award 2021 for its visual trace diagnostics tool Tracealyzer for Linux. Tracealyzer for Linux was voted best product in the “Design Tools and Development Software” category by the jury, ahead of developer tools from five other companies, including Cadence and Synopsys.

Getting Started with the InfluxDB 2.0 API and Postman

Whether you’re using InfluxDB Cloud or InfluxDB OSS, the InfluxDB API provides a simple way to interact with your InfluxDB instance. The InfluxDB v2.0 API offers a unified approach to querying, writing data to, and assessing the health of your InfluxDB instances. Today we want to share a Postman project to help you use the API easily. Postman is “an API platform for building and using APIs”.
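For readers who prefer code to a Postman collection, the sketch below builds (but does not send) two requests against the InfluxDB v2 API using only the Python standard library. It assumes the `/health` and `/api/v2/write` paths and the `Token` authorization scheme as documented for InfluxDB OSS 2.x; the helper names, base URL, org, bucket, and token values are placeholders chosen for illustration.

```python
import urllib.parse
import urllib.request


def health_request(base_url: str) -> urllib.request.Request:
    """Build a GET request for the instance health endpoint (assumed: /health)."""
    return urllib.request.Request(f"{base_url.rstrip('/')}/health", method="GET")


def write_request(
    base_url: str, org: str, bucket: str, token: str, line_protocol: str
) -> urllib.request.Request:
    """Build a POST request that writes line protocol data to a bucket."""
    query = urllib.parse.urlencode(
        {"org": org, "bucket": bucket, "precision": "ns"}
    )
    return urllib.request.Request(
        f"{base_url.rstrip('/')}/api/v2/write?{query}",
        data=line_protocol.encode("utf-8"),
        headers={
            "Authorization": f"Token {token}",
            "Content-Type": "text/plain; charset=utf-8",
        },
        method="POST",
    )


# Placeholder values; replace with your own instance details.
req = write_request(
    "http://localhost:8086", "my-org", "my-bucket", "my-token",
    "cpu,host=server01 usage=0.64",
)
health = health_request("http://localhost:8086/")
```

Passing either object to `urllib.request.urlopen` would send it. Keeping construction separate from sending makes the URL and headers easy to inspect first.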