Operations | Monitoring | ITSM | DevOps | Cloud

Office 365 Monitoring: The Challenges, and What to Do About Them

Office 365 is used by more than one million companies around the world. Business employees count on these apps constantly to do their jobs, whether they’re writing documents, updating spreadsheets, building slides, or checking email. While cloud-based apps like Office 365 offer undeniable advantages for enterprises and business users, they also create tough challenges for IT operations and network operations (NetOps) teams.

Customer-Centric Observability: Experiences, Not Just Metrics

Martin and Jess recently conversed with Todd Gardner of RequestMetrics as part of the O11ycast podcast. We don’t normally write blogs based on these conversations, but there were impactful comments in that episode that bear repeating. You can listen to the full conversation if you wish. Let’s get into it!

Observability: Working with Metrics, Logs and Traces

The concept of observability centers around collecting data from all parts of the system to provide a unified view of the software at large. Fault tolerance, no single point of failure and redundancy are prominent design principles in modern software systems. But that doesn’t mean errors, degradation, bugs or even the occasional catastrophe don’t happen.

Software Maintenance Best Practices for 2023

Businesses rely on software solutions increasingly in our modern age, and it’s constantly evolving. Compared to some of the software being used in the early 2000s, we’ve seen large changes, resulting in more complex frameworks, which come with their own unique changes. As software and systems become more complex, so increases the probability of errors occurring and the level of jeopardy those errors might present.

Automate end-to-end processes and quickly respond to events with Datadog Workflow Automation

Developer, SRE, IT, and security teams often perform complex and error-prone processes in response to disruptions and changes in their systems. Relying on these processes requires a significant amount of time switching between tools to gather the relevant context needed for remediation, domain expertise, and the manual execution of tasks for incident management—which can significantly prolong disruptions and downtime.

Complete Guide to tracing Kafka clients with OpenTelemetry in Go

OpenTelemetry can be used to trace Go applications that use Kafka to find performance issues and bugs. OpenTelemetry is an open-source project under the Cloud Native Computing Foundation (CNCF) that aims to standardize the generation and collection of telemetry data. Telemetry data includes logs, metrics, and traces. Apache Kafka introduced the ability to add headers to Kafka messages from version 0.11 onwards.

Our broken links check has been improved

One of our unique monitoring features is that we crawl your entire site to discover links that might be broken. When we discover a broken link, we'll send you a notification and display every broken link in our Broken Links Report. We've made a nice quality-of-life improvement to that Broken Links Report. In addition to displaying the broken link URL and the page on which that broken link was found, we now also display the link text of that broken link.

Top 11 MYSQL monitoring tools in 2023 [open-source included]

Database monitoring is a critical component in your application performance monitoring. Apart from application code issues, database issues are one of the most common reasons for a bad user experience. MySQL is one of the most popular open-source DBMS that businesses have widely adopted. MySQL monitoring tools can help you identify potential issues with your database, keep a continuous check on your database instances, improve performance and detect and alert you about real-time issues.

Cloud Provider Uptime Monitoring: May 2023 Insights

Explore our insightful May 2023 report on the uptime of top cloud providers. We've carefully assessed the health of these leading services by monitoring outages and issues throughout the month. Using data from their official status pages, we've normalized the information to create a clear and concise overview of their reliability. Find out how your favorite cloud provider stacks up in this essential report.