Operations | Monitoring | ITSM | DevOps | Cloud

Latest posts

5 AWS Tagging Challenges - And How CloudZero Solves Them

As an AWS user, you know how important it is that your organization understands what you’re spending and why. AWS is a tremendously powerful resource, and when you have complete visibility into the cost efficiency of your AWS environment, you can use it to remove many of the scaling obstacles that companies faced in the past.

ScienceLogic Secures the TrustRadius Best of Award in AIOps: A Triumph of Value, Features, and Relationships!

At ScienceLogic, we’ve always believed in the power of innovation and the importance of customer satisfaction. We are excited to announce that we have been honored with the TrustRadius Best of Award in the AIOps category for 2023. This is a testament to our dedication to providing exceptional value, top-notch features, and building enduring relationships with our customers.

Counting Crashes to Improve Device Reliability

The first step to making reliable IoT devices is understanding that they are inherently unreliable. They will never work 100% of the time. This is partially because we firmware engineers will never write perfect code. Even if we did, our devices need to operate through various networks and gateways, such as cellular modems, mobile phone Bluetooth applications, Wi-Fi routers, cloud backends, and more, and each of these may introduce unreliability.

ITIM and the Public Sector: How Network Monitoring Rises to the Challenge

Government agencies and public sector organizations are a tantalizing hacker target. Cybercriminals go after public sector organizations because they hold confidential, often classified, information – the exact data state-sponsored and other criminal groups salivate over. The Cybersecurity and Infrastructure Security Agency, or CISA, along with the United States Computer Emergency Readiness Team, or CERT, have warned public sector IT of key threats.

Formalize your organization's best practices with custom Scorecards in Datadog

The Datadog Service Catalog is a centralized hub of information around the performance, reliability, security, efficiency, and ownership of your distributed services. By using the Service Catalog, teams can eliminate knowledge silos and realize seamless DevSecOps workflows.

Troubleshooting Container Network Latency in Kubernetes with Kentik Kube

Kentik Kube brings network observability to Kubernetes. In this Kentik Kube product demo, we navigate a real-time scenario of troubleshooting high latency within a Kubernetes cluster. The Kentik Kube map offers a visualization of our environments, complete with automated alerts and the ability to correlate performance metrics directly to affected pods.

Custom Container Network Monitoring and Alerting in Kubernetes with Kentik Kube

Discover the power of proactive network monitoring in Kubernetes with Kentik Kube. This demo highlights the critical importance of custom dashboards and alerts in maintaining optimal container performance and service availability. We take you through creating a tailored alert for a checkout service within Kentik Kube. From selecting services to diving deep into performance metrics via the Kentik Data Explorer, we show you how Kentik Kube makes it easy to set up policies that monitor and alert you to Kubernetes network issues as they arise.

Tackling Staffing, Funding, and Data Challenges Head-On with TAQA

Join Ed Bailey and TAQA Group's Andrew Ochse as they discuss the diverse services that TAQA offers, look at the challenges with scaling and staffing, and explore in great detail the solutions to classic problems such as insufficient funding, poor data quality, and slow connections linking global sites to their Security Operations Center (SOC).