Operations | Monitoring | ITSM | DevOps | Cloud

Blog

How the Cortex and Thanos projects collaborate to make scaling Prometheus better for all

Cortex and Thanos are two brilliant solutions to scale out Prometheus, and many companies are now running them in production at scale. These two projects, both in the CNCF Sandbox, initially started with different technical approaches and philosophies: Cortex has been designed for scalability and high performances since day zero, while Thanos was originally focused on operational simplicity and cost-effectiveness.

Auditing and Reporting In Cloudsmith

What software assets does your organization use? What sounds like a simple question is anything but. If we include every package and dependency that ends up in the code we produce then for most development teams the truthful answer is ‘we don’t know’. As we’ve said enough times already, that really isn’t good enough anymore. And that’s one of the core motivations behind Cloudsmith.

How to Cohesively Execute Change and Release Management

Strategic changes and new deployments in all growing businesses are expected. So by design, businesses need to be adaptive to new changes. When such transformations are being undertaken, organizations need to continue preserving their infrastructures while complying with pre-defined timelines. To efficiently manage such deployments, organizations make use of ITIL concepts like change and release management.

Icinga Web permissions and restrictions (how do they work, examples)

Permissions and restrictions are something used to control how much access is provided to a particular user or user group. Therefore one must first understand what is the role of a user or a group. Here, you will understand permissions and restrictions through configuring users, user groups and roles. They can either be created using configuration files or web interface. When the Icinga Web 2 is installed a default user with admin access is configured.

5 Tips for Getting Alert Fatigue Under Control

What happens when you receive a notification that something is wrong with your system and you have no clue what it means, or why you’re receiving that alert? Maybe you have to parse through the alert conditions to suss out what the alert indicates, or maybe you need to ping a coworker and ask. Not knowing what to do with an alert also contributes to alert fatigue, because it increases the toil and time required to respond.

A Clear Path to Automated Optimization of Application Performance

Rancher has helped thousands of organizations manage their transitions to containerized applications and Kubernetes. With its innovative distribution and suite of services, Rancher has given IT and DevOps teams the roadmap they needed to drive Kubernetes success. Given the complexity of Kubernetes and the shortage of engineers with Kubernetes-specific skills, Rancher’s offerings meet a critical and growing market need.

PagerDuty Paying Dividends for Form3's Digital Payment Platform

Your payment systems have slowed to a crawl, customers are getting impatient and abandoning their shopping carts both online and in stores, and you’re losing money every minute this problem goes on. Behind the scenes, technical responders are scrambling to resolve the issue before it impacts more customers—and before even more money is lost.

Announcing Spring Cloud Data Flow for Kubernetes

Spring Cloud Data Flow first emerged in 2015 as an evolution of the innovative Spring XD. Since then, hundreds of organizations such as CoreLogic have adopted this open source technology, which is available on Cloud Foundry and Kubernetes. Generally available today, Spring Cloud Data Flow for Kubernetes addresses the needs of streaming and batch processing in the enterprise. This new offering is only available through the VMware Spring Runtime subscription.