Blog

Five Things Your APM Platform Should do for Your Container Application Deployments.

May 17, 2019 By Vamsi Chemitiganti In Tigera

One of the chief complexities in running large scale containerized applications is the need for continuous systems/application monitoring. Containers are very different from traditional VMs and the 3 tier applications that run on them. Monitoring that needs to ensure that SLAs promised to the business are being met as well as an ability to forecast usage trends while identifying problem areas such as bugs, capacity challenges, slowing performance, and any potential downtime.

Read Post

Tigera

Read more about Five Things Your APM Platform Should do for Your Container Application Deployments.

Dynamic Sampling by Example

May 17, 2019 By Liz Fong-Jones In Honeycomb

Last week, Rachel published a guide describing the advantages of dynamic sampling. In it, we discussed varying sample rates to achieve a target collection rate overall, and having different sample rates for distinct kinds of keys. We also teased the idea of combining the two techniques to preserve the most important events and traces for debugging without drowning them out in a sea of noise.

Read Post

Honeycomb

Read more about Dynamic Sampling by Example

Why Your Lambda Functions May Be Doomed To Fail

May 17, 2019 By Renato Byrro In Dashbird

AWS Lambda has a cool feature that can be both a blessing and a nightmare for a serverless application, depending on whether it’s properly handled by our code: the retry behavior. A retry occurs when an invocation of a Lambda function results in an error and the AWS Lambda platform automatically invokes the function again, with the same event payload. Before we get deeper, make sure you are familiar with the AWS documentation on the subject.

Read Post

Dashbird

Read more about Why Your Lambda Functions May Be Doomed To Fail

Alert escalation - How it works in SIGNL4

May 17, 2019 By Matt In SIGNL4

Part of any managers role is to make sure their team is taking accountability. Managers are not the front lines resolvers that handle issues, that is what they have a team for. However, managers do need to be aware of incidents that are occurring as well as making sure their team is taking ownership and resolving those issues. SIGNL4 takes the managerial work out of being a manager by providing alert ownership transparency.

Read Post

SIGNL4

Read more about Alert escalation - How it works in SIGNL4

Week #20 Freyja Updates

May 17, 2019 By Lucian Daniliuc In Monitive

It’s been a while since the new Monitive codename “Freyja” was launched in private beta, and even though a lot of things have happened, development is moving forward a bit slow than I’ve hoped. Nevertheless, I’m happy that the core monitoring engine is running smoothly and it’s a great foundation for the years to come.

Read Post

Monitive

Read more about Week #20 Freyja Updates

Speeding up Security Investigations with Drilldown

May 16, 2019 By Daniel Berman In logz.io

At RSA this year, we introduced a series of new enhancements to Security Analytics – our new app for helping organizations combat security threats and meet compliance requirements. We are now happy to announce the official release of one of these features — Drilldown!

Read Post

logz.io

Read more about Speeding up Security Investigations with Drilldown

Firefox add-on outage: Yet another reminder for companies to enforce PKI life cycle automation

May 16, 2019 By Key Manager Plus In ManageEngine

More often than we’d like to admit, we tend to underestimate the impact of every moving part within an organization—especially those that seem small or insignificant. And usually, it’s not until we’re facing the fallout of neglecting that seemingly insignificant factor when we realize what a mistake we’ve made.

Read Post

ManageEngine

Read more about Firefox add-on outage: Yet another reminder for companies to enforce PKI life cycle automation

Introducing the Rancher 2 Terraform Provider

May 16, 2019 By Jason Van Brackel In Rancher

Infrastructure as code is an important methodology for ensuring that your distributed systems are treated as cattle and not pets. Your Kubernetes and Rancher clusters are no different. You should be able to provision your Rancher clusters, your Kubernetes clusters, and all of your apps with automation.

Read Post

Rancher

Read more about Introducing the Rancher 2 Terraform Provider

Nutanix .NEXT Wrap-Up

May 16, 2019 By Anirban Chatterjee In Zenoss

Last week, a team of us had the pleasure of representing Zenoss at the Nutanix .NEXT Conference for IT operations professionals, held in beautiful Anaheim, California, this year.

Read Post

Zenoss

Read more about Nutanix .NEXT Wrap-Up

Worth a Look: Public Grafana Dashboards

May 16, 2019 By Julie Dam In Grafana

There are countless Grafana dashboards that will only ever be seen internally. But there are also a number of large organizations that have made their dashboards public for a variety of uses. These dashboards can be interesting to browse, giving you an insider’s peek into how real Grafana users set up their visualizations, with actual live data to boot. Perhaps some of them will inspire you to get to work on your own Grafana?

Read Post

Grafana

Read more about Worth a Look: Public Grafana Dashboards

Operations | Monitoring | ITSM | DevOps | Cloud

Blog

Five Things Your APM Platform Should do for Your Container Application Deployments.

Dynamic Sampling by Example

Why Your Lambda Functions May Be Doomed To Fail

Alert escalation - How it works in SIGNL4

Week #20 Freyja Updates

Speeding up Security Investigations with Drilldown

Firefox add-on outage: Yet another reminder for companies to enforce PKI life cycle automation

Introducing the Rancher 2 Terraform Provider

Nutanix .NEXT Wrap-Up

Worth a Look: Public Grafana Dashboards

Monthly Archive

Follow Us