Operations | Monitoring | ITSM | DevOps | Cloud

Latest Videos

It's better to declare incidents early #incidentmanagement #sitereliabilityengineering

In this clip, Viktor Stanchev explains why it's better to declare incidents early rather than too late. Whether you’re a seasoned vet when it comes to incident response, or just getting started out, it can be easy to fall into the trap of doing too much all at once. And it just makes sense. Incident response is one of those things that doesn’t have a single, perfect formula, so teams can be left doing a little bit of everything in an effort to get it right.

Learning is an iterative process #incidentmanagement #incidentresponse #sitereliabilityengineering

In this clip, Viktor Stanchev explains why it's important to remember that learning is an iterative process. Whether you’re a seasoned vet when it comes to incident response, or just getting started out, it can be easy to fall into the trap of doing too much all at once. And it just makes sense. Incident response is one of those things that doesn’t have a single, perfect formula, so teams can be left doing a little bit of everything in an effort to get it right.

What is clinical troubleshooting? #incidentmanagement #incidentresponse #sitereliabilityengineering

In this clip, Dan Slimmons explains what this clinical troubleshooting framework entails. It’s no secret that teamwork is one of those things that, when done right, can make a world of a difference. So sometimes, when responding to a particularly complicated incident, it can be best to bring a team together to figure out what’s going on and work towards a fix. But it’s not enough to just jam a bunch of folks into a room and hope for the best. You need a framework in place to ensure that everyone stays focused, diagnoses the issue and resolves it as quickly as possible.

Mastering Cybersecurity: Essential OWASP Guidelines for Effective Protection

Join Dwayne McDaniel as he discusses the challenges and essentials of effective cybersecurity, highlighting the impact of bad security practices, the benefits of robust security measures, and the importance of community collaboration. This talk explores practical insights on improving security protocols, leveraging community knowledge, and the significant role of automation in ensuring safe, uninterrupted digital environments.

ISO 27001: 8 Steps to Implement The Security Standard

ISO 27001 is the globally recognized standard for Information Security Management. It uses a Risk Management approach to provide a framework for managing, controlling, and protecting privileged and sensitive information. The standard sets out the requirements for establishing, maintaining, implementing, and continually improving an organization's Information Security Management System (ISMS).

What are some startups Solomon Hykes is rooting for?

What are some startups Solomon Hykes is rooting for? What's his most controversial opinion? Who are some community members that more people should follow? Discover the answers to these questions, and a lot more in the Incidentally Reliable Podcast with Solomon Hykes, live on all major platforms! Tune in as Solomon shares stories from the early days of Docker, Inc, the rollercoaster journey leading to 20 million active developers worldwide, the heavy crown of a tech leader and his vision to revolutionize CI/CD with Dagger today.

The relationship between cloud FinOps and Security - Expert tips

In this episode, we delve into the relationship between Azure Cost Management and security in cloud computing with FinOps certified practitioner Michael Stephenson and Microsoft MVP for Security Nino Crudele. Learn how security measures impact cloud costs and explore strategies for balancing robust security with cost-effectiveness. Discover the crucial role of governance, policy enforcement, and FinOps in optimizing both cost and security postures.

What Can a Service Mesh Do for Your Kubernetes Environment? with Tony Pope-Cruz

Explore the essentials of Kubernetes management with Tony Pope-Cruz from @dynatrace in this detailed walkthrough. Understand how to avoid common pitfalls in Kubernetes deployments, such as mismanagement of resources that can lead to significant outages. Gain insights into how service meshes provide robust solutions for traffic management, service reliability, and observability.