Datadog

Mitigate cold starts in your Java Lambda functions with Datadog and AWS Lambda SnapStart

AWS Lambda enables engineering teams to build modern, scalable services without the need to provision underlying infrastructure resources. But monitoring Lambda functions requires visibility into performance indicators that differ from those of traditional architectures—and cold starts are a key example.

Datadog acquires Cloudcraft

A well-designed cloud architecture is essential to ensure that the underlying infrastructure stays operational, within budget, and compliant over time. These days, organizations are rapidly spreading their infrastructure across a broad, complex mesh of interconnected resources and services. It can be difficult to make high-level decisions about the design and management of these systems. This is why many organizations are now turning to cloud infrastructure modeling tools.

Architecting for Reliability

As modern systems become increasingly complex, the risk of incidents and outages grows. Old approaches to reliability can sometimes be adapted to novel system designs, but other times new methods need to be invented. In this panel session moderated by Datadog’s Jason Yee, you’ll hear from SRE leaders and systems architects across the industry about how they’re designing and operating systems to achieve greater reliability.

Democratizing Observability

DevOps principles have helped many organizations improve cross-team collaboration, which has in turn led to increased reliability and velocity in the development lifecycle. In this session moderated by Jason Yee, we hear from panelists who have applied these same DevOps principles to observability, helping them unlock data-based insights and empower teams to make smarter, more informed decisions.

RUM now offers React Native Crash Reporting and Error Tracking

React Native has become a leading development framework for cross-platform mobile applications. By interacting with native APIs largely under the hood and requiring only a small amount of platform-specific code, it allows you to build applications for iOS, Android, and the browser using the same declarative JavaScript. But this cross-platform adaptability has its downsides.

Dash Panel Discussion: What Users Really Want

Measuring user experience is typically done by tracking metrics like latency and purchase frequency. But these metrics can often obscure real user sentiment. In this panel session moderated by Miranda Kapin, you can learn about better ways to uncover how users are truly experiencing your application and methods for improving their engagement.

Generate RUM-based metrics to track historical trends in customer experience

Datadog Real User Monitoring (RUM) provides end-to-end visibility into the user experience and performance of your browser and mobile applications. RUM allows you to capture and retain complete user sessions for 30 days. This means you can pinpoint bugs, prioritize issues, and determine fixes with data collected across an entire month.

Building a Multi-Tenant Insurance Platform

In 2020, CoverWallet—a multi-tenant insurance platform—was acquired by Aon, which led to a rapid expansion in both the size and global presence of its engineering organization. In his talk, CoverWallet’s Hylke Alons walks through the changes that were necessary to meet their platform's new expectations, including improving growth and scalability while ensuring reliability, automating security, and reducing maintenance. He also discusses some best practices for scaling up engineering and product teams to handle demand in a complex and highly regulated industry like insurance.

Stress test your Kubernetes application with Speedscale's offering in the Datadog Marketplace

Properly testing a service’s APIs to ensure that it can handle production traffic presents many challenges for engineers—SREs need to guarantee the resiliency of their application, while developers must ensure that their features perform well at any given scale. Speedscale is a testing framework built for Kubernetes applications that enables you to load test with real-world production scenarios by replaying actual API traffic that your application has experienced.