Is the Data Economy the World's Greatest Operating Environment?

This is the second piece in our Data Economy series. While the first looked at the data economy as a whole and why it is important, this piece looks more specifically at the environment around the data economy. This includes both the environment needed for it to thrive and the ecosystem of companies and businesses being created to evolve it.

Monitor AWS Trainium and AWS Inferentia with Datadog for holistic visibility into ML infrastructure

AWS Inferentia and AWS Trainium are purpose-built AI chips that—with the AWS Neuron SDK—are used to build and deploy generative AI models. As models increasingly require a larger number of accelerated compute instances, observability plays a critical role in ML operations, empowering users to improve performance, diagnose and fix failures, and optimize resource utilization.

The Journey to Autonomic IT: Why Enterprises Must Let Go to Learn

Several of our recent blog posts have introduced the characteristics of each phase of the Autonomic IT maturity model, from Siloed IT to Coordinated IT (an essential foundation for Autonomic IT) and the transition to Machine-Assisted IT and AI-Advised IT. We explored how you can identify where your organization stands on this transformative journey, why you might not be as far along as you believe, and what is needed to advance your journey. Now we arrive at IT nirvana: Phase 5, Autonomic IT.

Year-end recap: What's new in IT infrastructure monitoring: 2024

Effective IT monitoring is critical to maintaining seamless operations, and 2024 has been a year of addressing challenges and delivering solutions with Site24x7. From upgrading server health and performance to streamlining Kubernetes and VM administration, let's plunge into how Site24x7’s updates have helped IT teams tackle their monitoring challenges and enhance infrastructure reliability.

Mobile crash reporting and debugging best practices

Maintaining a crash-free, stable mobile app should be a top priority for all mobile developers. App stores penalize mobile apps that have high crash rates, and more importantly, buggy apps create poor user experiences, resulting in bad reviews and lost customers. Watch this session to learn key tips for identifying, resolving, and preventing crashes quickly, so you can spend less time troubleshooting and more time building.

State of Cloud Costs

Cloud spending continues to grow, but managing costs effectively remains a challenge for many organizations. In this video, Datadog Senior Product Manager Kayla Taylor dives into our recent State of Cloud Costs report—which analyzed AWS cloud cost data from hundreds of organizations—to understand the key factors driving cloud expenses. We explore the impact of adopting emerging compute technologies like Arm-based processors, GPUs, and AI capabilities, how usage patterns and previous-generation technologies affect cloud costs, and the role of AWS discount programs in cost management.