
Cloud Log Management: A Developer's Guide to Scalable Observability

As systems move to microservices, serverless, and multi-cloud setups, debugging gets harder. You’re no longer dealing with a single log file; you’re looking at logs from dozens of services running across different environments. Traditional debugging methods like SSH-ing into servers or adding print statements don’t scale in these environments. Cloud log management tools help by collecting logs from all your services into one place.
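
As a minimal first step toward that, services can emit structured JSON to stdout so any collector or shipping agent can pick the logs up. The sketch below uses only the Python standard library; the service name and fields are illustrative, not a specific tool's format.

```python
import json
import logging
import sys

class JsonFormatter(logging.Formatter):
    """Render each record as one JSON line that any log shipper can parse."""
    def format(self, record):
        return json.dumps({
            "timestamp": self.formatTime(record),
            "level": record.levelname,
            "service": "checkout-api",   # illustrative service name
            "message": record.getMessage(),
        })

handler = logging.StreamHandler(sys.stdout)
handler.setFormatter(JsonFormatter())
logging.basicConfig(level=logging.INFO, handlers=[handler])

logging.getLogger(__name__).info("order created")
```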

What Are Log Loss and Cross-Entropy?

You're building a classification model, and your framework throws around terms like "log loss" and "cross-entropy loss." Are they the same thing? When should you use binary cross-entropy versus categorical cross-entropy? What about focal loss? This blog breaks down these loss functions with practical examples and real-world implementations.
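
As a quick illustration of the formula behind these terms, binary cross-entropy averages -[y·log(p) + (1-y)·log(1-p)] over examples. This NumPy sketch is a minimal reference implementation, not the framework versions the post compares.

```python
import numpy as np

def binary_cross_entropy(y_true, y_pred, eps=1e-12):
    """Average negative log-likelihood for binary labels; eps guards against log(0)."""
    y_pred = np.clip(y_pred, eps, 1 - eps)
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

y_true = np.array([1, 0, 1, 1])
y_pred = np.array([0.9, 0.1, 0.8, 0.4])
print(binary_cross_entropy(y_true, y_pred))  # ≈ 0.34
```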

How to Get Logs from Docker Containers

When a container misbehaves, logs are the first place to look. Whether you're debugging a crash, tracking API errors, or verifying app behavior—docker logs gives you direct access to what's happening inside. This blog covers the full workflow: how to retrieve logs, filter them by time or service, and set up logging for production environments.
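
For a sense of what that retrieval looks like programmatically, here is a sketch using the Docker SDK for Python (pip install docker); the container name "web" is a placeholder.

```python
import time
import docker  # Docker SDK for Python

client = docker.from_env()
container = client.containers.get("web")  # placeholder container name

# Last 100 lines with timestamps from the past 10 minutes, roughly equivalent to:
#   docker logs --tail 100 --timestamps --since 10m web
logs = container.logs(tail=100, timestamps=True, since=int(time.time()) - 600)
print(logs.decode("utf-8", errors="replace"))
```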

Troubleshooting LangChain/LangGraph Traces: Common Issues and Fixes

We’ve covered how to get LangChain traces up and running. But even when everything’s instrumented, traces can still go missing, show up half-broken, or look nothing like what you expected. This guide is about what happens after setup, when traces exist but something’s off.
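
One frequent culprit for missing traces is a short-lived script that exits before the batch processor has exported its spans. A hedged sketch, assuming the OpenTelemetry Python SDK was configured elsewhere, is to flush the provider explicitly before the process ends.

```python
from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider

# Assumes a TracerProvider with an OTLP exporter was configured at startup.
provider = trace.get_tracer_provider()

# ... run your chain / agent here ...

# BatchSpanProcessor exports asynchronously; a script that exits quickly
# can drop every span it recorded unless it flushes first.
if isinstance(provider, TracerProvider):
    provider.force_flush()
    provider.shutdown()
```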

Improve Consistency Across Signals with OTel Semantic Conventions

It’s 2 AM. Your API is timing out. Logs show a slow query. Metrics flag a spike in DB connections. Traces reveal a 5-second delay on a database call. But then the questions start: Which database? Does the query match the delay? Why doesn’t this align with the connection pool metrics? Each tool uses a different label, db.name in one place, database in another, sometimes nothing at all. Without a shared schema, connecting the dots is slow and frustrating.
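
A sketch of what the post argues for: tag spans with the standard db.* attribute keys from the OTel semantic conventions instead of ad-hoc labels, so logs, metrics, and traces can be joined on the same names. The tracer setup and values here are illustrative.

```python
from opentelemetry import trace

tracer = trace.get_tracer(__name__)

# Semantic-convention keys (db.system, db.name, db.statement) give every
# signal the same vocabulary for describing the same database call.
with tracer.start_as_current_span("orders-db.query") as span:
    span.set_attribute("db.system", "postgresql")
    span.set_attribute("db.name", "orders")  # illustrative database name
    span.set_attribute("db.statement", "SELECT * FROM orders WHERE id = $1")
    # run_query(...)  # placeholder for the actual call
```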

How Replicas Work in Kubernetes

Replicas in Kubernetes control how many copies of your pods run simultaneously. They're the foundation of scaling, availability, and recovery in your cluster. Whether you're running a stateless API or a background worker, understanding how replicas work directly impacts your application's reliability and performance. This blog walks through replica management, from basic concepts to production monitoring patterns that help you maintain healthy, scalable applications.
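
As a hedged sketch of what that management looks like in code, the official Kubernetes Python client can read a Deployment's desired versus ready replicas and scale it; the Deployment name and namespace below are placeholders.

```python
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() inside a pod
apps = client.AppsV1Api()

# Placeholder Deployment: "api" in namespace "default"
dep = apps.read_namespaced_deployment(name="api", namespace="default")
print(f"desired={dep.spec.replicas} ready={dep.status.ready_replicas}")

# Scale to 3 replicas; the Deployment controller adds or removes pods to match.
apps.patch_namespaced_deployment_scale(
    name="api",
    namespace="default",
    body={"spec": {"replicas": 3}},
)
```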

Instrument LangChain and LangGraph Apps with OpenTelemetry

In our previous blog, we talked about how LangChain and LangGraph help structure your agent’s behavior. But structure isn’t the same as visibility. This one’s about fixing that. Not with more logs. Not with generic dashboards. You need to see what your agent did, step by step, tool by tool, so you can understand how a simple query turned into a long, expensive run.
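
Even before any LangChain-specific instrumentation, you can get step-level visibility by wrapping each chain or tool call in a manual OpenTelemetry span. This is only a sketch: it assumes a tracer is already configured, and the `chain` object and attribute names are hypothetical.

```python
from opentelemetry import trace

tracer = trace.get_tracer("agent")

def traced_invoke(chain, question):
    """Wrap a chain call in a span so the step shows up in the trace."""
    with tracer.start_as_current_span("chain.invoke") as span:
        span.set_attribute("input.question", question)   # illustrative attribute
        result = chain.invoke({"question": question})     # `chain` is hypothetical
        span.set_attribute("output.length", len(str(result)))
        return result
```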

Prometheus Group By Label: Advanced Aggregation Techniques for Monitoring

Your Prometheus dashboard shows 847 CPU metrics. The alert fired—but is the problem in us-east or us-west? You're trying to rule out whether that new feature caused a latency spike, but the sheer number of time series isn’t helping. Grouping can make this manageable. By organizing metrics by shared label values, you can quickly spot which service or region is behaving differently, without digging through every metric.
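
For instance, a `sum by (region)` aggregation collapses hundreds of per-instance series into one per region. This sketch runs such a query against the Prometheus HTTP API with `requests`; the URL and metric name are assumptions for illustration.

```python
import requests

# Assumes Prometheus is reachable at localhost:9090 and exposes http_requests_total.
query = 'sum by (region) (rate(http_requests_total[5m]))'
resp = requests.get("http://localhost:9090/api/v1/query", params={"query": query})
resp.raise_for_status()

# Instant-query results come back as one entry per label group.
for series in resp.json()["data"]["result"]:
    region = series["metric"].get("region", "unknown")
    value = series["value"][1]
    print(f"{region}: {value} req/s")
```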

Docker Status Unhealthy: What It Means and How to Fix It

If your container shows Status: unhealthy, Docker's health check is failing. The container is still running, but something inside, usually your app, isn’t responding as expected. This doesn’t always mean a crash. It just means Docker can’t verify the app is working. Here’s how to debug the issue and restore the container to a healthy state.
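
To see why Docker marked a container unhealthy, inspect its health log: each entry records one probe run with its exit code and captured output. A sketch with the Docker SDK for Python; the container name is a placeholder.

```python
import docker  # pip install docker

client = docker.from_env()
container = client.containers.get("web")  # placeholder container name
container.reload()                        # refresh cached inspect data

health = container.attrs["State"].get("Health", {})
print("status:", health.get("Status"))    # e.g. "unhealthy"

# Each entry is one health-check run: exit code and captured output.
for probe in health.get("Log", []):
    print(probe["ExitCode"], probe["Output"].strip())
```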

LangChain Observability: From Zero to Production in 10 Minutes

LangChain apps are powerful, but they’re not easy to monitor. A single request might pass through an LLM, a vector store, external APIs, and a custom chain of tools. And when something slows down or silently fails, debugging is often guesswork. In one instance, a developer ended up with an unexpected $30,000 OpenAI bill, with no visibility into what triggered it. This blog shows how to avoid that using OpenTelemetry and LangSmith.