Operations | Monitoring | ITSM | DevOps | Cloud

Right Size Your Model Usage with Valkey and Semantic Routing

Benchmarks keep showing that picking the right LLM is hard. The easy answer is "just use the most powerful one." That works, but it is pricey. A small, cheap, or local model can handle many simple requests just as well as a frontier model, for a fraction of the cost. That is what semantic routing is for. Use middleware that looks at an incoming request and decides which model should answer it.

Apache Kafka Share Groups are NOT true queues. Here's why that's a good thing.

This Spring, the Apache Kafka community released version 4.2 with a “production-ready” Share Group feature. Also known as a “Kafka queue” people were eager to see this feature because it introduced elastic consumer scaling, individual message acknowledgments, and built-in "poison pill" handling; similar to what you'd find in traditional message brokers like RabbitMQ and ActiveMQ.