☁️ Cloud & Infrastructure

Cloud Run Worker Pools: 40% Cheaper for AI Scaling [Estée Lauder Case]

Holiday shoppers bombarding Jo Malone's AI Scent Advisor? Estée Lauder didn't blink—thanks to Cloud Run worker pools. But does this serverless shift actually save real money, or just Google's PR?

Estée Lauder AI Scent Advisor interface on Cloud Run worker pools dashboard

⚡ Key Takeaways

  • Estée Lauder scaled consumer AI for holidays using Cloud Run worker pools—no message loss, zero server ops. 𝕏
  • 40% cheaper than services/jobs for long-running tasks; supports GPUs for distributed ML. 𝕏
  • CREMA open-sources scaling for Pub/Sub, Kafka—handles surges, idles to zero. 𝕏
  • Producer-consumer model decouples UI from heavy LLM inference. 𝕏
Jordan Kim
Written by

Jordan Kim

Cloud and infrastructure correspondent. Covers Kubernetes, DevOps tooling, and platform engineering.

Worth sharing?

Get the best Developer Tools stories of the week in your inbox — no noise, no spam.

Originally reported by Google Cloud Blog

Stay in the loop

The week's most important stories from Dev Digest, delivered once a week.