Cloud Run Worker Pools: 40% Cheaper for AI Scaling [Estée Lauder Case]
Holiday shoppers bombarding Jo Malone's AI Scent Advisor? Estée Lauder didn't blink—thanks to Cloud Run worker pools. But does this serverless shift actually save real money, or just Google's PR?
⚡ Key Takeaways
- Estée Lauder scaled consumer AI for holidays using Cloud Run worker pools—no message loss, zero server ops. 𝕏
- 40% cheaper than services/jobs for long-running tasks; supports GPUs for distributed ML. 𝕏
- CREMA open-sources scaling for Pub/Sub, Kafka—handles surges, idles to zero. 𝕏
- Producer-consumer model decouples UI from heavy LLM inference. 𝕏
Worth sharing?
Get the best Developer Tools stories of the week in your inbox — no noise, no spam.
Originally reported by Google Cloud Blog