When the cost of different requests varies widely it’s difficult to get it right...

contravariant · 2025-08-02T22:01:05 1754172065

I'm not 100% sure it's just load balancing. It would depend on the details of the setup, but that situation also lets you throw more resources at each request.

I mean, obviously there is a point where splitting up the instances doesn't help, because you're just leaving more instances completely idle, or with too few resources to be useful.
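
To make that trade-off concrete, here's a toy sketch. Everything in it is an assumption for illustration: a fixed core budget split evenly across instances, one request per instance at a time, and a made-up minimum number of cores a request needs to be served usefully. The function name and parameters are hypothetical, not from any real load balancer.

```python
def split_summary(total_cores, n_instances, concurrent_requests, min_cores_per_request):
    """Toy model: split a fixed core budget evenly across instances.

    Assumes (hypothetically) that each instance serves one request at a time
    and that a request needs at least `min_cores_per_request` to be useful.
    """
    cores_per_instance = total_cores / n_instances
    # Instances beyond the number of in-flight requests have nothing to do.
    idle_instances = max(0, n_instances - concurrent_requests)
    # Past some split, each instance is too small to serve a request well.
    undersized = cores_per_instance < min_cores_per_request
    return {
        "cores_per_instance": cores_per_instance,
        "idle_instances": idle_instances,
        "undersized": undersized,
    }


if __name__ == "__main__":
    # Illustrative numbers only: 16 cores, 4 concurrent requests, 2 cores minimum.
    for n in (1, 4, 16, 64):
        print(n, split_summary(16, n, 4, 2))
```

With few instances each request gets plenty of cores but requests queue behind each other. With too many, most instances sit idle and the rest are too small. It ignores the varying request cost from the parent comment entirely, which is exactly the part that makes the real problem hard.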