Is prompt caching billed differently for serverless models?

Yes, cached prompt tokens are discounted compared to uncached tokens for serverless models. The default discount is 50%, but the exact discount varies by model. Check the Model Library for model-specific cached and uncached input token pricing.