Model hosting patterns in Amazon SageMaker, Part 1: Common design patterns for building ML applications on Amazon SageMaker
AWS Machine Learning Blog
JANUARY 9, 2023
Additionally, you pay only for the compute capacity used to process inference requests, which is ideal for intermittent workloads. They’re a good option for intermittent or infrequent traffic patterns. With a pay-as-you-run model, serverless inference is a cost-effective option if you have infrequent or intermittent traffic patterns.
Let's personalize your content