Hardware Configurations and Pricing

The following table lists each hardware configuration and its hourly price, in model credits, for a single replica on AWS:

Hardware Configuration | Hourly Model Credits
A10 single             | 2.118
A10 cluster of 4       | 8.808
A10 cluster of 8       | 24.732
T4 single              | 1.428
T4 cluster of 4        | 6.168
T4 cluster of 8        | 12.036

Prices are per replica. You can set a minimum and a maximum number of replicas. For example, with a maximum of 3 replicas of "A10 single", up to 3 "A10 single" instances are used under high load; beyond that, additional requests wait until a replica is free.

A cluster configuration such as "T4 cluster of 4" counts as one replica. A second replica is another complete "T4 cluster of 4".
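
To make the per-replica arithmetic concrete, here is a minimal Python sketch that estimates the lowest and highest hourly credit usage for a replica range. The prices are copied from the table above. The function name hourly_credit_range and the dictionary HOURLY_CREDITS are hypothetical helpers, not part of any product API. The sketch also assumes that credits scale linearly with the number of running replicas and that a minimum of 0 replicas is permitted, neither of which this page states explicitly.

```python
# Hourly model credits per replica on AWS, copied from the pricing table.
HOURLY_CREDITS = {
    "A10 single": 2.118,
    "A10 cluster of 4": 8.808,
    "A10 cluster of 8": 24.732,
    "T4 single": 1.428,
    "T4 cluster of 4": 6.168,
    "T4 cluster of 8": 12.036,
}


def hourly_credit_range(configuration, min_replicas, max_replicas):
    """Return (lowest, highest) hourly credits for a replica range.

    Assumption: cost is per_replica_price * running_replicas, with no
    other fees. A whole cluster counts as one replica.
    """
    if not 0 <= min_replicas <= max_replicas:
        raise ValueError("expected 0 <= min_replicas <= max_replicas")
    per_replica = HOURLY_CREDITS[configuration]
    # Round to 3 decimals to match the precision used in the table.
    lowest = round(per_replica * min_replicas, 3)
    highest = round(per_replica * max_replicas, 3)
    return lowest, highest


if __name__ == "__main__":
    # 1 to 3 replicas of "A10 single", as in the example above.
    print(hourly_credit_range("A10 single", 1, 3))  # (2.118, 6.354)
```

Under these assumptions, two replicas of "T4 cluster of 4" would come to 2 × 6.168 = 12.336 credits per hour, since each replica is a full cluster.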

Note

Azure and Google Cloud Platform (GCP) support is not yet available.