pricing

pay for what you use

start free with gpu access. scale with serverless, dedicated, or both.

free

$0 forever

experiment with models and ship side projects.

  • 50 gpu-hours / month
  • 500K serverless invocations
  • 5 GB storage
  • community support
  • 3 services max
  • shared inference endpoints

pro

$79/mo

for teams shipping ai products to production.

  • 500 gpu-hours / month
  • unlimited invocations
  • 100 GB storage
  • priority support
  • unlimited services
  • dedicated or serverless compute
  • custom domains
  • team seats included

enterprise

custom/org

reserved capacity, compliance, and dedicated support.

  • everything in pro
  • reserved gpu capacity (h100, a100)
  • 99.99% uptime sla
  • dedicated solutions architect
  • sso / saml / scim
  • vpc peering and private networking
  • on-premise and hybrid options
  • custom model hosting agreements

faq

frequently asked questions

what is a gpu-hour?

one hour of compute on a single gpu. if you run inference on an h100 for 30 minutes, that is 0.5 gpu-hours.
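the arithmetic generalizes to multiple gpus: a minimal sketch (the `gpu_hours` helper is illustrative, not part of the platform's api):

```python
def gpu_hours(num_gpus: int, hours: float) -> float:
    """gpu-hours = number of gpus x wall-clock hours of compute."""
    return num_gpus * hours

# one h100 for 30 minutes -> 0.5 gpu-hours
print(gpu_hours(1, 0.5))  # 0.5

# four gpus for 2 hours -> 8 gpu-hours
print(gpu_hours(4, 2))  # 8
```

so a month of the free tier (50 gpu-hours) covers, for example, 100 half-hour single-gpu inference runs.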

can i mix serverless and dedicated?

yes. you can set the compute mode per-service. run your api serverless and your model inference on dedicated gpus in the same project.
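a hypothetical project layout showing the idea (service names, keys, and values are illustrative, not the platform's actual config schema):

```python
# hypothetical per-service compute config; the schema here is
# illustrative only, not the platform's real configuration format.
project = {
    "name": "my-ai-app",
    "services": [
        {"name": "api", "compute": "serverless"},
        {"name": "inference", "compute": "dedicated", "gpu": "a100"},
    ],
}

# each service in the same project picks its own compute mode
modes = {s["name"]: s["compute"] for s in project["services"]}
print(modes)  # {'api': 'serverless', 'inference': 'dedicated'}
```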

do you offer startup credits?

qualifying startups can receive up to $100K in credits. contact us for details.

what happens if i exceed my free tier?

we will notify you before any overage. you can upgrade to pro or set hard spending limits. we never charge without your consent.