Skip to main content

Pricing Overview

PureAI uses a credit-based billing system. Purchase credits in USD and use them to deploy and run models on GPU instances.

How It Works

  1. Purchase Credits: Add credits to your account via Stripe
  2. Deploy Models: Select a model and instance type
  3. Pay Per Hour: Credits are charged per hour of active deployment
  4. Monitor Usage: Track costs in the dashboard

Pricing Model

ComponentPricing
Credits1 credit = $1 USD
DeploymentsPer-hour based on instance type
API CallsIncluded in deployment cost

Credit Purchase Options

AmountCredits
$5500 credits
$101,000 credits
$202,000 credits
$505,000 credits
$10010,000 credits
CustomMinimum $1

Cost Example

Deploying LLaMA 3.1 8B on GPU XS:
ComponentCost
Instance~$0.20/hour
8 hours of usage~$1.60
24 hours of usage~$4.80

Instance Tiers

TierStarting PriceModels
XS~$0.20/h7B - 13B
S~$0.60/h13B - 34B
M~$1.80/h30B - 70B INT4
L~$3.50/h70B FP16
XL~$12.00/h70B - 180B
XXL~$20.00/h405B
Prices are spot prices and may vary based on availability. On-demand pricing is higher.

Cost Management

  • Auto-pause: Deployments can be paused when not in use
  • Alerts: Set up notifications for usage thresholds
  • Dashboard: Real-time cost tracking and projections

Next Steps