
Comprehensive Service.
Best-in-class cost efficiency.

Inference
We offer the highest performance-per-cost Llama inference cloud available.
Llama 3 Models
Model       Price per 1M tokens
Up to 3B    $0.06
8B          $0.20
11B         $0.20
70B         $0.90
90B         $1.20
405B        $3.50
DeepSeek Models
Model                            Price per 1M tokens
DeepSeek-V3                      $1.25
DeepSeek-R1                      $3.00 input / $7.00 output
DeepSeek-R1-Distill-Llama-70B    $2.00
DeepSeek-R1-Distill-Qwen-14B     $1.60
DeepSeek-R1-Distill-Qwen-1.5B    $0.18
DeepSeek LLM Chat 67B            $0.90
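To turn the per-token prices above into a budget figure, a rough estimate can be sketched as follows. This is a minimal illustration, not an official calculator: the `PRICES` table and the `estimate_cost` helper are hypothetical names, and for models with a single listed price the sketch assumes that price applies to both input and output tokens, which the price list does not state.

```python
# Prices in USD per 1M tokens, copied from the price lists above.
# Each entry is (input price, output price). For models with one listed
# price we ASSUME it applies to both directions; the page does not say.
PRICES = {
    "Llama 3 8B": (0.20, 0.20),
    "Llama 3 70B": (0.90, 0.90),
    "DeepSeek-V3": (1.25, 1.25),
    "DeepSeek-R1": (3.00, 7.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one workload (hypothetical helper)."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


# Example: 2M input tokens and 500K output tokens on DeepSeek-R1.
cost = estimate_cost("DeepSeek-R1", 2_000_000, 500_000)
print(f"${cost:.2f}")  # prints $9.50
```

Actual invoices may differ, for example through minimum charges or rounding, so treat the result as an estimate only.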
VM
Virtual Machine (Per GPU, Per Hour Pricing)
GPU Type    6 months    1 year    3+ years
B200        $6.00       $5.00     $4.00
H200        $4.25       $3.55     $2.60
H100        $3.75       $3.00     $2.25
A100        $2.25       $1.50     $1.00
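The hourly rates above can be converted into a total commitment figure with a short calculation. The sketch below makes several assumptions that the price list does not confirm: rates are per GPU, billing runs around the clock for the full term, a month averages 730 hours, and the "3+ years" column is applied at its 36-month minimum. `VM_RATES` and `vm_commitment_cost` are hypothetical names used only for illustration.

```python
# ASSUMPTION: an average month has 730 hours, billed around the clock.
HOURS_PER_MONTH = 730

# USD per hour by commitment length in months (6, 12, 36).
# ASSUMPTION: rates are per GPU, and 36 stands in for the "3+ years" column.
VM_RATES = {
    "B200": {6: 6.00, 12: 5.00, 36: 4.00},
    "H200": {6: 4.25, 12: 3.55, 36: 2.60},
    "H100": {6: 3.75, 12: 3.00, 36: 2.25},
    "A100": {6: 2.25, 12: 1.50, 36: 1.00},
}


def vm_commitment_cost(gpu: str, term_months: int, gpu_count: int = 1) -> float:
    """Return the estimated total USD cost over the whole term."""
    rate = VM_RATES[gpu][term_months]
    return rate * gpu_count * HOURS_PER_MONTH * term_months


# Example: eight H100 GPUs on a 1-year commitment.
total = vm_commitment_cost("H100", 12, gpu_count=8)
print(f"${total:,.0f}")  # prints $210,240
```

A tailored quote may use different billing terms, so this figure is only a planning estimate.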
Data Services & AI Model/Software
BlueSky’s data services and AI model/software services are highly tailored and complex, so no two projects, or their pricing, are alike. Bundling our CSP infrastructure with these services improves cost efficiency and speeds up delivery.
Already using another provider?
BlueSky works with your existing cloud or on-prem infrastructure; we never seek to reinvent the wheel. If you want to migrate to BlueSky, we’ll pay your egress fees and offer free ingress, removing barriers to progress.
Get a high-level, tailored quote this week.
Prefer we reach out to you?
All other Chat, Language, Code, and Moderation Models
Model size      Price per 1M tokens
Up to 4B        $0.10
4.1B – 8B       $0.20
8.1B – 21B      $0.30
21.1B – 41B     $0.80
41.1B – 80B     $0.90
80.1B – 110B    $1.80
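Because these prices are tiered by parameter count, finding the rate for a given model is a simple range lookup. The sketch below shows one way to do it with a binary search over the tier upper bounds. It assumes each upper bound is inclusive and that sizes falling between listed tiers (for example 4.05B) belong to the next tier up; `price_per_million` is a hypothetical helper name.

```python
from bisect import bisect_left

# Upper bound of each tier in billions of parameters, with the matching
# price in USD per 1M tokens, copied from the tiers above.
# ASSUMPTION: upper bounds are inclusive (an 8B model pays the 4.1B-8B rate).
UPPER_BOUNDS_B = [4, 8, 21, 41, 80, 110]
PRICES_USD = [0.10, 0.20, 0.30, 0.80, 0.90, 1.80]


def price_per_million(params_b: float) -> float:
    """Return the listed price for a model with params_b billion parameters."""
    if params_b <= 0:
        raise ValueError("parameter count must be positive")
    # Index of the first tier whose upper bound is >= params_b.
    tier = bisect_left(UPPER_BOUNDS_B, params_b)
    if tier == len(UPPER_BOUNDS_B):
        raise ValueError("no listed price above 110B parameters")
    return PRICES_USD[tier]


print(price_per_million(13))  # prints 0.3
```

Models larger than 110B have no listed tier here, so the sketch raises an error rather than guessing a price.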
Baremetal
Baremetal & Managed On-Prem Solutions (Per GPU, Per Hour Pricing)
GPU Type    6 months    1 year    3+ years
B200        $5.00       $4.25     $3.50
H200        $3.50       $2.75     $2.00
H100        $2.75       $2.00     $1.50
A100        $1.50       $1.00     $0.70