Comprehensive Service.

Best-in-Class cost efficiency.

Inference email us

We have the highest performance /cost llama inference cloud in existence

Llama 3 Models

Model Price/1M tokens

Up to 3B $0.06

8B $0.20

11B $0.20

70B $0.90

90B $1.20

405B $3.50

DeepSeek Models

Model Price/1M tokens

DeepSeek-V3 $1.25

DeepSeek-R1 $3 input / $7 output

DeepSeek-R1-Distill-Llama-70B $2.00

DeepSeek-R1-Distill-Qwen-14B $1.60

DeepSeek-R1_distill-Qwen-1.5B $0.18

DeepSeek LLM Chat 67B $0.90

VM email us

Virtual Machine (Per, Per Hour Pricing)

GPU Type 6 Mo 1 year 3+ year

B200 $6.00 $5.00 $4.00

H200 $4.25 $3.55 $2.60

H100 $3.75 $3.00 $2.25

A100 $2.25 $1.50 $1.00

Data Services & AI Model/Software

BlueSky’s data services and AI model/Software services are incredibly tailored and highly complex, meaning that no two projects or pricing are alike. Bundling our CSP infrastructure with these services enables an unparalleled cost efficiency and faster delivery:
helping you achieve faster

Already using another provider?

BlueSky plays well with existing cloud or on prem infrastructure you have - we never seek to re-invent the wheel. If you want to migrate to BlueSky, we’ll pay you egress fees for you, and offer you free ingress - removing barriers to progress.

Abstract digital background with binary code, grid lines, and blue light streams.

Get a high-level, tailored quote this week.

Prefer we reach out to you?

All other Chat, Language, Code, and Moderation Models

Model Price/1M tokens

Up to 4B $0.10

4.1B – 8B $0.20

8.1B – 21B $0.30

21.1B – 41B $0.80

41.1B – 80B $0.90

80.1B – 110B $1.80

Baremetal email us

Baremetal & Managed On-Prem Solutions (Per, Per Hour Pricing)

GPU Type 6 Mo 1 year 3+ year

B200 $5.00 $4.25 $3.50

H200 $3.50 $2.75 $2.00

H100 $2.75 $2.00 $1.50

A100 $1.50 $1.00 $0.70