Pricing

Prices do not include taxes by default. If you are a business registered in EU or an individual, please contact us so we can apply the correct VAT to your subscription.

Pay-As-You-Go

Use all the pre-trained models. You are invoiced after the fact, based on usage.

• No fixed cost: pay only if you consume

• Automatically get a $15 FREE credit

• All our pre-trained models are available

• Asynchronous mode included

• Monitor your usage in your dashboard

• Parallel requests: 10 (can be increased)

On CPU: $0.003 per request

On GPU: $0.005 per request

ChatDolphin/Yi-34B/Mixtral-8x7B: + $0.0005 per 1K tokens

LLaMA 3.1 405B and Fine-tuned LLaMA 3 70B: + $0.0018 per 1K tokens

Whisper: + $0.0001 per second (duration of your audio or video file)

Speech T5: + $0.0006 per 1K tokens