Pricing

Start building AI apps today
scale effortlessly

Open source. Free to use. Pay only for premium features.

HOBBY
For developers building their impressive LLM-powered side projects & apps
Free

Unlimited AI agents runs

Connect your LLMs providers

Add tools to agents

Real-time streaming

LLM output validation

Full type-safety

Smart error recovery

PRO
For developers & small teams that need voice responses, long-term memory, or agents with custom knowledge
$25
/month
Everything in the free plan, plus:

Serverless managed long-term memory

Expand agents knowledge from files & websites

Real-time voice responses

Higher rate limits

Faster audio files processing

SCALE
For bigger teams that need higher rate limits, requests, volumes, and memory storage
$70
/month
Everything in the Basic plan, plus:

Higher memory storage & requests

Higher knowledge monthly requests

More voice responses characters

Even higher rate limits (almost none)

Direct support and assistant

Special Offer

We'll build your full AI feature in a week, you name it, we build it!
You get 3 months of the scale plan, plus:

Build any AI feature you want with Niddam-Hub

We handle both server-side & client-side development

UI/UX is on us, based on your product design

Need full app? We got you too!

LIMITED DEAL
$750
Contact us
Limited to only 10 clients, get your spot now!
Let's break it down

Hobby - Free

Get started with:
Unlimited agents runs
Multimodal inputs (slow audio processing)
Real-time streaming
Auto errors recovery
LLM output validation

Basic - $25/month

Everything in Hobby, plus:
AI agents voice responses
Persistent conversations memory
Expand LLMs knowledge
Fast audio inputs processing
Email support

Scale - $70/month

Everything in Basic, plus:
Faster voice responses generation
Higher memory stores storage
More knowledge requests
Higher rate limits for everything
Direct support
Memory
Chat sessions with users (messages history or memory)
Free:
In-memory store (not persistent)
Basic:
Serverless managed memory (1 region)
Scale:
Serverless managed memory (replicated to 4 regions)
History store reads
Each time you read a session info or messages from the serverless store (4Kb = 1 operation)
Free:
---
Basic:
1M / month
Scale:
4M / month
History store writes
Each time you new session is created or an agent is executed with history enabled to save the chat messages (1Kb = 1 operation)
Free:
---
Basic:
500K / month
Scale:
2M / month
Speech Characters
When using the agent speech (agent audio response)
Free:
50 / month
Basic:
100K / month
Scale:
1.2M / month
Knowledge requests
If using custom knowledge, with each run we feth info relevant data from a vector database using RAG (this is cached and cache doesn't count to usage)
Free:
---
Basic:
300K / month
Scale:
1.5M / month
Image inputs
Supported only with LLMs that support vision
Free:
Unlimited
Basic:
Unlimited
Scale:
Unlimited
Audio inputs
You can input audio into your agent runs. fast is 2s average latency & slow is ~10s. If limit for fast processing is reached it falls back to slow mode till next month
Free:
Unlimited slow
Basic:
500mins fast & unlimited slow
Scale:
1000mins fast & unlimited slow