Pricing that scales with your workload
Choose on‑demand, usage-based pricing for flexibility, or provisioned throughput for guaranteed performance with monthly commitments.
Plans & Pricing
Compare features
Platform management | On-demand | Provisioned throughput |
Number of users | Unlimited | Unlimited |
Number of workspaces | Unlimited | Unlimited |
Number of agents | Unlimited | Unlimited |
Number of datastores | Unlimited | Unlimited |
Number of workspaces | 1 | Unlimited |
User roles and admin permissions | ||
Document access entitlements | ||
SOC2 Type II compliance | ||
HIPAA compliance | ||
SAML / SSO | – | |
Role-based access control (RBAC) | – | |
Usage analytics | – | |
Pipeline observability | – | |
Uptime SLA | – | |
Data ingestion | On-demand | Provisioned throughput |
Support for simple text documents | ||
Support for complex docs, charts, and images | ||
Support for unstructured data | ||
Support for structured data | Contact sales | Contact sales |
UI-based ingestion | ||
Continuous data ingestion | Contact sales | Contact sales |
Data integrations | Maximum of 1 | |
Standard data retention | ||
Custom data retention | – | Contact sales |
Contextual AI agents | On-demand | Provisioned throughput |
Query optimization: reformulates and decomposes the query for better retrieval | ||
Retrieve: gets relevant docs from the knowledge base | ||
Rerank: reorders retrieved docs by relevance | ||
Filtering: selects the top-k most relevant docs | ||
Generate: produces a response using the selected docs | ||
Groundedness & safety: evaluates whether the generated response is supported by the retrieved docs | ||
Tokens per second (TPS) throughput commitment | – | |
Deployment | On-demand | Provisioned throughput |
Contextual SaaS | ||
Customer VPC | Contact sales | Contact sales |
Components
Component APIs pricing overview
For AI teams needing more flexibility to work with their existing RAG architecture, Contextual AI provides powerful platform primitives as component APIs with usage-based pricing.
Parse
Our multi-stage document understanding pipeline for converting unstructured content into AI-ready formats
Price
- Basic (text only): $3 / 1,000 pages
- Standard (multimodal): $40 / 1,000 pages
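As a rough illustration of how the listed Parse rates translate into a bill, the sketch below estimates cost from a page count. It assumes billing is strictly linear per page, with no rounding up to whole 1,000-page blocks, no minimum charge, and no volume discounts; none of that is confirmed by the price list. The names `PARSE_RATE_PER_1K_PAGES` and `parse_cost` are hypothetical and are not part of any Contextual AI SDK.

```python
# Listed Parse rates in USD per 1,000 pages (taken from the price list above).
PARSE_RATE_PER_1K_PAGES = {"basic": 3.00, "standard": 40.00}


def parse_cost(pages: int, tier: str = "standard") -> float:
    """Estimate Parse cost in USD.

    Assumption: billing is linear per page (no block rounding, no minimums).
    """
    if pages < 0:
        raise ValueError("pages must be non-negative")
    if tier not in PARSE_RATE_PER_1K_PAGES:
        raise ValueError(f"unknown tier: {tier!r}")
    return pages / 1000 * PARSE_RATE_PER_1K_PAGES[tier]


# Hypothetical corpus of 2,500 pages, priced under each tier.
for tier in PARSE_RATE_PER_1K_PAGES:
    print(f"{tier}: ${parse_cost(2500, tier):,.2f}")
```

Under these assumptions, a 2,500-page corpus comes to $7.50 on the Basic tier and $100.00 on the Standard tier.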
Rerank
State-of-the-art instruction-following reranker, providing greater control over how retrieved knowledge is prioritized
Price
- Rerank-v2: $0.05 / 1M tokens
- Rerank-v2-mini: $0.02 / 1M tokens
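To compare the two reranker models on cost, the sketch below computes the estimated spend for a given token volume and the difference between them. How tokens are counted for a rerank call (for example, whether the query is counted once per document) is not stated here, so the function takes a total token count as input and leaves that accounting to the caller. The helper names are hypothetical.

```python
# Listed Rerank rates in USD per 1M tokens (taken from the price list above).
RERANK_RATE_PER_1M_TOKENS = {"rerank-v2": 0.05, "rerank-v2-mini": 0.02}


def rerank_cost(total_tokens: int, model: str) -> float:
    """Estimate Rerank cost in USD, assuming linear per-token billing."""
    return total_tokens / 1_000_000 * RERANK_RATE_PER_1M_TOKENS[model]


def mini_savings(total_tokens: int) -> float:
    """Estimated USD saved by using rerank-v2-mini instead of rerank-v2."""
    return (rerank_cost(total_tokens, "rerank-v2")
            - rerank_cost(total_tokens, "rerank-v2-mini"))
```

For a hypothetical 200 million tokens, this works out to roughly $10 on Rerank-v2 versus roughly $4 on Rerank-v2-mini. Price is only one input to the choice; the price list does not quantify any quality difference between the two models.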
Generate
The most grounded large language model in the world, engineered specifically to minimize hallucinations
Price
- Input: $3 / 1M tokens
- Output: $15 / 1M tokens
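Because input and output tokens are priced differently, a Generate estimate needs both counts. The sketch below projects spend for a hypothetical workload; the request volume and per-request token counts are made-up illustration values, and linear per-token billing is an assumption. `generate_cost` is a hypothetical helper, not an SDK function.

```python
# Listed Generate rates in USD per 1M tokens (taken from the price list above).
GENERATE_INPUT_RATE = 3.00
GENERATE_OUTPUT_RATE = 15.00


def generate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate Generate cost in USD, assuming linear per-token billing."""
    return (input_tokens * GENERATE_INPUT_RATE
            + output_tokens * GENERATE_OUTPUT_RATE) / 1_000_000


# Hypothetical workload: 50,000 requests, each with 4,000 input tokens
# (query plus retrieved context) and 300 output tokens.
requests = 50_000
total = generate_cost(requests * 4_000, requests * 300)
print(f"${total:,.2f}")  # prints $825.00
```

In this example the input side accounts for $600 of the $825 even though input tokens are five times cheaper, because retrieved context typically makes prompts much longer than responses.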
LMUnit
Our evaluation-optimized model for preference, direct scoring, and natural language unit test evaluation
Price
- Input: $3 / 1M tokens