Free plan

Free

For getting started
50 premium requests
500 standard requests
100 active agents
2 agent templates
1 GB of storage
Pro plan

Pro $20 / month

For shipping agents in production
500 premium requests
5,000 standard requests
10,000 active agents
20 agent templates
10 GB of storage
Scale plan

Scale $750 / month

For teams deploying agents at scale
5,000 premium requests
50,000 standard requests
10 million active agents
100 agent templates
100 GB of storage
Enterprise plan

Enterprise / contact us

For organizations with higher-volume needs, our Enterprise plan offers increased quotas, dedicated support, role-based access control (RBAC), SSO (SAML, OIDC), and private model deployment options. Contact our team to learn more.
Up to ∞ agents & storage
Custom model deployments
SAML/OIDC SSO authentication
Role-based access control
BYOC deployment options
What are standard / premium requests?

Your Letta agents use large language models (LLMs) to reason and take actions. These model requests are what we count toward your monthly requests quota. Standard models (GPT-4o mini, Gemini 2.5 Flash, etc.) are faster and more economical. They’re ideal for simple tool calling and basic chat interactions. Premium models (GPT-4.1, Claude Sonnet 4, etc.) offer enhanced capabilities for complex agentic tasks. They excel at multi-step tool sequences and tasks requiring advanced reasoning.

How are model requests counted?

Each agent “step” or “action” counts as one model request. Complex tasks (such as deep research) may require multiple requests to complete. You can control request usage via tool rules that force the agent to stop on certain conditions.
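
As a rough illustration of the one-request-per-step rule, the sketch below estimates monthly usage for a workload. The function name and the workload numbers are hypothetical and not part of any Letta API; actual step counts vary by task and by the tool rules you configure.

```python
def estimate_monthly_requests(tasks_per_month: int, steps_per_task: int) -> int:
    """Estimate model requests, counting one request per agent step."""
    if tasks_per_month < 0 or steps_per_task < 0:
        raise ValueError("inputs must be non-negative")
    return tasks_per_month * steps_per_task


if __name__ == "__main__":
    # Hypothetical workload: 200 research tasks a month, about 12 steps each.
    print(estimate_monthly_requests(200, 12))  # prints 2400
```

An estimate like this can be compared against the premium or standard quota of a plan, depending on which model tier the agent uses.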

How do plan limits work?

Plan limits apply to the model requests your agents make: each request to a large language model (LLM) counts toward your plan's monthly request quota, with separate quotas for premium and standard models. Request quotas refresh every month.
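
To make the quotas concrete, here is a minimal sketch that picks the smallest listed plan covering an expected usage. The quota figures are copied from the plan list on this page and are assumed to be monthly; the names `Quota`, `PLAN_QUOTAS`, and `smallest_plan` are hypothetical helpers, not part of Letta.

```python
from typing import NamedTuple


class Quota(NamedTuple):
    premium: int   # premium model requests (assumed per month)
    standard: int  # standard model requests (assumed per month)


# Figures taken from the plan list on this page, ordered smallest to largest.
PLAN_QUOTAS = {
    "Free": Quota(premium=50, standard=500),
    "Pro": Quota(premium=500, standard=5_000),
    "Scale": Quota(premium=5_000, standard=50_000),
}


def smallest_plan(premium_needed: int, standard_needed: int) -> str:
    """Return the first plan whose quotas cover both request counts."""
    for name, quota in PLAN_QUOTAS.items():
        if premium_needed <= quota.premium and standard_needed <= quota.standard:
            return name
    # Beyond the listed quotas, Enterprise (custom quotas) is the remaining option.
    return "Enterprise"


if __name__ == "__main__":
    print(smallest_plan(400, 3_000))  # prints Pro
```

Requests made with your own LLM API key are not counted toward the quota, so they can be left out of the inputs.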

Can I use my own LLM API key?

Yes, you can connect your own API keys for providers such as OpenAI, Anthropic, and Google Gemini. Model requests do not count towards your request quota if you bring your own LLM API key and select your custom provider in the ADE model dropdown.

What’s the difference between cloud and open source?

Letta Cloud is our fully-managed service for stateful agents. While Letta can be self-hosted, Letta Cloud eliminates all infrastructure management, server optimization, and system administration so you can focus entirely on building agents. Letta Cloud also includes advanced features for power users, such as agent templates, monitoring, and observability.

Can I transfer my agents between open source and cloud?

Yes, Letta Cloud supports the agent file format, which lets you move your agents freely between self-hosted instances of open source Letta and Letta Cloud.

Do you have any startup discounts?

Yes, we offer discounts for qualifying startups. Submit a request using this form.

Where can I ask more questions?

We have an active developer community on Discord. Join to chat with the Letta developer team and other Letta users!