Light
Dark
Free plan
Astronaut walking through rocky canyon landscape on planetary surface.

Free

For getting started
50 premium requests
500 standard requests
100 active agents
2 agent templates
1 GB of storage
Pro plan
Spacecraft with landed on planetary surface with astronaut figure nearby.

Pro $20 / month

For shipping agents in production
500 premium requests
5,000 standard requests
10,000 active agents
20 agent templates
10 GB of storage
Scale plan
Colony base with dome structures and buildings on planetary surface.

Scale $750 / month

For teams deploying agents at scale
5,000 premium requests
50,000 standard requests
10 million active agents
100 agent templates
100 GB of storage
Enterprise plan

Enterprise / contact us

For organizations with higher volume needs, our Enterprise plan offers increased quotas, dedicated support, role-based access control (RBAC), SSO (SAML, OIDC), and private model deployment options. Contact our team to learn more.
Up to ∞ agents & storage
Custom model deployments
SAML/OIDC SSO authentication
Role-based access control
BYOC deployment options
What are credits?

Credits are a standard cost unit for resources in Letta, such as LLM inference and CPU cycles. When agents run on Letta, they make LLM model requests and execute tools. Model requests consume credits at a rate depending on the model tier (standard vs. premium) and whether Max Mode is enabled for longer context sizes. Tool executions that run in Letta are charged at a flat rate per second of execution. See details on credit pricing here.

What tools are executed by Letta?

Sandbox code execution and execution of custom tools run inside of Letta, so incur a credit cost for CPU time. Remote MCP tools are executed by the MCP tool provider, so do not have a credit cost. Letta built-in tools are executed for free.

How do monthly credits work?

Your Letta agents use large language models (LLMs) to reason and take actions. These model requests consume credits from your monthly balance (or additional purchased credits). Your balance of monthly credits refreshes every month.

What is Max Mode?

Certain models have the ability to run with extended context windows. Turning on Max Mode extends the context window of the model driving your Letta agent beyond the 100k default, which may help when working with large files or codebases, but will increase cost (credit use) and latency.

What’s the difference between the Letta API and open source Letta?

The Letta API Platform is our fully-managed service for stateful agents, handling all agent infrastructure and state management to create scalable agent services. The Letta API Platform also has additional features beyond the open source such as durable execution for long-running agents, built-in sandboxing, agent templates, optimized vector search, message indexing, and observability.

Can I transfer my agents between open source and cloud?

Yes, the Letta API Platform supports agent file, which allows you to move your agents freely between self-hosted instances of the Letta open source and the Letta platform.

Do you have any startup discounts?

Yes, we offer discounts for qualifying startups. Submit a request using this form.

Where can I ask more questions?

Reach out to support@letta.com, or join our active community on Discord to chat with the Letta developer team and other Letta users!