Light
Dark
Free plan
Astronaut walking through rocky canyon landscape on planetary surface.

Free

For getting started
5,000 monthly credits
Access the Letta API
Chat and build in the ADE
Use agents locally with Letta Code
Pro plan
Spacecraft with landed on planetary surface with astronaut figure nearby.

Pro $20 / month

For building personal agents
20,000 monthly credits
Pay-as-you-go credit overage
BYOK support
10 GB of storage
Max plan
Colony base with dome structures and buildings on planetary surface.

Max $200 / month

For agent power users
Everything in Pro, plus:
Higher usage limits for power users
Optimized for Letta Code
Early access to new features
Enterprise plan

Enterprise / contact us

For organizations with higher volume needs, our Enterprise plan offers increased quotas, dedicated support, role-based access control (RBAC), SSO (SAML, OIDC).
Volume-based pricing
Increased quotas
Role-based access control
SAML/OIDC SSO authentication
Dedicated support
What are credits?

Credits are a standard cost unit for resources in Letta, such as LLM inference and CPU cycles. When agents run on Letta, they make LLM model requests and execute tools. Model requests consume credits at a rate depending on the model tier (standard vs. premium) and whether Max Mode is enabled for longer context sizes. Tool executions that run in the Letta Platform are charged at a flat rate per second of execution.

How is tool execution charged?

Server-side tools on the Letta API incur a credit cost for CPU time required to execute the tool. Remote MCP tools are executed by the MCP tool provider, so do not have a credit cost. Letta built-in tools are executed for free. Client-side tools (e.g. bash tools executed by Letta Code) are run on the client not the server, so they do not incur any credit cost for execution.

How do monthly credits work?

Your agents use large language models (LLMs) to reason and take actions. These model requests consume credits from your monthly balance (or additional purchased credits). Your balance of monthly credits refreshes every month.

What is Max Mode?

Certain models have the ability to run with extended context windows. Turning on Max Mode extends the context window of the model driving your Letta agent beyond 100k tokens, which may help when working with large files or codebases, but will increase cost (credit use) and latency.

How does the Max plan work?

The Letta Max plan is designed for agent power users that run token-intensive workloads, such as high-throughput agentic coding. The Max plan includes a generous request-based system with significantly higher usage limits than the Pro plan. Request quotas are reset multiple times per day.

Note that the Letta Max plan is for personal use only. Developers building applications on the Letta API for external end-users should use pay-as-you-go credits, in addition to included monthly credits in the Letta Pro plan.

What is Letta Code?

Letta Code is a memory-first coding harness, built on top of the Letta API. Instead of working in independent sessions, you work with a persisted agent that learns over time and is portable across models.

You can use the Letta Code CLI to run agents locally on your own computer, where they can access your local filesystem, and take other actions on your behalf. Learn more about Letta Code on our docs.

Do you have any startup discounts?

Yes, we offer discounts for qualifying startups. Submit a request using this form.

Where can I ask more questions?

Reach out to support@letta.com, or join our active community on Discord to chat with the Letta developer team and other Letta users!