Letta

About

Letta is an AI memory platform and operating system designed to solve the persistence problem inherent in standard large language models. While typical LLMs are stateless and lose information once a session concludes, Letta provides a structured framework for creating stateful agents that maintain context across interactions. Founded by the researchers behind MemGPT at UC Berkeley's Sky Computing Lab, the platform acts as a sophisticated middle layer that manages an agent's long-term memory, allowing it to evolve, learn from historical data, and retain a consistent personality or knowledge base over time.

The system operates on a model-agnostic framework, meaning developers can connect various LLMs while keeping the agent's underlying memory intact. A core component of the ecosystem is Letta Code, a memory-first coding harness that allows agents to interact with local filesystems and learn from previous development tasks. The platform also introduces Context Repositories, which utilize git-based versioning to manage programmatic context dynamically. Additionally, Letta incorporates research-backed concepts like sleep-time compute, which allows agents to refine their memories and precompute potential responses during idle periods, effectively mimicking human cognitive cycles.

This tool is primarily built for developers, AI engineers, and researchers moving beyond basic chatbots to build production-grade agents. It is particularly effective for creating digital humans, persistent coding assistants, and automated customer support systems where long-term relationship building and context retention are vital. Because it supports both local execution via a CLI and cloud-based management through a low-level API, it caters to solo hobbyists looking for personalized agents as well as enterprise teams requiring scalable, stateful AI infrastructure with high reliability.

What sets Letta apart from traditional RAG (Retrieval-Augmented Generation) systems is its treatment of memory as a dynamic, self-improving state rather than a static document store. It transitions AI from a transactional tool into a perpetual entity. With its strong open-source foundation and model-agnostic nature, Letta ensures that developers are not locked into a specific provider's ecosystem, allowing an agent's history and intelligence to persist even if the underlying model is upgraded or changed.
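To make the idea of model-independent, persistent memory concrete, here is a minimal Python sketch. It is not Letta's API: the `StatefulAgent` class, the JSON file store, the `consolidate` step (a very rough stand-in for idle-time refinement), and the two fake models are all hypothetical, chosen only to show memory surviving a new session and a model swap.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable

# Hypothetical stand-in for any LLM: takes a prompt, returns text.
Model = Callable[[str], str]


class StatefulAgent:
    """Toy agent whose memory lives on disk, independent of the model."""

    def __init__(self, memory_path: Path) -> None:
        self.memory_path = memory_path
        # Reload whatever earlier sessions stored, if anything.
        if memory_path.exists():
            self.memory = json.loads(memory_path.read_text())
        else:
            self.memory = []

    def _save(self) -> None:
        self.memory_path.write_text(json.dumps(self.memory))

    def remember(self, fact: str) -> None:
        self.memory.append(fact)
        self._save()

    def consolidate(self) -> None:
        """Idle-time pass: drop duplicate facts, keeping first-seen order."""
        self.memory = list(dict.fromkeys(self.memory))
        self._save()

    def ask(self, model: Model, question: str) -> str:
        # Memory is injected into the prompt, so any model can use it.
        context = "\n".join(self.memory)
        return model(f"Known facts:\n{context}\n\nQuestion: {question}")


def model_a(prompt: str) -> str:
    return f"[model-a] saw {prompt.count('prefers')} preference(s)"


def model_b(prompt: str) -> str:
    return f"[model-b] saw {prompt.count('prefers')} preference(s)"


store = Path(tempfile.mkdtemp()) / "agent_memory.json"

# Session one: the agent learns something while using model A.
session_one = StatefulAgent(store)
session_one.remember("User prefers Python")
session_one.remember("User prefers Python")  # accidental duplicate
session_one.consolidate()

# Session two: a fresh object and a different model, same memory file.
session_two = StatefulAgent(store)
reply = session_two.ask(model_b, "Which language should I use?")
print(reply)
```

The design point is that the memory file, not the model, is the durable part: swapping `model_a` for `model_b` changes who answers but not what the agent knows.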

Pros & Cons

Pros:

Supports persistent memory that survives session resets and model changes.

Backed by an active open-source community with over 21,000 GitHub stars.

Offers a free tier with Bring Your Own Key (BYOK) support for local execution.

Includes innovative sleep-time compute features for offline agent learning and refinement.

The model-agnostic approach prevents vendor lock-in for your agent's state and history.

Cons:

The Free plan is restricted to only three stateful agents at a time.

Server-side tool execution on the API incurs credit costs based on CPU usage.

The Max plan is strictly limited to personal use and not for building external applications.

Enterprise-grade security features like RBAC and SSO are missing from the lower tiers.

Use Cases

Software developers can use Letta Code to build persistent coding assistants that learn from their specific codebase over time.

AI researchers can utilize the framework to experiment with continual learning and memory management in a model-agnostic environment.

Customer support teams can deploy digital humans that maintain context and relationship history across multiple user interactions.

Solo creators can develop highly personalized agents that run locally on their filesystem with customized long-term memory.

Enterprise organizations can leverage the API and stateful agents to create internal knowledge assistants with secure SSO integration.

Platform

Web

Task

memory management

Features

• Model-agnostic architecture

• Low-level memory management API

• Local filesystem access

• Perpetual learning framework

• Letta Code CLI

• Sleep-time compute refinement

• Git-based context repositories

• Stateful agent memory

FAQs

How is tool execution charged on the Letta API?

Server-side tools executed on the Letta API incur credit costs based on the CPU time required for execution. However, remote MCP tools, built-in tools, and client-side tools run via Letta Code do not incur any credit costs.
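The charging rule above can be sketched as a small Python function. This is an illustration only: the `ToolKind` enum, the function names, and the linear per-CPU-second rate are assumptions, since the listing gives no actual rate or billing formula.

```python
from enum import Enum


class ToolKind(Enum):
    """Tool categories named in the FAQ answer (enum itself is hypothetical)."""
    SERVER_SIDE = "server-side"
    REMOTE_MCP = "remote MCP"
    BUILT_IN = "built-in"
    CLIENT_SIDE = "client-side (Letta Code)"


def incurs_credit_cost(kind: ToolKind) -> bool:
    # Per the FAQ, only server-side tools on the Letta API are charged.
    return kind is ToolKind.SERVER_SIDE


def estimated_credits(
    kind: ToolKind, cpu_seconds: float, credits_per_cpu_second: float
) -> float:
    """Estimate credits for one tool call.

    Assumes cost scales linearly with CPU time; the real rate and
    formula are not stated in the source, so the caller supplies a rate.
    """
    if not incurs_credit_cost(kind):
        return 0.0
    return cpu_seconds * credits_per_cpu_second


# Placeholder rate of 0.5 credits per CPU-second, purely for illustration.
print(estimated_credits(ToolKind.SERVER_SIDE, 10.0, 0.5))
print(estimated_credits(ToolKind.REMOTE_MCP, 10.0, 0.5))
```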

Can I use my own LLM with Letta?

Yes, Letta is model-agnostic and specifically allows users on the Free plan to connect their own LLM API keys through a Bring Your Own Key (BYOK) system.

What is the difference between the Pro and Max plans?

The Pro plan is for building personal agents with basic monthly credits, while the Max plan is designed for power users running token-intensive workloads like high-throughput coding with significantly higher request limits.

What exactly is Letta Code?

Letta Code is a memory-first coding harness built on the Letta API that allows agents to learn over time and access local filesystems via a CLI, making them portable across different models.

Does Letta offer any security features for organizations?

Yes, the Enterprise plan includes professional security and management features such as Role-Based Access Control (RBAC), SSO via SAML/OIDC, and dedicated support for high-volume needs.

Pricing Plans

Pro

USD 20.00 per month

• Unlimited stateful agents

• Access frontier models on Letta API

• $20 of monthly API credits

• Pay-as-you-go credit overage

Max

USD 200.00 per month

• Higher usage limits for power users

• Optimized for Letta Code

• Early access to new features

• Request quotas reset multiple times daily

Enterprise

Price not listed

• Volume-based pricing

• Increased quotas

• Role-based access control (RBAC)

• SAML/OIDC SSO authentication

• Dedicated support

Free

Free Plan

• 3 stateful agents

• Connect own LLM API keys (BYOK)

• Chat with agents in the ADE

• Run agents locally with Letta Code

Job Opportunities

Letta

Product Engineer, Agent Development Environment

Enable AI agents to remember every interaction and learn continuously with a memory-first operating system designed for developers building perpetual intelligence.

Engineering | Onsite | San Francisco, US | Full-time

Benefits:

Paid in-person work trial

Experience Requirements:

Strong frontend development skills (React)
Experience with fullstack development including API design
Proficiency with backend development (Next.js, Node, TypeScript)
Experience interacting with databases (Postgres, Redis)
Experience crafting user experiences

Other Requirements:

Excellent communication skills
Excitement about AI agents
Ability to work in-person 5 days a week in San Francisco
Interest in how agents and LLMs work under the hood

Responsibilities:

Build, design, and define how developers interface with the agent framework
Build visual interfaces for innovative agent functionality
Interface with customers and open source community
Develop application SDKs for full-stack agent apps
Develop Cloud Platform for deployment and monitoring

Letta

Software Engineer, Agent Platform

Engineering | Onsite | San Francisco, US | Full-time

Benefits:

Paid in-person work trial

Experience Requirements:

Strong proficiency with Python
Understanding of architecting services for security, reliability, and performance
Ability to build scalable backend services
Strong understanding of SQL databases (Postgres)
Familiarity with AI stack tooling (vLLM, Ollama, OpenAI APIs)

Other Requirements:

Ability to work in-person 5 days a week in San Francisco
Proficiency with TypeScript and React is a bonus

Responsibilities:

Develop Letta's OSS agents framework and cloud service
Design scalable and resilient backend services and APIs
Work with researchers to support bleeding-edge agent architectures
Define developer APIs for agents
Lead development of the company's OSS stack

Letta

Research Engineer / Scientist, Post-Training

Science | Onsite | San Francisco, US | Full-time

Benefits:

Paid in-person work trial

Experience Requirements:

Proficiency in Python and deep learning frameworks (PyTorch)
Expertise in post-training techniques (SFT, RL, reward models)
Proven track record of impactful research
Real-world impact beyond pure academic work

Other Requirements:

Ability to balance execution speed with empirical rigor
Ability to work in-person 5 days a week in San Francisco

Responsibilities:

Training models for better agentic tool-use
Designing mechanisms for continuous model weight updates
Designing and running experiments for data mixtures and models
Building infrastructure for generating synthetic data at scale
Building challenging evals for measuring agentic capabilities

Alternatives

Pieces

Automatically capture and recall every coding snippet, research link, and chat message with a local-first AI memory engine designed to boost developer flow.
