KL3M

About
KL3M is a family of language models built on clean, high-quality, and legally compliant training data. It avoids copyright issues, breach of contract, synthetic data from other LLMs, and toxic sources. KL3M models have demonstrated best-in-class perplexity on legal domain data and excellent performance on general material. Models are already in use for tasks like drafting invoices, contracts, and SEC filings. Users can continue training KL3M, fine-tune it for conversational AI, or use it for specific tasks.
Features
• high-quality content
• no copyright issues
• no toxic sources
• no LLM-generated synthetic data
• no breach of contract
• clean provenance
• fairly trained
• can be used to continue training on your own content library, fine-tune for safe conversational AI, or fine-tune for specific tasks
FAQs
What kind of hardware do I need to run KL3M?
The first KL3M models have been designed with accessible use as a priority. kl3m-170m runs quickly on a MacBook Air M1, and kl3m-1.7b runs well on a $300 consumer GPU.
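As a rough way to see why these models fit on modest hardware, the sketch below estimates the memory needed just to hold the weights. It assumes the sizes in the model names are parameter counts (170M and 1.7B) and uses standard bytes-per-parameter figures for common numeric formats; it ignores activations, the KV cache, and framework overhead, so real usage will be higher.

```python
# Rough weight-memory estimate. Assumption: parameter counts are inferred
# from the model names (170M and 1.7B); they are not stated figures.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}

# Hypothetical lookup table built from the model names.
MODEL_PARAMS = {"kl3m-170m": 170e6, "kl3m-1.7b": 1.7e9}


def weight_memory_gb(num_params: float, dtype: str = "float16") -> float:
    """Return the memory needed to store the weights alone, in gigabytes (1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


for name, num_params in MODEL_PARAMS.items():
    for dtype in BYTES_PER_PARAM:
        print(f"{name} in {dtype}: {weight_memory_gb(num_params, dtype):.2f} GB")
```

Under these assumptions, the 1.7B model needs about 3.4 GB for half-precision weights, which is within the memory of many consumer GPUs.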
What architectures are your models?
Smaller KL3M models are trained using the GPT-NeoX architecture. Larger KL3M models are trained using the Mixtral Mixture-of-Experts architecture (trained from scratch).
How can I run KL3M?
KL3M is distributed as standard PyTorch model weights. KL3M architectures are supported by both Hugging Face Transformers and vLLM for inference.
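A minimal sketch of what inference through Hugging Face Transformers might look like is shown here. The function name `generate_text` is illustrative, and `model_id` is deliberately left as a required argument because this page does not give a repository name or local path for the checkpoints; supply whichever one you have. The sketch assumes `transformers` and `torch` are installed.

```python
def generate_text(model_id: str, prompt: str, max_new_tokens: int = 64) -> str:
    """Load a causal language model with Hugging Face Transformers and complete a prompt.

    model_id is a Hub repository name or a local directory; it is an
    assumption supplied by the caller, not a value taken from this page.
    """
    # Imported lazily so that defining the function does not require the library.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Tokenize the prompt into PyTorch tensors and generate a continuation.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Decode the first (and only) sequence back to text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call would look like `generate_text("<your-kl3m-checkpoint>", "This Agreement is made")`, where the first argument is a placeholder for the checkpoint you are using.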
Which languages are supported?
`kl3m-170m` and `kl3m-1.7b` have both been trained on predominantly English-language content. Larger models include content in English, Spanish (es-ES and es-MX), French, and German. We are working on adding more languages.
Is it easy to fine-tune KL3M?
We have had excellent results fine-tuning KL3M on a number of use cases, including drafting, summarization, and classification. You can fine-tune kl3m-170m and kl3m-1.7b on consumer hardware.
How many tokens do you have?
We have collected over 2.5 trillion tokens of training data, and we are constantly adding more. Our training data is a mix of public domain and explicitly licensed content.
How many tokens have your models seen?
`kl3m-170m` and `kl3m-1.7b` have been trained on approximately 350B tokens of primarily English-language content. Larger models are being trained on between 500B and 1T tokens of content in English, Spanish, French, and German.
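To put the 350B figure in perspective, the short calculation below divides it by each model's size to get tokens seen per parameter. It assumes the sizes in the model names are parameter counts (170M and 1.7B) and that both models saw the full 350B tokens; the result is an illustration only.

```python
# Tokens seen per parameter. Assumptions: parameter counts are inferred from
# the model names, and each model saw the full ~350B training tokens.
TRAINING_TOKENS = 350e9
MODEL_PARAMS = {"kl3m-170m": 170e6, "kl3m-1.7b": 1.7e9}


def tokens_per_parameter(tokens: float, num_params: float) -> float:
    """Return how many training tokens were seen for each model parameter."""
    return tokens / num_params


for name, num_params in MODEL_PARAMS.items():
    ratio = tokens_per_parameter(TRAINING_TOKENS, num_params)
    print(f"{name}: about {ratio:,.0f} tokens per parameter")
```

Under these assumptions the smaller model saw roughly 2,059 tokens per parameter and the larger one roughly 206.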
How do you pronounce KL3M?
As the 🍊 suggests, KL3M is pronounced like "Clem" or "Klem."
Why is it named KL3M?
KL3M was originally short for the Kelvin Legal Large Language Model, KLLLM. Because we're nerds, we shortened all those Ls to L cubed or L3, then shortened K-L3-M to KL3M.
Alternatives

RamenLegal
RamenLegal is an AI suite for legal documentation with over 100 customizable templates. It offers AI-powered legal drafting, simplifying complex legal research and enabling users to evaluate and review legal documents efficiently.
Spellbook
Spellbook is an AI-powered tool for commercial lawyers that helps draft and review contracts faster within Word. It provides features for redlining, drafting, answering complex questions, and benchmarking contracts.
Brackets AI
Brackets uses generative AI to review contracts, suggest changes, and draft contract terms directly in Microsoft Word.
AI.Law
AI.Law is an AI-driven legal platform streamlining legal drafting and document analysis for lawyers and corporations, enhancing efficiency and accuracy.
DocDraft
DocDraft is an AI-powered legal tool that automates the creation of contracts, agreements, and legal documents. It offers expert review and instant legal support, making legal matters easier for individuals and businesses.