Rubra

Click to visit website
About
Rubra is a collection of open-weight, tool-calling LLMs designed to enhance top open-source large language models. It adds deterministic tool-calling capability, making these models ideal for agentic use cases. Rubra models are post-trained and use methods to teach new skills while mitigating catastrophic forgetting. They extend popular inferencing projects like llama.cpp and vLLM for easy local use, providing OpenAI-compatible tool-calling. Rubra excels in complex, multi-step function calls, outperforming original models and other fine-tunes in this area. The models are trained on a high-quality dataset of over 1 million conversations and tool calls, with thousands of A100/H100 GPU hours. Rubra models are published under the same license as their parent models, while Rubra code is Apache 2.0 licensed.
Platform
Task
Features
• free demo available on huggingface spaces
• ideal for agentic use cases
• supports complex, multi-step function calls ('chain of function')
• extends popular inferencing projects (llama.cpp, vllm) for local use
• mitigates catastrophic forgetting during enhancement
• deterministic tool-calling capability
• enhances top open-source llms (meta-llama-3, gemma, mistral, phi-3, qwen2)
• open-weight, tool-calling llms
FAQs
Who are the intended users of Rubra?
Rubra models are for anyone looking to use open source LLMs with native function calling support, which yields superior results in local LLMs when compared to prompting models to return tool calls.
Why use Rubra models over Llama, Mistral, or other popular fine tunes like Hermes Pro or Gorilla OpenFunctions?
Rubra models are capable of complex, multi-step, function calls (chain of function) and enhance popular open source instruct LLMs while retaining their original capabilities.
Mistral-7B-Instruct-v0.3 has tool calling capability, so why use Rubra enhanced Mistral-7B-Instruct-v0.3
Rubra enhanced Mistral-7B-Instruct-v0.3 is capable of complex tool calling that the original model falls short of, demonstrating superior multi-step function chaining.
vLLM has tool calling capability, so why use Rubra?
Rubra enhanced models and custom vLLM give the LLM full discretion on when to make a tool call or reply with an assistant message, unlike vLLM which expects tool calls.
How were Rubra models trained?
Models were trained on a high-quality tool calling dataset (1M+ conversations) using thousands of A100/H100 GPU hours, with iterative training techniques for fast convergence.
Why do the benchmark results differ from the ones found in parent model cards?
Benchmark results differ due to constant improvements in evaluation tools (LM Evaluation Harness, FastChat LLM Judge) and updates to judging models, but Rubra does not game benchmarks.
How did you construct the training set for Rubra models?
The training set was constructed using the 'chain of function' concept, focusing on multi-step, consecutive function calls to achieve complex goals, exemplified by customer feedback processing.
Pricing Plans
Free
Free Plan• Open-weight, tool-calling LLMs
• Access to enhanced models
• Demo via Huggingface Spaces
• Local deployment with extended inferencing tools
Job Opportunities
There are currently no job postings for this AI tool.
Ratings & Reviews
No ratings available yet. Be the first to rate this tool!
Featured Tools
adly.news
Connect with engaged niche audiences or monetize your subscriber base through an automated marketplace featuring verified metrics and secure Stripe payments.
View DetailsReztune
Land more interviews by instantly tailoring your resume to any job description using AI-driven keyword optimization and professional, ATS-friendly templates.
View DetailsImage to Image AI
Transform photos and videos using advanced AI models for face swapping, restoration, and style transfer. Perfect for creators needing fast, professional visuals.
View DetailsNano Banana
Edit and enhance photos using natural language prompts while maintaining character consistency and scene structure for professional marketing and digital art.
View DetailsNana Banana Pro
Maintain perfect character consistency across diverse scenes and styles with advanced AI-powered image editing for creators, marketers, and storytellers.
View DetailsKling 4.0
Transform text and images into cinematic 1080p videos with multi-shot storytelling, character consistency, and native lip-synced audio for professional creators.
View DetailsAI Seedance
Generate 15-second cinematic 2K videos with physics-based audio and multi-shot narratives from text or images. Ideal for creators and marketing teams.
View DetailsMistrezz.AI
Engage in immersive NSFW roleplay and ASMR voice sessions with adaptive AI companions designed for structured escalation, fantasy scenarios, and personal connection.
View DetailsSeedance 3.0
Transform text prompts or static images into professional 1080p cinematic videos. Perfect for creators and marketers seeking high-quality, physics-aware AI motion.
View DetailsSeedance 3.0
Transform text descriptions into cinematic 4K videos instantly with ByteDance's advanced AI, offering professional-grade visuals for creators and marketing teams.
View DetailsSeedance 2.0
Generate broadcast-quality 4K videos from simple text prompts with precise text rendering, high-fidelity visuals, and batch processing for content creators.
View DetailsBeatViz
Create professional, rhythm-synced music videos instantly with AI-powered visual generation, ideal for independent artists, social media creators, and marketers.
View Details