Question 1

Who are the intended users of Rubra?

Accepted Answer

Rubra models are for anyone looking to use open source LLMs with native function calling support, which yields superior results in local LLMs when compared to prompting models to return tool calls.

Question 2

Why use Rubra models over Llama, Mistral, or other popular fine tunes like Hermes Pro or Gorilla OpenFunctions?

Accepted Answer

Rubra models are capable of complex, multi-step, function calls (chain of function) and enhance popular open source instruct LLMs while retaining their original capabilities.

Question 3

Mistral-7B-Instruct-v0.3 has tool calling capability, so why use Rubra enhanced Mistral-7B-Instruct-v0.3

Accepted Answer

Rubra enhanced Mistral-7B-Instruct-v0.3 is capable of complex tool calling that the original model falls short of, demonstrating superior multi-step function chaining.

Question 4

vLLM has tool calling capability, so why use Rubra?

Accepted Answer

Rubra enhanced models and custom vLLM give the LLM full discretion on when to make a tool call or reply with an assistant message, unlike vLLM which expects tool calls.

Question 5

How were Rubra models trained?

Accepted Answer

Models were trained on a high-quality tool calling dataset (1M+ conversations) using thousands of A100/H100 GPU hours, with iterative training techniques for fast convergence.

Question 6

Why do the benchmark results differ from the ones found in parent model cards?

Accepted Answer

Benchmark results differ due to constant improvements in evaluation tools (LM Evaluation Harness, FastChat LLM Judge) and updates to judging models, but Rubra does not game benchmarks.

Question 7

How did you construct the training set for Rubra models?

Accepted Answer

The training set was constructed using the 'chain of function' concept, focusing on multi-step, consecutive function calls to achieve complex goals, exemplified by customer feedback processing.

Rubra

Click to visit website

About

Platform

Task

Features

FAQs

Who are the intended users of Rubra?

Why use Rubra models over Llama, Mistral, or other popular fine tunes like Hermes Pro or Gorilla OpenFunctions?

Mistral-7B-Instruct-v0.3 has tool calling capability, so why use Rubra enhanced Mistral-7B-Instruct-v0.3

vLLM has tool calling capability, so why use Rubra?

How were Rubra models trained?

Why do the benchmark results differ from the ones found in parent model cards?

How did you construct the training set for Rubra models?

Pricing Plans

Free

Job Opportunities

Social Media

Ratings & Reviews

Featured Tools

adly.news

Atoms

Atomic Mail

Rekap

Sketch To

Seedance 4.0

Latest AI News