Qualspec

LLM-judged qualitative testing for Ruby. Evaluate AI agents, compare models, and test subjective qualities that traditional assertions can't capture.

Installation

gem "qualspec"

Configuration

Set your API key (required):

export QUALSPEC_API_KEY=your_openrouter_key

Environment Variables

Variable	Description	Default
`QUALSPEC_API_KEY`	API key (required)	-
`QUALSPEC_API_URL`	API endpoint	`https://openrouter.ai/api/v1`
`QUALSPEC_MODEL`	Default model for candidates	`google/gemini-3-flash-preview`
`QUALSPEC_JUDGE_MODEL`	Model used as judge	Same as `QUALSPEC_MODEL`

Quick Start

Compare Models (CLI)

# eval/comparison.rb
Qualspec.evaluation "Model Comparison" do
  candidates do
    candidate "gpt4", model: "openai/gpt-4"
    candidate "claude", model: "anthropic/claude-3-sonnet"
  end

  scenario "helpfulness" do
    prompt "How do I center a div in CSS?"
    eval "provides a working solution"
    eval "explains the approach"
  end
end

# Run comparison
qualspec eval/comparison.rb

# Generate HTML report
qualspec --html report.html eval/comparison.rb

Test Your Agent (RSpec)

require "qualspec/rspec"

RSpec.describe MyAgent do
  include Qualspec::RSpec::Helpers

  it "responds helpfully" do
    response = MyAgent.call("Hello")

    result = qualspec_evaluate(response, "responds in a friendly manner")
    expect(result).to be_passing
  end
end

Documentation

Getting Started
Evaluation Suites - CLI for model comparison
RSpec Integration - Testing your agents
Rubrics - Builtin and custom evaluation criteria
Configuration - All options
Recording - VCR integration

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
.qualspec_cassettes		.qualspec_cassettes
bin		bin
docs		docs
examples		examples
exe		exe
lib		lib
sig		sig
spec		spec
.DS_Store		.DS_Store
.gitignore		.gitignore
.rspec		.rspec
.rubocop.yml		.rubocop.yml
.rubocop_todo.yml		.rubocop_todo.yml
CHANGELOG.md		CHANGELOG.md
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
README.md		README.md
Rakefile		Rakefile
qualspec.gemspec		qualspec.gemspec
qualspec_structure.md		qualspec_structure.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Qualspec

Installation

Configuration

Environment Variables

Quick Start

Compare Models (CLI)

Test Your Agent (RSpec)

Documentation

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Qualspec

Installation

Configuration

Environment Variables

Quick Start

Compare Models (CLI)

Test Your Agent (RSpec)

Documentation

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages