[CLI] Add hf models card and hf datasets card commands by davanstrien · Pull Request #4118 · huggingface/huggingface_hub

davanstrien · 2026-04-16T12:37:45Z

Summary

Adds hf models card <model_id> and hf datasets card <dataset_id> commands that print the repo card (README) to stdout
Three output modes: full card (default), --metadata (just the YAML frontmatter as JSON), --text (just the markdown body)
--metadata and --text are mutually exclusive

Motivation

hf models info gives you structured Hub metadata (downloads, tags, pipeline_tag, siblings, etc.) but not the human-authored card content. The card text is where you find the stuff that info doesn't surface: usage examples with actual code, training details, known limitations, intended use cases, benchmark results with context, and architecture descriptions. Put simply — info tells you what a model is, card tells you how to use it and why.

info does include a card_data field, but it's the raw YAML string, not parsed. --metadata returns the same data as structured JSON via out.dict(), so it works with --format and is easy to pipe into jq or consume programmatically.

Agents and humans can already get card content via hf download <repo_id> README.md or curl, but that writes to a file and gives you the raw README with no way to split the YAML frontmatter from the prose. hf models card outputs directly to stdout and the --metadata/--text flags let you grab just the part you need.

For agents specifically, having a low-friction way to read model documentation helps reduce hallucination. Agents tend to default to recommending models they've memorised from training data (often outdated — e.g. still reaching for early Llama models), and fabricate usage details rather than checking the actual card. A single command that returns the real card content makes it easy for agents to look things up rather than guess. This is particularly valuable for newer models that post-date the agent's training cutoff.

For humans, it's a quick way to check a model's docs from the terminal without opening a browser — useful when comparing models or scripting.

Examples

# Full card to stdout
$ hf models card google/gemma-4-31B-it

# Just the card metadata (from the YAML frontmatter)
$ hf models card google/gemma-4-31B-it --metadata

# Card metadata as JSON
$ hf models card google/gemma-4-31B-it --metadata --format json
{"library_name": "transformers", "license": "apache-2.0", ...}

# Pretty-printed
$ hf models card google/gemma-4-31B-it --metadata --format human

# Just the text body (no YAML frontmatter)
$ hf models card google/gemma-4-31B-it --text

# Same for datasets
$ hf datasets card HuggingFaceFW/fineweb --metadata --format human

Design notes

--metadata not --yaml — We considered --yaml (the source format) and --frontmatter (the structural term) but went with --metadata because it describes what you're extracting rather than where it lives. It also pairs cleanly with --text — both flags describe the kind of content you want. And it avoids confusion with --format, which controls output format: --metadata --format json reads clearly as "give me the metadata, formatted as JSON".
No --revision support — RepoCard.load() doesn't currently pass revision to hf_hub_download. Could be added to RepoCard.load() in a follow-up and then wired through here.
--format is accepted even though the default and --text modes output free-form text (where --format json produces no output, same as hf papers read). We kept it because --metadata goes through out.dict() and genuinely benefits from it (e.g. --format human for pretty-printed JSON). This follows the majority CLI pattern — hf papers read is the only command that omits --format.

🤖 Generated with Claude Code

Note

Low Risk
Low risk: adds new read-only CLI subcommands that fetch and print repo card content, with minimal impact on existing command behavior.

Overview
Adds new hf models card and hf datasets card CLI subcommands to fetch a repo card (README) and print it to stdout, with --metadata (YAML frontmatter parsed to structured JSON via out.dict) or --text (markdown body only) modes and a mutual-exclusion check.

Updates the CLI docs/reference to document these new commands and adds CLI tests covering full/metadata/text outputs and invalid flag combinations.

^{Reviewed by Cursor Bugbot for commit 524fc2c. Bugbot is set up for automated code reviews on this repo. Configure here.}

Add commands to fetch model/dataset cards (README) from the Hub with three output modes: full card (default), --metadata (YAML frontmatter as JSON), and --text (markdown body only). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

bot-ci-comment · 2026-04-16T12:44:00Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

codecov · 2026-04-16T12:50:23Z

Codecov Report

❌ Patch coverage is 83.33333% with 5 lines in your changes missing coverage. Please review.
✅ Project coverage is 77.01%. Comparing base (1daa48b) to head (aba4878).
⚠️ Report is 272 commits behind head on main.

Files with missing lines	Patch %	Lines
src/huggingface_hub/cli/datasets.py	80.00%	3 Missing ⚠️
src/huggingface_hub/cli/models.py	86.66%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4118      +/-   ##
==========================================
+ Coverage   75.00%   77.01%   +2.00%     
==========================================
  Files         145      167      +22     
  Lines       13978    18948    +4970     
==========================================
+ Hits        10484    14592    +4108     
- Misses       3494     4356     +862

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Wauplin

Thanks for the addition. Could you also add hf spaces card? Can be useful to quickly check the Space metadata

Wauplin · 2026-04-23T09:33:43Z

        assert kwargs["sort"] == "downloads"


+class TestModelsCardCommand:


can you replace all tests with real world ones e.g.

def test_models_card_full(self, runner: CliRunner) -> None: result = runner.invoke(app, ["models", "card", "Qwen/Qwen3.6-35B-A3B"]) assert "library_name: transformers" in result.stdout assert "# Qwen3.6-35B-A3B" in result.stdout

?

no mocks, no need to check exit code, makes the whole test more readable IMO

Co-authored-by: Lucain <lucainp@gmail.com>

- Add `hf spaces card` command to complete the models/datasets/spaces trio - Replace mocked unit tests for models/datasets card with single live tests using @with_production_testing (Wauplin's preferred pattern) - Add live test for spaces card - Document hf spaces card in CLI guide - Regenerate package_reference/cli.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The previous examples used enzostvs/deepsite (consistent with hf spaces info) but that Space has no public README, so `hf spaces card enzostvs/deepsite` returns 404. Switch examples, docs, and the live test to mteb/leaderboard, which has a public card. Also tighten the dataset live-test assertion to check for the body heading rather than just "FineWeb" (which appears in both YAML and body). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

davanstrien · 2026-04-27T13:45:32Z

cc @Wauplin Updated tests and added support for Spaces cards.

Wauplin

Thank you!

huggingface-hub-bot · 2026-04-30T11:57:51Z

This PR has been shipped as part of the v1.13.0 release.

Wauplin reviewed Apr 23, 2026

View reviewed changes

davanstrien and others added 2 commits April 27, 2026 13:32

Update src/huggingface_hub/cli/models.py

ccd5c2d

Co-authored-by: Lucain <lucainp@gmail.com>

Update src/huggingface_hub/cli/datasets.py

524fc2c

Co-authored-by: Lucain <lucainp@gmail.com>

davanstrien marked this pull request as draft April 27, 2026 12:36

davanstrien and others added 3 commits April 27, 2026 14:22

Merge branch 'main' into feat/cli-card-command

7c8a200

davanstrien marked this pull request as ready for review April 27, 2026 13:45

Wauplin approved these changes Apr 27, 2026

View reviewed changes

Wauplin merged commit a259ecb into huggingface:main Apr 27, 2026
12 of 16 checks passed

davanstrien mentioned this pull request May 6, 2026

[CLI] Add hf spaces duplicate command #4197

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CLI] Add hf models card and hf datasets card commands#4118

[CLI] Add hf models card and hf datasets card commands#4118
Wauplin merged 6 commits into
huggingface:mainfrom
davanstrien:feat/cli-card-command

davanstrien commented Apr 16, 2026 •

edited by cursor Bot

Loading

Uh oh!

bot-ci-comment Bot commented Apr 16, 2026

Uh oh!

codecov Bot commented Apr 16, 2026

Uh oh!

Wauplin left a comment

Uh oh!

Uh oh!

Uh oh!

Wauplin Apr 23, 2026

Uh oh!

Uh oh!

davanstrien commented Apr 27, 2026

Uh oh!

Wauplin left a comment

Uh oh!

Uh oh!

huggingface-hub-bot Bot commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		assert kwargs["sort"] == "downloads"


		class TestModelsCardCommand:

Conversation

davanstrien commented Apr 16, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Examples

Design notes

Uh oh!

bot-ci-comment Bot commented Apr 16, 2026

Uh oh!

codecov Bot commented Apr 16, 2026

Codecov Report

Uh oh!

Wauplin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Wauplin Apr 23, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

davanstrien commented Apr 27, 2026

Uh oh!

Wauplin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

huggingface-hub-bot Bot commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

davanstrien commented Apr 16, 2026 •

edited by cursor Bot

Loading