FEAT: add harm categories to AdvBench Dataset by paulinek13 · Pull Request #732 · microsoft/PyRIT

paulinek13 · 2025-02-22T19:45:42Z

Description

This PR aims to resolve #730 by adding harm categories to the AdvBench dataset and enabling filtering support based on those categories.

Tests and Documentation

✔️ added tests
✔️ updated the docstring

pyrit/datasets/fetch_example_datasets.py

pyrit/datasets/harm_categories/adv_bench_dataset.json

pyrit/datasets/fetch_example_datasets.py

romanlutz

Awesome! LGTM

romanlutz reviewed Feb 24, 2025

View reviewed changes

pyrit/datasets/fetch_example_datasets.py Outdated Show resolved Hide resolved

paulinek13 force-pushed the adv_bench_dataset branch from 04f97c3 to a8ec1e4 Compare March 4, 2025 07:52

paulinek13 commented Mar 4, 2025

View reviewed changes

pyrit/datasets/harm_categories/adv_bench_dataset.json Outdated Show resolved Hide resolved

paulinek13 added 8 commits March 9, 2025 21:47

init

3e30de2

categorization idea

5f1e66a

complete categorization using Claude 3.7

255b0c5

add filtering support

94d25bc

better formatting of the JSON file

73c4070

simpler logic for the filtering

817125d

enhance docs

880c1cc

add tests

45824d3

paulinek13 force-pushed the adv_bench_dataset branch from f81623f to 45824d3 Compare March 9, 2025 20:48

romanlutz mentioned this pull request Mar 10, 2025

FEAT fix harm_categories for existing datasets #443

Closed

paulinek13 added 3 commits March 10, 2025 16:59

# type: ignore

c3927c4

add cache parameter

969ed17

add a useful link

b6245e0

paulinek13 changed the title ~~[DRAFT] add harm categories to AdvBench Dataset~~ FEAT: add harm categories to AdvBench Dataset Mar 10, 2025

paulinek13 marked this pull request as ready for review March 10, 2025 16:14

romanlutz reviewed Mar 10, 2025

View reviewed changes

pyrit/datasets/fetch_example_datasets.py Show resolved Hide resolved

romanlutz reviewed Mar 10, 2025

View reviewed changes

pyrit/datasets/fetch_example_datasets.py Show resolved Hide resolved

paulinek13 added 2 commits March 11, 2025 10:38

use cache

a0553ca

credit

49d1006

romanlutz approved these changes Mar 11, 2025

View reviewed changes

romanlutz merged commit 7138510 into microsoft:main Mar 11, 2025
14 checks passed

jsong468 pushed a commit that referenced this pull request Mar 11, 2025

FEAT: add harm categories to AdvBench Dataset (#732)

569ff44

paulinek13 deleted the adv_bench_dataset branch March 13, 2025 10:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT: add harm categories to AdvBench Dataset#732

FEAT: add harm categories to AdvBench Dataset#732
romanlutz merged 13 commits intomicrosoft:mainfrom
paulinek13:adv_bench_dataset

paulinek13 commented Feb 22, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

romanlutz left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

paulinek13 commented Feb 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests and Documentation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

romanlutz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

paulinek13 commented Feb 22, 2025 •

edited

Loading