FEAT: add a fetch function for Aya Red-teaming Dataset#713
Merged
romanlutz merged 7 commits intomicrosoft:mainfrom Feb 25, 2025
Merged
FEAT: add a fetch function for Aya Red-teaming Dataset#713romanlutz merged 7 commits intomicrosoft:mainfrom
romanlutz merged 7 commits intomicrosoft:mainfrom
Conversation
This commit also adds helper functions to work with .jsonl files, since the Aya Red-teaming dataset is stored in this format.
This update adds support for `.jsonl` (JSON Lines) files, ensuring that unit tests pass successfully when these files are used.
romanlutz
reviewed
Feb 25, 2025
Contributor
romanlutz
left a comment
There was a problem hiding this comment.
This took me an embarrassingly long time to get to, but it's a fantastic contribution. Thank you @paulinek13
romanlutz
approved these changes
Feb 25, 2025
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
This PR aims to close #416 by implementing a fetch function for Aya Red-teaming Dataset with filtering options for
language,harm_category, andglobal_or_localparameters.I've also added helper functions for handling
.jsonlfiles and updated the test configuration to recognize.jsonlas a valid file type, since the Aya Red-teaming Dataset data is stored in this format.Tests and Documentation
pytest tests/unit && pre-commit run --all-filescompletes without errorsfetch_aya_redteaming_datasetfunction added to the API reference