  • Australia

wuyoscar/README.md


arXiv ISC-Bench


👋 About Me

class OscarWu:
    role = "Red Team Otaku"  # yes, both things

    def __init__(self):
        self.status = "jailbreaking"

    def motto(self):
        return "I trace failure."

    def next_target(self):
        return "the weakest assumption"

Featured Work

Internal Safety Collapse
Internal Safety Collapse in Frontier Large Language Models, with ISC-Bench, JailbreakArena, templates, tutorials, and live cases.
UltraBreak
Toward Universal and Transferable Jailbreak Attacks on Vision-Language Models.
Awesome Large Model Safety
Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety.
Safety in Embodied AI
Risks, attacks, and defenses across the full embodied AI pipeline, covering 480+ papers.
AI Safety Report
An integrated safety evaluation of frontier models across language, vision-language, and image generation.


Pinned

  1. ISC-Bench (Public)

    Internal Safety Collapse: turning LLMs into a "jailbroken state" without a jailbreak attack.

    Python · 635 stars · 120 forks

  2. xingjunm/Awesome-Large-Model-Safety (Public)

    Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety

    248 stars · 9 forks