You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Called ablation, but perform masking of features using a baseline.
Editing replaces tokens with their nearest neighbors in the vocabulary embedding space and measures saliency as the drop in performance for the target. In the future, this can allow users to specify a custom editing strategy via an input Callable.
Possibly overlapping with feature ablation up to some measure.
Valid only for decoder-only models.
Verify whether it would be exactly equivalent to Value Zeroing, include only if functionally different (alias otherwise).
π Feature Request
The following is a non-exhaustive list of perturbation-based feature attribution methods that could be added to the library:
pytorch/captumpytorch/captumpytorch/captumpytorch/captumpytorch/captumkeyonvafa/sequential-rationalesDFKI-NLP/thermostatdylan-slack/Modeling-Uncertainty-Local-Explainabilitydylan-slack/Modeling-Uncertainty-Local-ExplainabilityDFKI-NLP/OLMcifkao/context-probingykwon0407/WeightedSHAPhmohebbi/ValueZeroingYilunZhou/solvability-explainerYilunZhou/solvability-explainerkmeng01/romecasszhao/ReAGentk-amara/syntax-shapNotes:
Footnotes
Callable.