eAI-Repair-Toolkit

A toolkit for automated DNN repair.

Getting Started

Prerequisites

python >= 3.8
pip >= 21.3

Installing

$ python -m venv <path/to/venv>
$ source <path/to/venv>/bin/activate
(venv-name) $ pip install --upgrade pip
(venv-name) $ pip install -e .

Usage

Note

See tutorial for an example usage with concrete DNN model and dataset. See user manual for details.

Issues Standing

The following items should be given in typical machine learning contexts.

model : a DNN model (to be repaired)
dataset_test : a dataset for testing model (and those repaired)

Let dataset_test_A be a subset of dataset_test containing data labelled ''A''.

(venv-name) $ repair test \
    --model_dir=${model} \
    --data_dir=${dataset_test_A}
...
accuracy: XX.XX%

If this XX.XX% does not satisfy your requirements for model (e.g., YY.YY% or larger), this toolkit can be used for repairing it.

Solution

This toolkit additionally requires:

dataset_repair : an additional dataset for repairing model. This contents should be different from those in dataset_test (and a training dataset as well) to prevent overfitting.

This toolkit has a functionality to generate subsets of dataset_repair that model succeeds and fails to predict on each label.

(venv-name) $ repair target \
    --model_dir=${model} \
    --data_dir=${dataset_repair}

Let dataset_repair_A_neg be the subset that model fails to predict on the label ''A''. In addition, let dataset_repair_pos be the subset that model succeeds to predict on all labels (called as a ''positive subset'').

Here you need to select a repair method to be applied.

method : a repair method implemented in this toolkit. See the list of repair methods

Run this toolkit to localize suspicious neural weights in model which may cause misprediction on the label ''A''.

(venv-name) $ repair localize \
    --method=${method} \
    --model_dir=${model} \
    --target_data_dir=${dataset_A_neg}

Then, run it again to optimize the localized weights while preventing degradation in model by using the positive subset.

(venv-name) $ repair optimize \
    --method=${method} \
    --model_dir=${model} \
    --target_data_dir=${dataset_repair_A_neg} \
    --positive_inputs_dir={dataset_repair_pos}

Finally, this toolkit outputs a DNN model model_repaired that is a repaired candidate of model. Let's check it.

(venv-name) $ repair test \
    --model_dir=${model_repaired} \
    --data_dir=${dataset_test_A}
...
accuracy: ZZ.ZZ%

If this ZZ.ZZ% is greater than the YY.YY%, you can obtain model_repaired that satisfies your requirements, indicating this toolkit worked to repair model!

Contributing

Please read CONTRIBUTING for details on our code of conduct, and the process for submitting pull requests to us.

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

Authors

Yuta Maezawa (Udzuki/NII)
Fuyuki Ishikawa (NII)
Nobukazu Yoshioka (Waseda/NII)

See also the list of contributors who participated in this project.

License

This project is licensed under the MIT License - see the LICENSE file for details.

We quote the license below because some tests and tutorial involve the data and labels of BDD100K.

Copyright ©2018. The Regents of the University of California (Regents). All Rights Reserved.

THIS SOFTWARE AND/OR DATA WAS DEPOSITED IN THE BAIR OPEN RESEARCH COMMONS REPOSITORY ON 1/1/2021

Permission to use, copy, modify, and distribute this software and its documentation for educational, research, and not-for-profit purposes, without fee and without a signed licensing agreement; and permission to use, copy, modify and distribute this software for commercial purposes (such rights not subject to transfer) to BDD and BAIR Commons members and their affiliates, is hereby granted, provided that the above copyright notice, this paragraph and the following two paragraphs appear in all copies, modifications, and distributions. Contact The Office of Technology Licensing, UC Berkeley, 2150 Shattuck Avenue, Suite 510, Berkeley, CA 94720-1620, (510) 643-7201, otl@berkeley.edu, http://ipira.berkeley.edu/industry-info for commercial licensing opportunities.

IN NO EVENT SHALL REGENTS BE LIABLE TO ANY PARTY FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING LOST PROFITS, ARISING OUT OF THE USE OF THIS SOFTWARE AND ITS DOCUMENTATION, EVEN IF REGENTS HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

REGENTS SPECIFICALLY DISCLAIMS ANY WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE SOFTWARE AND ACCOMPANYING DOCUMENTATION, IF ANY, PROVIDED HEREUNDER IS PROVIDED “AS IS”. REGENTS HAS NO OBLIGATION TO PROVIDE MAINTENANCE, SUPPORT, UPDATES, ENHANCEMENTS, OR MODIFICATIONS.

Acknowledgments

This work was supported by JST-Mirai Program Grant Number JPMJMI18BB, Japan.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.github		.github
CONTRIBUTING		CONTRIBUTING
docs		docs
src/repair		src/repair
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

eAI-Repair-Toolkit

Getting Started

Prerequisites

Installing

Usage

Issues Standing

Solution

Contributing

Versioning

Authors

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

eAI-Repair-Toolkit

Getting Started

Prerequisites

Installing

Usage

Issues Standing

Solution

Contributing

Versioning

Authors

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages