Published April 11, 2024 | Version v1
Dataset
Restricted
Today's cat is tomorrow's dog: accounting for time-based changes in the labels of ML vulnerability detection approaches (Replication Package Part 2: Linux dataset)
Description
Replication package for "Time-based Analysis on Software Vulnerability ML Detection: A Differentiated Replication", Part 2 (Linux dataset).
The zipped package includes:
- Datasets that we created using our methodology and the original dataset (from [NVD Vuldeepecker](https://github.com/CGCL-codes/VulDeePecker)):
  - retrospective test
  - believed_perspective test
  - perspective test
- Pre-trained models that we generated during our evaluation (3 test results for each time point in the timeline [2011-2017]).
We also include the summarized result files (.xlsx):
1. Timeline of Datasets.xlsx
2. ML Evaluations.xlsx
The notebook that produces the charts in the paper is also included: Charts.ipynb
Please refer to https://zenodo.org/records/10965516 for Part 1 (NVD Vuldeepecker dataset).