This repository contains the code and dataset for the paper "Does Coding Style Really Survive Compilation? Stylometry of Executable Code Revisited" in the PETS 2025 Conference.
results: This folder contains the code and instructions to replicate the results.pipeline: This folder contains the pipeline used to process a single-fold dataset from scratch to generate feature files given the binary dataset.scripts: This folder contains scripts to preprocess the data.data: This folder contains the different datasets.explanability: This folder contains the cause analysis data presented in the paper.
- To replicate the results reported in the paper, head to the
resultsfolder. - To process data from scratch, head to the
pipelinefolder. (Coming Soon) - To inspect the Explanability Analysis data, head to the
explanabilityfolder. - To inspect the different datasets, head to the
datafolder.
Copyright @2025 SPRLAB
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Although our scripts depend on several third-party software, this repository does not contain (or distribute) any non-Apache license-compliant third-party software. Thus, the users are expected to obtain the required non-Apache licensed third-party software on their own.