CASTLE Source

This is the source repository for the CASTLE Benchmark project. This repository is a cleaned version of our internal repository to avoid leaking sensitive and copyrighted information. For more information on the project please look at https://github.com/CASTLE-Benchmark.

The parsed dataset JSON files are available on GitHub.

Authors: Richard A. Dubniczky, Krisztofer Zoltan Horvát, Tamás Bisztray, Mohamed Amine Ferrag, Lucas C. Cordeiro, and Norbert Tihanyi

Repository Statistics

Lines of Python code for parsing, wrappers and evaluation: 5284
Number of Python scripts: 34
Lines of C code for the tests: 10521
Number of custom Docker containers: 11
Number of commits to the project: 374

Wrappers

Wrappers are the way we automate as much of the evaluation process of a tool as possible. Due to the various licensing models, some tools are as easy as just running them in a container and gathering the reports, while others require registering and uploading code or linking repositories, and later getting the results using APIs or summary downloads.

All the tested tools must:

Provide a report that we can export going through all findings manually
Provide at least a limited free trial without the requirement to contact sales
Be able to scan C and/or C++ projects

Tool	Type	Credentials	Notes
Aikido	`web`, `download`	Yes	Needs registering, linking a repository, then downloading the result CSV and loading it into the evaluator
CBMC	`container`	No	Simple CLI and build tools wrapped in a container
Clang Analyzer	`container`	No	Simple CLI and build tools wrapped in a container
CodeQL	`container`	No	Simple CLI and build tools wrapped in a container
Code Threat	`web`, `api`	Yes	Needs registering, linking a repository, then the wrapper will download and evaluate the results automatically via an API
Coverity	`web`	Yes	Upload files with a local container and manually view on the website
CppCheck	`container`	No	Simple CLI and build tools wrapped in a container
ESBMC	`container`	No	Simple CLI and build tools wrapped in a container
GCC Fanalyzer	`container`	No	Simple CLI and build tools wrapped in a container
Gitlab SAST	`web`, `download`	Yes	Needs a gitlub project with the code committed and the result of the scan downloaded
Jit	`web`, `download`	Yes	Needs registering, linking a repository, then downloading the result and loading it into the evaluator
Semgrep	`container`	Yes	Needs a token to work with a registered user, but otherwise works from the CLI
Snyk	`api`	Yes	Needs registering and a token, and uses API requests for each check with a daily limit
SonarQube	`web`, `api`	Yes	Needs registering, linking a repository, then the wrapper will download and evaluate the results automatically via an API
Splint	`container`	Yes	Simple CLI and build tools wrapped in a container
LLM	`api`	Yes	API requests with a token with a pay-per-request model with a generic OpenAI API compliant interface

Steps

1. Generate the Dataset JSON

Takes the dataset folder and creates a dataset.json. The JSON contains all information for running the tests, and some metadata for classifying the results.

python -m venv .venv
. ./.venv/bin/activate
pip install -r requirements.txt
python parser.py

2. Select a test wrapper and run it

Running a test wrapper will usually use the scan folder in it's directory for performing the scans. In all cases you need to have the dataset built and inside the root of the repository. Please check out each wrapper for more information and additional scripts, as they all have a readme attached.

For most wrappers, you need to have docker installed on your system and have the docker daemon running.

After running multiple tests especially, it's advised to look at your local image repository in docker, as many of the containers are huge.

Please open the runner python file, as generally it will have settings that need to be updated, such as software versions.

For example with cppcheck:

cd wrappers/cppcheck
./build.sh # Build the container
python run.py

After generating a report, it is saved into the reports directory. Please commit it! We want to have a traceable timeline of reports for reference to older ones if possible.

3. Creating the statistics

The statistics are generated in the charts.ipynb Jupyter Notebook. In case you are using VSC, you can install an extension and view/run it in your IDE. The notebook will load the dataset.json, the cwe-collection.yaml, as well as all the reports to generate the statistics and graphs.

It is recommended that you use the locally created python virtual environment from Step 1, so you won't have to install them globally.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
dataset		dataset
reports		reports
scripts		scripts
wrappers		wrappers
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
Brewfile		Brewfile
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
build.sh		build.sh
charts.ipynb		charts.ipynb
complexity.py		complexity.py
cwe-collection.yaml		cwe-collection.yaml
parser.py		parser.py
prompt.txt		prompt.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CASTLE Source

Repository Statistics

Wrappers

Steps

1. Generate the Dataset JSON

2. Select a test wrapper and run it

3. Creating the statistics

Sources

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CASTLE Source

Repository Statistics

Wrappers

Steps

1. Generate the Dataset JSON

2. Select a test wrapper and run it

3. Creating the statistics

Sources

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages