UNICORN

This is the code and dataset repository for UNICORN paper in ICMLCN 2025.

Dataset

The traces that are used for training/test the models are included in the trace folder in the git repository. Full dataset before and after Colosseum can be found here.

Repository Structure

The repository has the following overall structure:

code directory: Contains the python code to do preprocessing, training, and test pipelines.
traces directory: Contains all the KPI traces used in the paper UNICORN.
file2app_map.pkl: Contains the mapping between the file names in traces and the application classes.
Plotting.ipynb: Contains scripts to read prediction pickle files and plot the results.

Running the code

Preprocessing

As the first step you need to preprocess the dataset using preprocessing.py in the code directory. Run it as: ''' python preprocessing.py ''' The above script reads the KPI traces stored in the .csv files in the trace directory, preprocesses them and saves training, validation, and test partitions and statistics in the .pkl files stored in pkls directory.

Training and Test pipeline

Training and test pipeline can be run only after Preprocessing step is done.

The pipeline can be run through run_pipeline.sh in the code directory with the following content.

python top.py \
--gpu_id $1 \
--Train \
--Test \
--OOD_class 'Teams' \
--model_flag 'ResConvNet' \
--weight_filename 'weights-ICMLCN4-ResConvNet-5ids-oodindex4.pt'

This bash file runs the file top.py that calls the pipeline for training and test. Here is a short description of the arguments.

--gpu_id: The ID of GPU node on your system. This can be set to 0 or 1 or 2 or ... depending on the GPU node you want to use.
--Train: store_true parameter which indicates if we want to train an NN model.
--Test: store_true parameter which indicates if we want to test a trained NN model.
--OOD_class: The class name you would like to exclude from the training set. Can be any of the following: 'CallofDuty', 'Facebook', 'Meet', 'Zoom', 'Teams', 'Twitch'
--model_flag: Indicating the model architecture. Can be 'ResConvNet' or ForwConvNet
--weight_filename: Indicates the filename of NN weight files, that is used only for testing the trained model.

The pipeline can be run through:

./run_pipeline.sh 1

Where 1 is the GPU ID.

The test pipeline saves model predictions in .pkl files in the results directory.

Plotting the results

You can plot the confusion matrices, bar graphs, line graphs, and scatter plots in the paper using Plotting.ipynb script.

Running Test using the Pretrained Models and Plotting

My results directory for the paper UNICORN is located here, which can be downloaded and placed in the root folder next to code directory.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
code		code
traces		traces
.gitignore		.gitignore
Plotting.ipynb		Plotting.ipynb
README.md		README.md
file2app_map.pkl		file2app_map.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

UNICORN

Dataset

Repository Structure

Running the code

Preprocessing

Training and Test pipeline

Plotting the results

Running Test using the Pretrained Models and Plotting

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

UNICORN

Dataset

Repository Structure

Running the code

Preprocessing

Training and Test pipeline

Plotting the results

Running Test using the Pretrained Models and Plotting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages