ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors

This repository contains the official implementation of the paper:

"ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors"
Junghyun Koo, Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Giorgio Fabbro, Michele Mancusi, Yuki Mitsufuji
Presented at ISMIR 2025

🔧 Installation

sudo apt-get update && sudo apt-get install -y \
  libsox-fmt-all \
  libsox-dev \
  sox \
  libsndfile1
pip install ito_master

🚀 Inference Examples

Basic usage examples (full examples available in inference_example.sh):

🎛️ Style Transfer Only

python inference.py \
  --input_path examples/input_1.flac \
  --reference_path examples/reference_1.flac \
  --model_type white_box \
  --inference_device cuda \
  --output_dir_path outputs/white_box_st/

🎛️ Style Transfer + ITO (AudioFeatureLoss)

python inference.py \
  --input_path examples/input_1.flac \
  --reference_path examples/reference_1.flac \
  --model_type white_box \
  --inference_device cuda \
  --perform_ito \
  --ito_reference_path examples/reference_1.flac \
  --ito_objective AudioFeatureLoss \
  --num_steps 100 \
  --ito_save_freq 10 \
  --learning_rate 0.01 \
  --output_dir_path outputs/white_box_ito_af/

🎛️ Style Transfer + ITO (CLAPFeatureLoss with text prompt)

python inference.py \
  --input_path examples/input_1.flac \
  --reference_path examples/reference_1.flac \
  --model_type white_box \
  --inference_device cuda \
  --perform_ito \
  --ito_reference_path examples/reference_1.flac \
  --ito_objective CLAPFeatureLoss \
  --clap_target_type Text \
  --clap_text_prompt "heavy metal" \
  --num_steps 100 \
  --ito_save_freq 10 \
  --learning_rate 0.01 \
  --output_dir_path outputs/white_box_ito_claptxt/

📜 Citation

Please cite our work if you find it useful:

@INPROCEEDINGS{koo2025ito, 
  title={ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors}, 
  author={Koo, Junghyun and Mart{\'\i}nez-Ram{\'\i}rez, Marco A. and Liao, Wei-Hsiang and Fabbro, Giorgio and Mancusi, Michele and Mitsufuji, Yuki}, 
  booktitle={The 26th International Society for Music Information Retrieval Conference (ISMIR)},  
  year={2025},
}

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
examples		examples
ito_master		ito_master
LICENSE		LICENSE
README.md		README.md
fxnorm_feat.npy		fxnorm_feat.npy
inference.py		inference.py
inference_example.sh		inference_example.sh
packages.txt		packages.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors

🔧 Installation

🚀 Inference Examples

🎛️ Style Transfer Only

🎛️ Style Transfer + ITO (AudioFeatureLoss)

🎛️ Style Transfer + ITO (CLAPFeatureLoss with text prompt)

📜 Citation

About

Uh oh!

Releases

Packages

Languages

License

SonyResearch/ITO-Master

Folders and files

Latest commit

History

Repository files navigation

ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors

🔧 Installation

🚀 Inference Examples

🎛️ Style Transfer Only

🎛️ Style Transfer + ITO (AudioFeatureLoss)

🎛️ Style Transfer + ITO (CLAPFeatureLoss with text prompt)

📜 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages