Multiverse Engine

This repository contains the official implementation of Multiverse Engine, which is built on the SGLang codebase to support inference for Multiverse models. For more details, please refer to our research paper:

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

🚀 Installation

To set up the environment, create a new conda environment and then run the following installation script.

conda create -n multiverse python=3.11
conda activate multiverse

git clone https://github.com/Multiverse4FM/Multiverse-Engine
cd Multiverse-Engine
bash install.sh
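
After the script finishes, a quick sanity check can confirm that the environment sees the engine. The sketch below assumes that install.sh makes a package importable as `sglang` (suggested by the `sglang` folder in the repository, but not stated here); the helper name `is_importable` is purely illustrative.

```python
import importlib.util


def is_importable(package_name: str) -> bool:
    """Return True if the active environment can locate the given top-level package."""
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    # Assumption: the install script exposes the engine under the name "sglang".
    if is_importable("sglang"):
        print("sglang is importable in this environment")
    else:
        print("sglang was not found; check that the conda environment is active")
```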

✨ Quick Start

Multiverse Engine is used in the same way as SGLang. See example.py for a simple demonstration.

The script accepts the following arguments:

--model_path: The path to the base model on your local machine or from the Hugging Face Hub.
--prompts_path: A path to a JSON file containing a list of prompts.

To run the quick start example:

cd example

python example.py \
  --model_path Multiverse4FM/Multiverse-32B \
  --prompts_dir ./prompt

This loads the model and generates a response for each prompt in the specified prompts file, using the Multiverse capabilities defined for the model.
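
For preparing or validating a prompts file, a minimal sketch follows. It assumes the JSON file is a flat list of strings, which matches the description of the prompts argument above but is not a confirmed schema; `load_prompts` is a hypothetical helper and is not taken from example.py.

```python
import json
from typing import List


def load_prompts(path: str) -> List[str]:
    """Read a JSON file and return its contents as a list of prompt strings.

    Assumed schema (not confirmed by the repository): ["prompt 1", "prompt 2", ...]
    """
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    if not isinstance(data, list) or not all(isinstance(p, str) for p in data):
        raise ValueError(f"{path} must contain a JSON list of strings")
    return data
```

Validating the file up front avoids loading a large model only to fail on a malformed input.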

🚧 Issues

We are actively working on addressing the following known issues and areas for improvement:

Support KV Cache Eviction: Currently, KV cache offloading and reloading mechanisms are not supported. To ensure correctness, we recommend limiting the batch size to 50 or fewer.
Avoid Infinite Parallelism: We are working on implementing safeguards against infinite-depth parallelism that may be introduced by the model itself.
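
One way to respect the batch-size recommendation is to split the prompt list on the client side before submitting it. The following is a sketch only: `chunk_prompts` and `MAX_BATCH_SIZE` are illustrative names, and how each chunk is passed to the engine is left to the caller.

```python
from typing import Iterator, List, Sequence

MAX_BATCH_SIZE = 50  # upper bound recommended in the known issues above


def chunk_prompts(
    prompts: Sequence[str], max_batch_size: int = MAX_BATCH_SIZE
) -> Iterator[List[str]]:
    """Yield consecutive slices of prompts, each with at most max_batch_size items."""
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be at least 1")
    for start in range(0, len(prompts), max_batch_size):
        yield list(prompts[start:start + max_batch_size])
```

Each yielded chunk can then be sent as one generation request, so no single request exceeds the limit.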

To avoid potential issues during usage, we recommend setting a maximum timeout in your code to prevent infinite loops or extremely long generation times.
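
A minimal sketch of such a timeout, using only the Python standard library, is shown below. `run_with_timeout` is a hypothetical wrapper, not part of Multiverse Engine or SGLang, and it only stops waiting: the worker thread itself keeps running in the background, so a hard stop would require running generation in a separate process that can be terminated.

```python
import concurrent.futures


def run_with_timeout(fn, timeout_s: float, *args, **kwargs):
    """Call fn(*args, **kwargs) and raise TimeoutError if it takes longer than timeout_s seconds."""
    executor = concurrent.futures.ThreadPoolExecutor(max_workers=1)
    future = executor.submit(fn, *args, **kwargs)
    try:
        return future.result(timeout=timeout_s)
    except concurrent.futures.TimeoutError:
        raise TimeoutError(f"call exceeded {timeout_s} seconds") from None
    finally:
        # Return immediately instead of waiting for the worker to finish.
        # The worker thread is NOT killed; it continues until fn returns.
        executor.shutdown(wait=False)
```

A caller would wrap its own generation function, for example `run_with_timeout(generate_fn, 600, prompts)`, where `generate_fn` stands in for whatever call produces the responses.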

📧 Contact

For any questions, bug reports, or feature requests, please open an issue on our GitHub repository or send an email to multiversefoundationmodel@gmail.com.

📚 References

Thank you for your interest in Multiverse Engine! We hope this tool will be helpful for your research and development. If you find it useful, please consider citing our work. Happy coding! 🚀

@misc{yang2025multiverselanguagemodelssecretly,
      title={Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation}, 
      author={Xinyu Yang and Yuwei An and Hongyi Liu and Tianqi Chen and Beidi Chen},
      year={2025},
      eprint={2506.09991},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2506.09991}, 
}
