--- license: apache-2.0 pipeline_tag: image-to-image library_name: diffusers --- # iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation [](https://huggingface.co/papers/2511.20635) [](https://kr1sjfu.github.io/iMontage-web/) [](https://github.com/Kr1sJFU/iMontage)
| *Change the material of the lava to silver.* |
|
| **cref** |
| *Confucius from the first image, Moses from the secondβ¦* |
|
| **conditioned_cref** |
| *depth* |
|
| **sref** |
| *(empty)* |
|
| **multiview** |
| *1. Shift left; 2. Look up; 3. Zoom out.* |
|
| **storyboard** |
| *Vintage film: 1. Hepburn carrying the yellow bagβ¦* |
|
## π Acknowledgment
We sincerely thank the open-source community for providing strong foundations that enabled this work.
In particular, we acknowledge the following projects for their models, datasets, and valuable insights:
- **HunyuanVideo-T2V**, **HunyuanVideo-I2V** β Provided base generative model designs and code.
- **FastVideo** β Contributed key components and open-source utilities that supported our development.
These contributions have greatly influenced our research and helped shape the design of **iMontage**.
---
## π Citation
If you find **iMontage** useful for your research or applications, please consider starring β the repo and citing our paper:
```bibtex
@article{fu2025iMontage,
title={iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation},
author={Zhoujie Fu and Xianfang Zeng and Jinghong Lan and Xinyao Liao and Cheng Chen and Junyi Chen and Jiacheng Wei and Wei Cheng and Shiyu Liu and Yunuo Chen and Gang Yu and Guosheng Lin},
journal={arXiv preprint arXiv:2511.20635},
year={2025},
}
```