← Back to publications

Abstract

Mozualization is a multimodal AI tool for creating and editing music through text, image, and audio inputs. It helps users combine emotional descriptions, visual materials, and sound samples into music with customized style, color, and personal expression. The system explores how multimodal generation can make music composition more accessible while preserving creative control.

BibTeX

@inproceedings{xu2025mozualization,
  author = {Xu, Wanfang and Zhao, Lixiang and Song, Haiwen and Song, Xinheng and Lu, Zhaolin and Liu, Yu and Chen, Min and Lim, Eng Gee and Yu, Lingyun},
  title = {Mozualization: Crafting Music and Visual Representation with Multimodal AI},
  booktitle = {Extended Abstracts of the CHI Conference on Human Factors in Computing Systems},
  year = {2025},
  pages = {1--7},
  doi = {10.1145/3706599.3719686}
}