Abstract
Personalizing a large-scale pretrained Text-to-Image (T2I) diffusion model is challenging as it typically struggles to make an appropriate trade-off between its training data distribution and the target distribution, i.e., learning a novel concept with only a few target images to achieve personalization (aligning with the personalized target) while preserving text editability (aligning with diverse text prompts). In this paper, we propose PaRa, an effective and efficient Parameter Rank Reduction approach for T2I model personalization by explicitly controlling the rank of the diffusion model parameters to restrict its initial diverse generation space into a small and well-balanced target space. Our design is motivated by the fact that taming a T2I model toward a novel concept such as a specific art style implies a small generation space. To this end, by reducing the rank of model parameters during finetuning, we can effectively constrain the space of the denoising sampling trajectories towards the target. With comprehensive experiments, we show that PaRa achieves great advantages over existing finetuning approaches on single/multisubject generation as well as single-image editing. Notably, compared to the prevailing fine-tuning technique LoRA, PaRa achieves better parameter efficiency (2× fewer learnable parameters) and much better target image alignment.
| Original language | English |
|---|---|
| Title of host publication | 13th International Conference on Learning Representations, ICLR 2025 |
| Editors | Y. Yue, A. Garg, N. Peng, F. Sha, R. Yu |
| Place of Publication | Appleton WI USA |
| Publisher | International Conference on Learning Representations (ICLR) |
| Pages | 53523-53547 |
| Number of pages | 25 |
| ISBN (Electronic) | 9798331320850 |
| Publication status | Published - 2025 |
| Event | International Conference on Learning Representations 2025 - Singapore, Singapore Duration: 24 Apr 2025 → 28 Apr 2025 Conference number: 13th https://openreview.net/group?id=ICLR.cc/2025/Conference (Peer Review) https://iclr.cc/Conferences/2025 (Website) https://proceedings.iclr.cc/paper_files/paper/2025 (Proceedings) |
Conference
| Conference | International Conference on Learning Representations 2025 |
|---|---|
| Abbreviated title | ICLR 2025 |
| Country/Territory | Singapore |
| City | Singapore |
| Period | 24/04/25 → 28/04/25 |
| Internet address |
|
Keywords
- Text-to-Image diffusion model
- Diffusion model fine-tuning
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver