Open Access Te Herenga Waka-Victoria University of Wellington

Focusing on your subject: Deep subject-aware image composition recommendation networks

journal contribution
posted on 2024-05-26, 23:45 authored by Guo-Ye Yang, Wen-Yang Zhou, Yun Cai, Song-Hai Zhang, Fanglue Zhang

Photo composition is one of the most important factors in the aesthetics of photographs. As a popular application, composition recommendation for a photo focusing on a specific subject has been ignored by recent deep-learning-based composition recommendation approaches. In this paper, we propose a subject-aware image composition recommendation method, SAC-Net, which takes an RGB image and a binary subject window mask as input, and returns good compositions as crops containing the subject. Our model first determines candidate scores for all possible coarse cropping windows. The crops with high candidate scores are selected and further refined by regressing their corner points to generate the output recommended cropping windows. The final scores of the refined crops are predicted by a final score regression module. Unlike existing methods that need to preset several cropping windows, our network is able to automatically regress cropping windows with arbitrary aspect ratios and sizes. We propose novel stability losses for maximizing smoothness when changing cropping windows along with view changes. Experimental results show that our method outperforms state-of-the-art methods not only on the subject-aware image composition recommendation task, but also on general-purpose composition recommendation. We have also designed a multistage labeling scheme so that a large number of ranked pairs can be produced economically. We use this scheme to build the first subject-aware composition dataset, SACD, which contains 2777 images and more than 5 million ranked composition pairs. The SACD dataset is publicly available at https://cg.cs.tsinghua.edu.cn/SACD/.
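
The coarse-to-fine idea in the abstract (score coarse cropping windows that contain the subject, then keep the best ones for refinement) can be illustrated with a minimal sketch. This is not SAC-Net: the function names `coarse_candidates` and `top_k`, the grid `step`, and the toy scoring function are all assumptions made for illustration, and the learned candidate scores, corner regression, and final score regression described in the abstract are not modeled here.

```python
from itertools import product


def coarse_candidates(width, height, subject, step):
    """Enumerate crops (x0, y0, x1, y1) on a coarse grid that contain the subject.

    `subject` is the subject window as (x0, y0, x1, y1). The grid spacing
    `step` is a hypothetical parameter, not a value from the paper.
    """
    sx0, sy0, sx1, sy1 = subject
    xs = range(0, width + 1, step)
    ys = range(0, height + 1, step)
    crops = []
    for x0, y0, x1, y1 in product(xs, ys, xs, ys):
        # Keep only non-empty crops that fully enclose the subject window.
        if x0 < x1 and y0 < y1 and x0 <= sx0 and y0 <= sy0 and x1 >= sx1 and y1 >= sy1:
            crops.append((x0, y0, x1, y1))
    return crops


def top_k(crops, score_fn, k):
    """Return the k highest-scoring crops under an arbitrary scoring function."""
    return sorted(crops, key=score_fn, reverse=True)[:k]


def tight_crop_score(crop):
    """Toy stand-in for a learned candidate score: prefers smaller crops."""
    x0, y0, x1, y1 = crop
    return -(x1 - x0) * (y1 - y0)


candidates = coarse_candidates(4, 4, subject=(2, 2, 3, 3), step=2)
best = top_k(candidates, tight_crop_score, k=1)
```

In a real system the toy scorer would be replaced by a trained model, and the selected crops would then be refined rather than returned directly.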

Funding

Reconstructing Dynamic Panoramic Scenes in Mixed Reality

Royal Society of New Zealand

History

Preferred citation

Yang, G. -Y., Zhou, W. -Y., Cai, Y., Zhang, S. -H. & Zhang, F. -L. (2023). Focusing on your subject: Deep subject-aware image composition recommendation networks. Computational Visual Media, 9(1), 87-107. https://doi.org/10.1007/s41095-021-0263-3

Journal title

Computational Visual Media

Volume

9

Issue

1

Publication date

2023-03-01

Pagination

87-107

Publisher

Springer Science and Business Media LLC

Publication status

Published online

Online publication date

2022-10-18

ISSN

2096-0433

eISSN

2096-0662

Language

en
