Keep Knowledge in Perception: Zero-Shot Image Aesthetic Assessment
Guolong Wang, Yike Tan, Hangyu Lin, and 1 more author
In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2024
ICASSP 2024 Oral Presentation
Image aesthetic assessment is an important issue in multimedia, but most existing studies employ supervised learning methods that rely on large-scale annotated data. However, aesthetic scoring annotations are difficult to obtain in large quantities. Therefore, this paper explores zero-shot image aesthetic assessment. We predict aesthetic scores by introducing knowledge of different attributes (e.g., Focus). First, we use prompt tuning to obtain a unique prompt for each aesthetic attribute as external knowledge. Second, we leverage image relations considering sentiment polarity as internal knowledge. Specifically, we obtain aesthetic attribute representations from pre-trained models via prompt learning, then select anchor images on specific attributes by sentiment polarity, computing aesthetic scores. Notably, annotated aesthetic scores are not used in the process. Experiments show that our zero-shot approach outperforms existing methods in several metrics.