Universal Aesthetics (Multimodal Focus)

From FDHwiki
Revision as of 21:17, 27 November 2025 by Jiajun.shen (talk | contribs) (→‎Data)

Introduction

Methods

Data

To study the convergence of language models, we need both plain texts and aesthetic texts. For simplicity, we reuse this text-image dataset, which was also used in Huh et al.'s paper[1].

Plain Text

Poems

For poems, we use the Poems dataset from Kaggle, and categorize the poems into 135 types based on their form (haiku, sonnet, etc.).
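
The grouping step could look roughly like the following minimal sketch. It assumes each poem is available as a (form, text) pair; that record layout, the function name group_poems_by_form, and the placeholder sample texts are hypothetical and do not reflect the Kaggle dataset's actual schema.

```python
from collections import defaultdict


def group_poems_by_form(records):
    """Group poem texts by their form label.

    `records` is an iterable of (form, text) pairs. This pair layout is an
    assumption made for illustration, not the dataset's real schema.
    """
    groups = defaultdict(list)
    for form, text in records:
        # Normalize the label so "Haiku" and "haiku " land in the same group.
        label = form.strip().lower()
        body = text.strip()
        # Skip records with a missing label or an empty poem.
        if label and body:
            groups[label].append(body)
    return dict(groups)


# Placeholder records standing in for real poems.
sample = [
    ("Haiku", "placeholder haiku text A"),
    ("sonnet", "placeholder sonnet text"),
    ("haiku ", "placeholder haiku text B"),
]
poems_by_form = group_poems_by_form(sample)
print(sorted(poems_by_form))
print(len(poems_by_form["haiku"]))
```

Normalizing the labels before grouping keeps the number of categories from being inflated by differences in capitalization or stray whitespace.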


References