Universal Aesthetics (Multimodal Focus)

From FDHwiki
Revision as of 21:34, 27 November 2025 by Jiajun.shen (talk | contribs) (→‎Peoms)
Jump to navigation Jump to search

Introduction

Methods

Data

As for the convergence of language models, we need both plain texts and aesthetic texts. For simplicity, we reuse this text-image dataset, which is also used in Huh et al.'s paper, and then add another poem dataset.

Plain Text

Peoms

For poems, we use the Poems dataset from Kaggle. This dataset contains 6,322 poems in 135 different types in total. We find this dataset ideal for this project because of the following reasons:

  • As the plain-text dataset contains 1,024 entries, it provides enough poems to yield a substantial amount of data.
  • It categorizes the poems into 135 types based on their form (haiku, sonnet, etc.), which could facilitate our further studies.

References