Universal Aesthetics (Multimodal Focus): Difference between revisions

From FDHwiki
Jump to navigation Jump to search
Line 5: Line 5:


== Data ==
== Data ==
As for the convergence of language models, we need both plain texts and aesthetic texts. For simplicity, we reuse this [https://huggingface.co/datasets/minhuh/prh/tree/wit_1024 text-image dataset], which is also used in Huh et al.'s paper<ref>{{cite web
As for the convergence of language models, we need both plain texts and aesthetic texts. For simplicity, we reuse this [https://huggingface.co/datasets/minhuh/prh/tree/wit_1024 text-image dataset], which is also used in Huh et al.'s paper
| title = The Platonic Representation Hypothesis
 
| author = Minyoung Huh; Brian Cheung; Tongzhou Wang; Phillip Isola
| year = 2024
| url = https://arxiv.org/abs/2405.07987
| publisher = arXiv
| access-date = 2025-02-27
}}</ref>
=== Plain Text ===
=== Plain Text ===



Revision as of 21:19, 27 November 2025

Introduction

Methods

Data

As for the convergence of language models, we need both plain texts and aesthetic texts. For simplicity, we reuse this text-image dataset, which is also used in Huh et al.'s paper

Plain Text

Peoms

For poems, we use the Poems dataset from Kaggle, and categorize the poems into 135 types based on their form (haiku, sonnet, etc.).

References