Universal Aesthetics (Multimodal Focus): Difference between revisions
Jump to navigation
Jump to search
Jiajun.shen (talk | contribs) (→Peoms) |
Jiajun.shen (talk | contribs) (→Peoms) |
||
| Line 10: | Line 10: | ||
=== Peoms === | === Peoms === | ||
For poems, we use the [https://www.kaggle.com/datasets/michaelarman/poemsdataset/data Poems dataset] from Kaggle | For poems, we use the [https://www.kaggle.com/datasets/michaelarman/poemsdataset/data Poems dataset] from Kaggle. We find this dataset ideal for this project because of the following reasons: | ||
* As the plain-text dataset contains 1,024 entries, it provides enough poems to yield a substantial amount of data. | * As the plain-text dataset contains 1,024 entries, it provides enough poems to yield a substantial amount of data. | ||
* It categorizes the poems into 135 types based on their form (haiku, sonnet, etc.), which could facilitate our further studies. | * It categorizes the poems into 135 types based on their form (haiku, sonnet, etc.), which could facilitate our further studies. | ||
{| class="wikitable" | |||
|- | |||
! 列1 | |||
! 列2 | |||
|- | |||
| 内容A1 | |||
| 内容A2 | |||
|- | |||
| 内容B1 | |||
| 内容B2 | |||
|} | |||
== References == | == References == | ||
<references /> | <references /> | ||
Revision as of 21:35, 27 November 2025
Introduction
Methods
Data
As for the convergence of language models, we need both plain texts and aesthetic texts. For simplicity, we reuse this text-image dataset, which is also used in Huh et al.'s paper, and then add another poem dataset.
Plain Text
Peoms
For poems, we use the Poems dataset from Kaggle. We find this dataset ideal for this project because of the following reasons:
- As the plain-text dataset contains 1,024 entries, it provides enough poems to yield a substantial amount of data.
- It categorizes the poems into 135 types based on their form (haiku, sonnet, etc.), which could facilitate our further studies.
| 列1 | 列2 |
|---|---|
| 内容A1 | 内容A2 |
| 内容B1 | 内容B2 |