Europeana: mapping postcards
Jingbang.liu (talk | contribs) |
Revision as of 16:39, 17 December 2023
Introduction & Motivation
Deliverables
- 39,587 records related to postcards with image copyrights, along with their metadata, from the Europeana website.
- OCR results of a sample set of 350 images containing text.
- GPT-3.5 prediction results for a sample set of 350 images containing text, based on OCR results.
- A high-quality, manually annotated Ground Truth for a sample set of 309 images.
- GPT-3.5 prediction results for Ground Truth.
- GPT-4 prediction results for Ground Truth.
- An interactive webpage displaying the mapping of the postcards.
- A GitHub repository containing all the code for the project.
Methodologies
Data collection
OCR
Prediction using GPT
The Build of Ground Truth
Web Application
Result Assessment
Limitations & Future work
Project plan & milestones
| Timeframe | Task | Completion |
|---|---|---|
| Week 4 | | ✅ |
| Week 5 | | ✅ |
| Week 6 | | ✅ |
| Week 7 | | ✅ |
| Week 8 | | ✅ |
| Week 9 | | ✅ |
| Week 10 | | ✅ |
| Week 11 | | ✅ |
| Week 12 | | ✅ |
| Week 13 | | ✅ |
| Week 14 | | ✅ |