Europeana: mapping postcards
Jingbang.liu (talk | contribs) |
Revision as of 16:39, 17 December 2023
Introduction & Motivation
Deliverables
- 39,587 records related to postcards with image copyrights, along with their metadata, from the Europeana website.
- OCR results of a sample set of 350 images containing text.
- GPT-3.5 prediction results for a sample set of 350 images containing text, based on OCR results.
- A high-quality, manually annotated Ground Truth for a sample set of 309 images.
- GPT-3.5 prediction results for the Ground Truth.
- GPT-4 prediction results for the Ground Truth.
- An interactive webpage displaying the mapping of the postcards.
- A GitHub repository containing all the code for the project.
Methodologies
Data collection
OCR
Prediction using GPT
The Build of Ground Truth
Web Application
Result Assessment
Limitations & Future work
Project plan & milestones
Timeframe | Task | Completion |
---|---|---|
Week 4 | | ✅ |
Week 5 | | ✅ |
Week 6 | | ✅ |
Week 7 | | ✅ |
Week 8 | | ✅ |
Week 9 | | ✅ |
Week 10 | | ✅ |
Week 11 | | ✅ |
Week 12 | | ✅ |
Week 13 | | ✅ |
Week 14 | | ✅ |