Europeana: mapping postcards: Difference between revisions

From FDHwiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 1: Line 1:
== Introduction & Motivation ==
= Introduction & Motivation =
== Deliverables ==
= Deliverables =
== Methodologies ==
= Methodologies =
=== Data collection ===
== Data collection ==
== Result Assessment ==
= Result Assessment =
== Limitations & Future work ==
= Limitations & Future work =
== Projet plan & milestones  ==
= Projet plan & milestones  =
{|class="wikitable"
{|class="wikitable"
! style="text-align:center;"|Timeframe
! style="text-align:center;"|Timeframe
Line 73: Line 73:
| align="center" |Week 12
| align="center" |Week 12
|
|
* Use the TA's annotation tool for test set evaluation
* Use the TA's annotation tool for building a ground truth
* Build the visualization platform
* Build the visualization platform
| align="center" |  
| align="center" |
|-
|-


Line 83: Line 83:
* Analyze the results of the test set evaluation
* Analyze the results of the test set evaluation


| align="center" |  
| align="center" |
|-
|-


Line 89: Line 89:
|
|
* Prepare the final report and presentation
* Prepare the final report and presentation
| align="center" |  
| align="center" |
|-
|-
|}
|}

Revision as of 14:15, 17 December 2023

Introduction & Motivation

Deliverables

Methodologies

Data collection

Result Assessment

Limitations & Future work

Projet plan & milestones

Timeframe Task Completion
Week 4
  • Explore postcard search results on Europeana's website
  • Study the Europeana API documentation and get an access key.
  • Extract data of postcards using the Europeana API
Week 5
  • Clean data using metadata.
  • Analyze the data of Europeana postcards
  • Prepare sample image sets and explore prediction methods
Week 6
  • Decide to focus on postcards with text
  • Test and evaluate the effectiveness of multiple OCR models
Week 7
  • Use OCR and NER for prediction
  • Test and evaluate the effectiveness of multiple NER tools
  • Explore alternative forecasting methods
Week 8
  • Introduce ChatGPT for the prediction(OCR+GPT-3.5+NER)
  • Try to make predictions directly using GPT-4
Week 9
  • Optimize GPT-3.5 prompt for better results
  • Compare the results of OCR + GPT-3.5 (optimized prompts) to those of GPT-4.
Week 10
  • Complete the pipeline for the entire prediction process
  • Prepare a sample set to evaluate the effect
Week 11
  • Explore the visualization methods
  • Refine the test set and analyze it
Week 12
  • Use the TA's annotation tool for building a ground truth
  • Build the visualization platform
Week 13
  • Testing and refinement of the Web application
  • Analyze the results of the test set evaluation
Week 14
  • Prepare the final report and presentation

Github Repository

Europeana-mapping-postcards

References