Deciphering Venetian handwriting: Difference between revisions
Jump to navigation
Jump to search
Line 12: | Line 12: | ||
|- | |- | ||
|10 | |10 | ||
| | |Mapping transcription (excel file) -> page id | ||
|- | |- | ||
|11 | |11 | ||
Line 29: | Line 29: | ||
===Week 09=== | ===Week 09=== | ||
* Input : Sommarioni images | |||
* Output : Patch of pixels containing text with coordinate of the patch in the Sommarioni | |||
* Step 1 : Recognize lines of hand written text | |||
* Step 2 : Extraction of the patches | |||
===Week 10=== | ===Week 10=== | ||
* Input : transcription (Excel File), tuples (page id, patch) extracted in week 9 | |||
* Output : line in the transcription -> page id | |||
* Step 1 : HTR recognition in the patch and cleaning : (patch, text) | |||
* Step 2 : Find matching pair between recognized text and transcription | |||
* Step 3 : New excel file with the new page id column | |||
===Week 11=== | ===Week 11=== | ||
===Week 12=== | ===Week 12=== | ||
==Historical introduction to the source== | ==Historical introduction to the source== |
Revision as of 11:27, 15 November 2020
Introduction
Planning
Week | Task |
---|---|
09 | Segment patch of text in Sommarioni : (page id, patch) |
10 | Mapping transcription (excel file) -> page id |
11 | Mapping between HTR output and Excel file : (text from the HTR) -> (excel cell) |
12 | Full mapping : (page id, patch) -> (excel cell) |
13 | Final results, final evaluation & final report writing |
14 | Final project presentation |
Week 09
- Input : Sommarioni images
- Output : Patch of pixels containing text with coordinate of the patch in the Sommarioni
- Step 1 : Recognize lines of hand written text
- Step 2 : Extraction of the patches
Week 10
- Input : transcription (Excel File), tuples (page id, patch) extracted in week 9
- Output : line in the transcription -> page id
- Step 1 : HTR recognition in the patch and cleaning : (patch, text)
- Step 2 : Find matching pair between recognized text and transcription
- Step 3 : New excel file with the new page id column