Deciphering Venetian handwriting: Difference between revisions
Jump to navigation
Jump to search
Line 15: | Line 15: | ||
|- | |- | ||
|11 | |11 | ||
|Mapping transcription (excel file) -> page id (on the whole dataset) | |Mapping transcription (excel file) -> page id (on the whole dataset) | ||
|- | |- | ||
|12 | |12 | ||
| | |Depending of the quality of the results : improve the mapping of page id, more precise matching, viewer web | ||
|- | |- | ||
|13 | |13 |
Revision as of 12:44, 19 November 2020
Introduction
Planning
Week | Task |
---|---|
09 | Segment patch of text in Sommarioni : (page id, patch) |
10 | Mapping transcription (excel file) -> page id (proof of concept) |
11 | Mapping transcription (excel file) -> page id (on the whole dataset) |
12 | Depending of the quality of the results : improve the mapping of page id, more precise matching, viewer web |
13 | Final results, final evaluation & final report writing |
14 | Final project presentation |
Week 09
- Input : Sommarioni images
- Output : Patch of pixels containing text with coordinate of the patch in the Sommarioni
- Step 1 : Segment hand written text regions in Sommarioni images
- Step 2 : Extraction of the patches
Week 10
- Input : transcription (Excel File), tuples (page id, patch) extracted in week 9
- Output : line in the transcription -> page id
- Step 1 : HTR recognition in the patch and cleaning : (patch, text)
- Step 2 : Find matching pair between recognized text and transcription
- Step 3 : New excel file with the new page id column
Week 11
- Step 1 : Apply the pipeline validated on week 10 on the whole dataset
- Step 2 : Begin a minimalist web interface development
Week 12
- Wrapping everything together and finalizing the web interface