Venice Families and Real Estate: Difference between revisions
Hannah.casey (talk | contribs) |
Hannah.casey (talk | contribs) |
||
Line 9: | Line 9: | ||
==== The Sudetto problem ==== | ==== The Sudetto problem ==== | ||
The sudetto problem appeared quite early when working with the cadaster from 1808. Whenever multiple consecutive parcels have the same owner, most of the later entries have no owner specified anymore but the value of the column is 'sudetto'. Sometimes the later entries still contain the actual name of the parcel owner, in addition to 'sudetto'. | The sudetto problem appeared quite early when working with the cadaster from 1808. Whenever multiple consecutive parcels have the same owner, most of the later entries have no owner specified anymore but the value of the column is 'sudetto'. Sometimes the later entries still contain the actual name of the parcel owner, in addition to 'sudetto'. In order to solve this problem, we iterated once through the entire dataset and filled in all the 'sudetto' with the value of the previous parcel owner. | ||
==== Family names ==== | |||
Since most of our analysis is based on families and we assume that if a owner of a parcel has the same surname as another, they belong to the same family, extracting the family names was one of the first tasks we attempted to tackle. In the Cadaster we assume that for most of the areas (all except Canareggio) the surnames of parcel owners are written in capital letters. Using this information, we managed to extract some family names from the cadaster using a simple regex pattern matching. However, some other entities are also written in all capitals, especially public institutions. | |||
The extracted names were then compared to a list of familiy names we received. The list contains information about families in old venice like their noble status. Unfortunately, the list also contains all names of public institutions, so it became redundant to our purposes. | |||
Hence, a list of names of public institutions (by no means complete) was used to filter the extracted names. | |||
{| class="wikitable" | {| class="wikitable" |
Revision as of 17:06, 24 November 2021
Introduction
The Catastici and the Cadaster hold information about who owned buildings in Venice in the years 1741 and 1808. In order to find out how power shifted between families in this time, we will look at the change in real estate between these years.
Motivation
Plan and Milestones
First Steps
The first step working on the project was to familiarize ourselves with the datasets. We started working on the cadaster, as we assumed it would be easier to handle. After extracting some general information about the owners of the parcels, quickly we had to realize that the parcel owner column of the dataset is not homogenous in the way it is constructed. This is due to the way the data was extracted, and as it is based on scans of hand written documents, we found we had to deal with all kinds of problems associated with handwritten documents.
The Sudetto problem
The sudetto problem appeared quite early when working with the cadaster from 1808. Whenever multiple consecutive parcels have the same owner, most of the later entries have no owner specified anymore but the value of the column is 'sudetto'. Sometimes the later entries still contain the actual name of the parcel owner, in addition to 'sudetto'. In order to solve this problem, we iterated once through the entire dataset and filled in all the 'sudetto' with the value of the previous parcel owner.
Family names
Since most of our analysis is based on families and we assume that if a owner of a parcel has the same surname as another, they belong to the same family, extracting the family names was one of the first tasks we attempted to tackle. In the Cadaster we assume that for most of the areas (all except Canareggio) the surnames of parcel owners are written in capital letters. Using this information, we managed to extract some family names from the cadaster using a simple regex pattern matching. However, some other entities are also written in all capitals, especially public institutions. The extracted names were then compared to a list of familiy names we received. The list contains information about families in old venice like their noble status. Unfortunately, the list also contains all names of public institutions, so it became redundant to our purposes. Hence, a list of names of public institutions (by no means complete) was used to filter the extracted names.
Date | Task | Completion |
---|---|---|
Week 5 |
|
✓ |
Week 6 |
|
✓ |
Week 7 |
|
✓ |
Week 8 |
|
✓ |
Week 9 |
|
✓ |
Week 10 |
|
|
Week 11 |
|
|
Week 12 |
|
|
Week 13 |
|
|
Week 14 |
|