Jerusalem: locating the colonies and neighborhoods: Difference between revisions
No edit summary |
No edit summary |
||
Line 21: | Line 21: | ||
== Data collection == | == Data collection == | ||
=== OCR method for paper book === | === OCR method for paper book === | ||
''Jerusalem and its Environs'' is a book written by xxxxxxx in xxxx. This book provides detailed information about development of the Jerusalem neighborhoods during different time periods. We used OCR technology to scan relevant information from the book. In addition, because many of the community names contain punctuations and annotations, we also performed manual proofreading to ensure the accuracy of the data. | ''Jerusalem and its Environs'' is a book written by xxxxxxx in xxxx. This book provides detailed information about development of the Jerusalem neighborhoods during different time periods. We used OCR technology to scan relevant information from the book. In addition, because many of the community names contain punctuations and annotations, we also performed manual proofreading to ensure the accuracy of the data. Here is an example of data from the book. Information about the name, founded year, inhabitants, initiative is provided. Remarks are also included in some cases. | ||
This source gives us the founded year of neighborhoods in Jerusalem, an important feature in our study. However, not every neighborhood has an exact year of construction. For neighborhoods where the founded year is an interval and where the year of construction is an ambiguous period (e.g., 1900s or end of Mandate), we choose the first year of the period for further processing. | |||
=== Crawler method for Wikipedia and Wikidata information === | === Crawler method for Wikipedia and Wikidata information === |
Revision as of 09:57, 21 December 2022
Introduction
The goal of this project is to study the construction of neighborhoods in Jerusalem over time. We collect information about Jerusalem neighborhoods from four different sources, including the book Jerusalem and its Environs, the Wikipedia category Neighbourhoods of Jerusalem, the Wikipedia list Places of Jerusalem - Neighborhoods and Wikidata entity neighborhood of Jerusalem. These sources provide us with different information with different focuses. We merge this content through matching methods and present it on a web page. The page organizes and visualizes all the information. With the timeline, search, and view details features, users can get a clear picture of the Jerusalem community in our map interface. At the same time, the matching approach we use can be easily applied to other cities with multiple sources of information (similar, different, or even contradictory), with the potential for reuse in the future.
Motivation
The study of the geography and chronology of neighborhoods in Jerusalem can provide valuable insights into the city's past and present. The location of a neighborhood can often reflect the social, economic, and political forces that shaped it, as well as the cultural traditions and values of its residents.
Examining the founding year of a neighborhood can also provide insight into the city's history and development. Visualizing the location and founded year of neighborhoods in Jerusalem can be a powerful tool for understanding the city's past and present. By mapping and analyzing these data, it is possible to gain a deeper understanding of the cultural, social, and economic dynamics of different neighborhoods and the forces that have shaped them.
A city with such a rich and varied history as Jerusalem has many different accounts of it. These accounts from various sources are an important basis when studying it. How to integrate the information from these sources is also one of the focuses of our research.
Deliverables
- OCR results of Development of Jerusalem neighborhoods information from Jerusalem and its Environs.
- Crawler results from Wikipedia category Neighbourhoods of Jerusalem, Wikipedia list Places of Jerusalem - Neighborhoods, and Wikidata entity neighborhood of Jerusalem.
- Integrated database with multiple information sources after perfect matching and fuzzy matching.
- An interactive and user-friendly webpage showing the changes in neighborhoods of Jerusalem with time, with:
- A timeline feature that illustrates the evolution of the construction of neighborhoods in Jerusalem over time.
- A search function that enables users to search for neighborhoods by name.
- A dedicated sub-page that contains relevant information for each neighborhood.
Methodology
Data collection
OCR method for paper book
Jerusalem and its Environs is a book written by xxxxxxx in xxxx. This book provides detailed information about development of the Jerusalem neighborhoods during different time periods. We used OCR technology to scan relevant information from the book. In addition, because many of the community names contain punctuations and annotations, we also performed manual proofreading to ensure the accuracy of the data. Here is an example of data from the book. Information about the name, founded year, inhabitants, initiative is provided. Remarks are also included in some cases.
This source gives us the founded year of neighborhoods in Jerusalem, an important feature in our study. However, not every neighborhood has an exact year of construction. For neighborhoods where the founded year is an interval and where the year of construction is an ambiguous period (e.g., 1900s or end of Mandate), we choose the first year of the period for further processing.
Crawler method for Wikipedia and Wikidata information
Data matching
Database establishment
Webpage development
Search function
Timeline feature
Result assessment
Limitations and Further Work
Limitations
Future work
Project Plan and Milestones
Date | Task | Completion |
---|---|---|
By Week 3 |
|
✓ |
By Week 4 |
|
✓ |
By Week 5 |
|
✓ |
By Week 6 |
|
✓ |
By Week 7 |
|
✓ |
By Week 8 |
|
✓ |
By Week 9 |
|
✓ |
By Week 10 |
|
✓ |
By Week 11 |
|
|
By Week 12 |
|
|
By Week 13 |
|
|
By Week 14 |
|
|
By Week 15 |
|