WikiBio: Difference between revisions

From FDHwiki
Jump to navigation Jump to search
Line 6: Line 6:
== Data sources ==
== Data sources ==


[[File:Marcopolo3-data.png]]


In this project, we make use of two different data sources:
In this project, we make use of two different data sources:


* Wikidata is used to gather the structured information about the people who lived in the Republic of Venice. Multiple information are extracted from their wikidata entries, such as: birth and death times, professions and family names. To gether the data from wikidata, a customizable SPARQL query is used.
* Wikidata is used to gather the structured information about the people who lived in the Republic of Venice. Multiple information are extracted from their wikidata entries, such as: birth and death times, professions and family names. To gether the data from wikidata, a customizable SPARQL query is used.
[[File:Marcopolo3-data.png|thumb|240px|The schema of data acquisition step]]


== Generation methods ==
== Generation methods ==

Revision as of 13:36, 19 November 2020

Motivation

The motivation for our project was to explore the possibilities of natural-language generation in the context of biography generation. It is easy to get structural data from the Wikidata pages, but not all the Wikidata pages have a corresponding Wikipedia page. This project will showcase how we can use the structural data from the Wikidata pages to generate realistic biographies in the Wikipedia pages format.

Project plan

Data sources

In this project, we make use of two different data sources:

  • Wikidata is used to gather the structured information about the people who lived in the Republic of Venice. Multiple information are extracted from their wikidata entries, such as: birth and death times, professions and family names. To gether the data from wikidata, a customizable SPARQL query is used.
The schema of data acquisition step

Generation methods

Evaluation

Evaluation-bio.png

Automatic

Human

Deliverables