Main Page: Difference between revisions

From FDHwiki
Line 43: Line 43:
* FDH-1-1 (1h) What Are Digital Humanities :  Digital Humanities, Digital Studies, Humanities Computing and Studies about Digital Culture. Digital Humanism vs. Digital Humanities. Why digital methods tend to dissolve traditional disciplinary frontiers. A focus on practice. Translation issues.
* FDH-1-1 (1h) What Are Digital Humanities :  Digital Humanities, Digital Studies, Humanities Computing and Studies about Digital Culture. Digital Humanism vs. Digital Humanities. Why digital methods tend to dissolve traditional disciplinary frontiers. A focus on practice. Translation issues.
* FDH-1-2 (1h) Digital Humanities as a field : Big Data Digital Humanities vs Small Data Digital Humanities. The 3 circles.  Exercise on relationship between elements in Digital Culture schema.
* FDH-1-2 (1h) Digital Humanities as a field : Big Data Digital Humanities vs Small Data Digital Humanities. The 3 circles.  Exercise on relationship between elements in Digital Culture schema.
* FDH-1-3 (2h) Big Data of the Past. Data acceleration regime. Inferred Patterns. Redocumentation. Fictional Spaces [https://tube.switch.ch/videos/c2a637c2 Video recording link (2020)].
* FDH-1-3 (2h) Big Data of the Past. Data acceleration regime. Inferred Patterns. Redocumentation. Fictional Spaces.


==== Week 2 :  Patrimonial Capitalism and Commons  ====
==== Week 2 :  Patrimonial Capitalism and Commons  ====
Line 49: Line 49:
18.09 :   
18.09 :   


* FDH 1-4 Patrimonial Capitalism (1h) Introduction to the DH circle linking the digitisation of sources, their processing, their analysis, visualisation and the creation of societal value (insight, culture) leading ultimately to the digitisation of new sources. Presentation of some sustainable DH circles (genealogy, image banks). Patrimonial capitalism and the risk of monopolistic companies. Parallelism with the race for sequencing the Human Genome. [https://tube.switch.ch/videos/e02cfdd0 Video recording link]. [https://tube.switch.ch/videos/t5WxZR3avB Video recording link (2021)]
* FDH 1-4 Patrimonial Capitalism (1h) Introduction to the DH circle linking the digitisation of sources, their processing, their analysis, visualisation and the creation of societal value (insight, culture) leading ultimately to the digitisation of new sources. Presentation of some sustainable DH circles (genealogy, image banks). Patrimonial capitalism and the risk of monopolistic companies. Parallelism with the race for sequencing the Human Genome.  


* FDH 1-5 The Commons (1h) What are the commons? What is the public domain? History and evolution. Copyright overreach. Frontal collision. Governing with the commons [https://tube.switch.ch/videos/79fa0d4e Video recording link]. [https://tube.switch.ch/videos/9n4jojmzpd/ Video recording link (2021)]
* FDH 1-5 The Commons (1h) What are the commons? What is the public domain? History and evolution. Copyright overreach. Frontal collision. Governing with the commons.


19.09 :
19.09 :


* FDH 1-6 Anatomy of a large-scale project (1h) Venice Time Machine. European Time Machine. [https://tube.switch.ch/videos/93a6e77b Video recording link (pre-recorded)]. [https://tube.switch.ch/videos/61b8d58f Video recording link (live)] [[:File:FDH2023-1-6.pdf|FDH 1-6 slides]]
* FDH 1-6 Anatomy of a large-scale project (1h) Venice Time Machine. European Time Machine.  
* Past projects presentation. [https://tube.switch.ch/videos/64542a60 Video recording link] [https://tube.switch.ch/videos/jnrEK9A55w/  Video recording link (2021)].
* Past projects presentation.  
* FDH 1-7 Projects. See also [[Projects]] PDF of 2023 Projects
* FDH 1-7 Projects. See also [[Projects]] PDF of 2023 Projects


Line 69: Line 69:
26.09 :
26.09 :


* (2h) FDH 2-2 Introduction to the Digitization Process. The Story of Google books. Document digitization as a problem of conversion of dimensions. Digitization is logistic optimization. Alienation. Digitization on demand. [https://tube.switch.ch/videos/6cb9541c Video recording link] [https://tube.switch.ch/videos/7sxSQKh4ZV/ Video recording link (2021)].
* (2h) FDH 2-2 Introduction to the Digitization Process. The Story of Google books. Document digitization as a problem of conversion of dimensions. Digitization is logistic optimization. Alienation. Digitization on demand.


* (2h) Document Structure. General presentation of the pipeline. Content and Structure. Circulation. Standards. Open Annotation Data Model. Shared Canvas. IIIF. Synchronic patterns and diachronic homology. [https://tube.switch.ch/videos/247ce03e Video recording link] [https://tube.switch.ch/videos/ojhOdWVXZ5 Video recording link (2021)]
* (2h) Document Structure. General presentation of the pipeline. Content and Structure. Circulation. Standards. Open Annotation Data Model. Shared Canvas. IIIF. Synchronic patterns and diachronic homology.  


==== Week 4: Writing Systems and Text Encoding  ====
==== Week 4: Writing Systems and Text Encoding  ====
Line 77: Line 77:
2.10 :
2.10 :


(2h) FDH 2-3 : Writing Systems [https://tube.switch.ch/videos/2e3b9a81 Video recording link] [https://tube.switch.ch/videos/VXZ8E9Q7kR/ Video recording link (2021)]
(2h) FDH 2-3 : Writing Systems  


3.10 :
3.10 :
   
   
- (2h) FDH 2-4 : Text Encoding [https://tube.switch.ch/videos/95c94280/ Video recording link] [https://tube.switch.ch/videos/fs8485BgLM/ Video recording link (2021)]
- (2h) FDH 2-4 : Text Encoding  


* (2h)  [[Projects]] presentations. 5' per project with max 3 slides. Fill out the [[Projects#Groups|group table]] before the course. You can find a group using the [https://annuel2.framapad.org/p/fdh framapad]. [https://tube.switch.ch/videos/7eef2f77 Video recording link] [https://tube.switch.ch/videos/J7YimPiGKw Video recording link (2021)]
* (2h)  [[Projects]] presentations. 5' per project with max 3 slides. Fill out the [[Projects#Groups|group table]] before the course. You can find a group using the [https://annuel2.framapad.org/p/fdh framapad].  


==== Week 5: Text Processing and Understanding ====
==== Week 5: Text Processing and Understanding ====
Line 89: Line 89:
9.10 :
9.10 :


(2h) FDH 2-5 Text Processing : Diachronic and synchronic analysis. n-grams, TF-IDF, Topic Modeling, Word Space Models and Word embeddings (2h) [https://tube.switch.ch/videos/4aeff31f Video recording link.] [https://tube.switch.ch/videos/4s6Zxdgx7W/ Video recording link (2021)]
(2h) FDH 2-5 Text Processing : Diachronic and synchronic analysis. n-grams, TF-IDF, Topic Modeling, Word Space Models and Word embeddings (2h)  


10.10 :
10.10 :


(2h) FDH 2-6 Text Understanding : Close, surface, distant and machine reading, Information extraction, Named Entities, Resources, Large-Scale Projects (2h) [https://tube.switch.ch/videos/b9821f1e Video recording link.]. Work on Project (2h).
(2h) FDH 2-6 Text Understanding : Close, surface, distant and machine reading, Information extraction, Named Entities, Resources, Large-Scale Projects (2h) Work on Project (2h).


==== Week 6:  Images  ====
==== Week 6:  Images  ====
Line 99: Line 99:
16.10 :
16.10 :


(2h) FDH 2-7 : Image systems. [http://www.tube.switch.ch/videos/521a27b7 Video recording link]
(2h) FDH 2-7 : Image systems.  


17.10 :
17.10 :


(2h) FDH 2-8 : Image processing [https://tube.switch.ch/videos/b36df7a0 Video recording link] [https://diamond.timemachine.eu/ Time machine search engine] (2h) Work on project. Tutorial (Grasshopper / Rhino)
(2h) FDH 2-8 : Image processing (2h) Work on project.  


(FDH 2-9 : Image understanding not done this year)
(FDH 2-9 : Image understanding not done this year)
Line 122: Line 122:
23.10 :
23.10 :


(2h) FDH-2-10 Map systems [https://tube.switch.ch/videos/d4ab83fe/ Video recording link]
(2h) FDH-2-10 Map systems  


24.10 :
24.10 :


(2h) FDH-2-11 Map processing (2h) [https://tube.switch.ch/videos/18e9f30a Video recording link] Work on project
(2h) FDH-2-11 Map processing (2h) Work on project


==== Week 8: Architecture and Objects  ====
==== Week 8: Architecture and Objects  ====
Line 132: Line 132:
30.10 :
30.10 :
   
   
(2h) FDH-2-12: Architecture and Object Systems. [https://tube.switch.ch/videos/2e8468fd Video recording link]
(2h) FDH-2-12: Architecture and Object Systems.  


31.10 :
31.10 :


(2h) FDH-2-13: Architecture and Object Processing: Modelling vs Sampling : Model-based Procedural methods. Architectural grammars. Class I and Class II elements. The question of realism. [https://tube.switch.ch/videos/397be5e4 Video recording link]. (2h) Work on project
(2h) FDH-2-13: Architecture and Object Processing: Modelling vs Sampling : Model-based Procedural methods. Architectural grammars. Class I and Class II elements. The question of realism. (2h) Work on project


=== Part III : Knowledge modelling and processing ===
=== Part III : Knowledge modelling and processing ===
Line 144: Line 144:
6.11 :
6.11 :


- (1h) FDH-3-0 Summary of the concepts covered so far and introduction to part 3 [https://tube.switch.ch/videos/efa55bf4 Video recording link]
- (1h) FDH-3-0 Summary of the concepts covered so far and introduction to part 3


- (1h) FDH-3-1 Semantic modelling. RDF, Metaknowledge [https://tube.switch.ch/videos/4ac03a54 Video recording link]
- (1h) FDH-3-1 Semantic modelling. RDF, Metaknowledge  




7.11 :
7.11 :


(2h) FDH 3-2 Universal Ontologies [https://tube.switch.ch/videos/130eace5/ Video recording link]
(2h) FDH 3-2 Universal Ontologies  


- Work on project (2h)
- Work on project (2h)
Line 159: Line 159:
13.11 :
13.11 :


(2h) FDH 3-3 Rule systems, simulations and parallel worlds [https://tube.switch.ch/videos/f315a5f3 Video recording link]
(2h) FDH 3-3 Rule systems, simulations and parallel worlds  


14.11 :
14.11 :
Line 208: Line 208:
20.11 :
20.11 :


(2h) FDH 3-4 Non conceptual knowledge systems [https://tube.switch.ch/videos/64c53fc5/ Video recording link (Part 1)] [https://tube.switch.ch/videos/dd24ce0a/ Video recording link (Part 2)]
(2h) FDH 3-4 Non conceptual knowledge systems  


21.11 :
21.11 :


(2h) FDH 3-5 Topological data science [https://tube.switch.ch/videos/f21c15b0 Video recording link]
(2h) FDH 3-5 Topological data science  


=== Part IV : Platforms ===
=== Part IV : Platforms ===
Line 220: Line 220:
27.11 :
27.11 :


(2h) Data Management : FAIR principles, Creative Commons, Data Management models, Sustainability, Right to be Forgotten. Management of uncertainty, incoherence and errors. Iconographic principle of precaution [https://tube.switch.ch/videos/fa8d5847/ Video recording link]
(2h) Data Management : FAIR principles, Creative Commons, Data Management models, Sustainability, Right to be Forgotten. Management of uncertainty, incoherence and errors. Iconographic principle of precaution


28.11 :
28.11 :


(2h) User Management : Part I: Persona. Part II: Motivation and onboarding dynamics. Three case studies: Twitter. Quora. Wikipedia. Part III: "Wisdom" of the crowds. Collectivism vs Liberalism. Open source as a form of liberalism for engineering. The ambiguity of forks. Part IV: The "power" of the crowds. Mechanical Turk. Crowdflower. Crowdfunding. [https://tube.switch.ch/videos/26697fee/ Video recording link]
(2h) User Management : Part I: Persona. Part II: Motivation and onboarding dynamics. Three case studies: Twitter. Quora. Wikipedia. Part III: "Wisdom" of the crowds. Collectivism vs Liberalism. Open source as a form of liberalism for engineering. The ambiguity of forks. Part IV: The "power" of the crowds. Mechanical Turk. Crowdflower. Crowdfunding.


(2h) Bot Management : Three case studies on bot management : Twitter, Wikipedia, Google. [https://tube.switch.ch/videos/b3bef9b2/ Video recording link]
(2h) Bot Management : Three case studies on bot management : Twitter, Wikipedia, Google.  


==== TBD / Project Work  ====
==== TBD / Project Work  ====

Revision as of 21:01, 10 September 2024

Welcome to the wiki of the course Foundation of Digital Humanities (DH-405).

Contact

Professor: Frédéric Kaplan

Assistants: Alexander Rusnak, Tristan Krach

Rooms: Wednesday (CM1110) and Thursday (BC03)

Links

Summary

This course gives an introduction to the fundamental concepts and methods of the Digital Humanities, both from a theoretical and applied point of view. The course introduces the Digital Humanities circle of processing and interpretation, from data acquisition to new understandings and services. The first part of the course presents the technical pipelines for digitising, analysing and modelling written documents (printed and handwritten), maps, photographs, and 3D objects and environments. The second part of the course details the principles of the most important algorithms, in particular deep learning approaches (for document analysis and image generation) and knowledge modelling (semantic web, ontologies, graph databases). The third part of the course focuses on platform management from the points of view of data, users and bots. Students will practise the skills they learn by engaging in a class-wide collective project.

Plan

Part I : Concepts

Week 1 : What are Digital Humanities?

12.09 :

(2h) Welcome and Introduction to the course

  • FDH-0 (1h) Introduction to the course and Digital Humanities, structure of the course. Introduction to Framapad and Slido with a simple exercise. Principle of collective note taking and its use in the course. State of the Digital Humanities at EPFL, in Switzerland and in Europe.

13.09 :

(4h) What are Digital Humanities? What is their object of study?

  • FDH-1-1 (1h) What Are Digital Humanities : Digital Humanities, Digital Studies, Humanities Computing and Studies about Digital Culture. Digital Humanism vs. Digital Humanities. Why digital methods tend to dissolve traditional disciplinary frontiers. A focus on practice. Translation issues.
  • FDH-1-2 (1h) Digital Humanities as a field : Big Data Digital Humanities vs Small Data Digital Humanities. The 3 circles. Exercise on relationship between elements in Digital Culture schema.
  • FDH-1-3 (2h) Big Data of the Past. Data acceleration regime. Inferred Patterns. Redocumentation. Fictional Spaces.

Week 2 : Patrimonial Capitalism and Commons

18.09 :

  • FDH 1-4 Patrimonial Capitalism (1h) Introduction to the DH circle linking the digitisation of sources, their processing, their analysis, visualisation and the creation of societal value (insight, culture) leading ultimately to the digitisation of new sources. Presentation of some sustainable DH circles (genealogy, image banks). Patrimonial capitalism and the risk of monopolistic companies. Parallelism with the race for sequencing the Human Genome.
  • FDH 1-5 The Commons (1h) What are the commons? What is the public domain? History and evolution. Copyright overreach. Frontal collision. Governing with the commons.

19.09 :

  • FDH 1-6 Anatomy of a large-scale project (1h) Venice Time Machine. European Time Machine.
  • Past projects presentation.
  • FDH 1-7 Projects. See also Projects PDF of 2023 Projects

Part II : Pipelines

Week 3: Digitisation

25.09 :

  • FDH 2-1 No Class or Venice Data presentation (tbc)

26.09 :

  • (2h) FDH 2-2 Introduction to the Digitization Process. The Story of Google books. Document digitization as a problem of conversion of dimensions. Digitization is logistic optimization. Alienation. Digitization on demand.
  • (2h) Document Structure. General presentation of the pipeline. Content and Structure. Circulation. Standards. Open Annotation Data Model. Shared Canvas. IIIF. Synchronic patterns and diachronic homology.

Week 4: Writing Systems and Text Encoding

2.10 :

(2h) FDH 2-3 : Writing Systems

3.10 :

- (2h) FDH 2-4 : Text Encoding

  • (2h) Projects presentations. 5' per project with max 3 slides. Fill out the group table before the course. You can find a group using the framapad.

Week 5: Text Processing and Understanding

9.10 :

(2h) FDH 2-5 Text Processing : Diachronic and synchronic analysis. n-grams, TF-IDF, Topic Modeling, Word Space Models and Word embeddings (2h)

10.10 :

(2h) FDH 2-6 Text Understanding : Close, surface, distant and machine reading, Information extraction, Named Entities, Resources, Large-Scale Projects (2h) Work on Project (2h).

Week 6: Images

16.10 :

(2h) FDH 2-7 : Image systems.

17.10 :

(2h) FDH 2-8 : Image processing (2h) Work on project.

(FDH 2-9 : Image understanding not done this year)


Week Off

23.10 :

No course

24.10 :

No Course

Week 7: Maps

23.10 :

(2h) FDH-2-10 Map systems

24.10 :

(2h) FDH-2-11 Map processing (2h) Work on project

Week 8: Architecture and Objects

30.10 :

(2h) FDH-2-12: Architecture and Object Systems.

31.10 :

(2h) FDH-2-13: Architecture and Object Processing: Modelling vs Sampling : Model-based Procedural methods. Architectural grammars. Class I and Class II elements. The question of realism. (2h) Work on project

Part III : Knowledge modelling and processing

Week 9 : Semantic modelling

6.11 :

- (1h) FDH-3-0 Summary of the concepts covered so far and introduction to part 3

- (1h) FDH-3-1 Semantic modelling. RDF, Metaknowledge


7.11 :

(2h) FDH 3-2 Universal Ontologies

- Work on project (2h)

Week 10 : Constraints and Rule systems

13.11 :

(2h) FDH 3-3 Rule systems, simulations and parallel worlds

14.11 :

Midterm presentation (10%)

-- Project plan and milestones deliverable on the Wikipage of each project (10%)

Time Project name
10:20-10:40 Group 9
10:40-11:00 Group 8
11:00-11:20 Group 7
11:20-11:40 Group 6
11:40-12:00 Group 5
Time Project name
13:15-13:35 Group 2
13:35-13:55 Group 3
13:55-14:15 Group 4
14:15-14:35 Group 1

Week 11 : Non conceptual knowledge systems and topological data science

20.11 :

(2h) FDH 3-4 Non conceptual knowledge systems

21.11 :

(2h) FDH 3-5 Topological data science

Part IV : Platforms

Week 12 : Data, User and Bot Management

27.11 :

(2h) Data Management : FAIR principles, Creative Commons, Data Management models, Sustainability, Right to be Forgotten. Management of uncertainty, incoherence and errors. Iconographic principle of precaution

28.11 :

(2h) User Management : Part I: Persona. Part II: Motivation and onboarding dynamics. Three case studies: Twitter. Quora. Wikipedia. Part III: "Wisdom" of the crowds. Collectivism vs Liberalism. Open source as a form of liberalism for engineering. The ambiguity of forks. Part IV: The "power" of the crowds. Mechanical Turk. Crowdflower. Crowdfunding.

(2h) Bot Management : Three case studies on bot management : Twitter, Wikipedia, Google.

TBD / Project Work

Final Week : Project Presentation

-- Due: GitHub repository (10%)

-- Due: Report writing (40%)

(2h) Final project presentation (20%)

Resources

Assessment and grading grid

  • (Group work) 2 oral presentations (30%)
    • 1 midterm presentation of the project (10%)
    • 1 final presentation discussing the project results (20%)
  • (Group work) Written deliverables (Wiki writing) (20%)
  • (Group work) Quality of the project (30%)
  • (Individual work) Exam on Course Content (20%)
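The weights listed above sum to 100%. As a minimal sketch, assuming every component is graded on the same scale and the components are combined linearly (the wiki does not state the combination rule), the overall grade could be computed as follows. The names `WEIGHTS` and `weighted_grade` are illustrative only.

```python
# Weights copied from the assessment list above.
# Assumption: components are combined as a simple weighted sum.
WEIGHTS = {
    "midterm_presentation": 0.10,
    "final_presentation": 0.20,
    "written_deliverables": 0.20,
    "project_quality": 0.30,
    "exam": 0.20,
}


def weighted_grade(component_grades):
    """Combine per-component grades (all on the same scale) into one grade."""
    missing = set(WEIGHTS) - set(component_grades)
    if missing:
        raise ValueError(f"missing components: {sorted(missing)}")
    return sum(WEIGHTS[name] * component_grades[name] for name in WEIGHTS)
```

A student with the same grade in every component gets that grade overall, which is a quick check that the weights are consistent.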

2 collective oral presentations (30%)

Midterm presentation of the project plan (10%)

10' max presentation + 5' questions

Grading grid :

  • The presentation contains a plan (4)
  • + 0.5 The slides are clear and well presented
  • + 0.5 The oral presentation is dynamic and fluid
  • + 0.5 The plan is realistic.
  • + 0.5 The students answer the questions well

Final presentation discussing the project results (20%)

10-15' for presentation and 5-10' for questions

Grading grid :

  • The presentation presents the results of the project (4)
  • + 0.5 The slides are clear and well presented
  • + 0.5 The oral presentation is dynamic and fluid
  • + 0.5 The results are well discussed
  • + 0.5 The students answer the questions well
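
Both presentation grids follow the same arithmetic: a base of 4 when the main requirement is met, plus 0.5 for each of the four additional criteria, for a maximum of 6. The sketch below illustrates this under the assumption that the base requirement is satisfied; the grids do not say what happens otherwise, so that case is not modelled. The function name `presentation_grade` is hypothetical.

```python
BASE_GRADE = 4.0           # awarded when the presentation meets its main requirement
BONUS_PER_CRITERION = 0.5  # added for each additional criterion that is met


def presentation_grade(criteria_met):
    """Return the grade given four booleans, one per bonus criterion.

    Assumption: the base requirement (a plan for the midterm, results for
    the final) is met; the grid does not define the grade otherwise.
    """
    flags = list(criteria_met)
    if len(flags) != 4:
        raise ValueError("expected exactly four bonus criteria")
    return BASE_GRADE + BONUS_PER_CRITERION * sum(bool(f) for f in flags)
```

For example, a presentation that meets two of the four bonus criteria would receive 5.0 under this reading.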

Written deliverables (Wiki writing) (20%)

  • Project plan and milestones (5%) (>300 words)
  • Motivation and description of the deliverables (5%) (>300 words)
  • Detailed description of the methods (5%) (>500 words)
  • Quality assessment and discussion of limitations (5%) (>300 words)

The word counts indicated are minimums. The detailed description of the methods, in particular, can be longer if needed.

Production (30%)

  • Quality of the realisation 20%
  • Code deliverable on GitHub 10%


Exam on Course Content (20%)

  • A series of questions on the course to ensure the core concepts are understood.