Research Center for Digital Sustainability

NLP Seminar Spring 2022

This seminar provides a conceptual and practical introduction to modern Natural Language Processing (NLP) methods and technologies. Each lecture introduces a new NLP approach based on a seminal publication and includes a presentation by an academic guest speaker. The NLP methods include bag-of-words (BoW), term frequency–inverse document frequency (TF-IDF), word2vec, long short-term memory (LSTM), latent Dirichlet allocation (LDA), transformers, BERT, and GPT-3. The lectures were recorded in the previous semester and are (in most cases) available on BigBlueButton. The seminar schedule, which lists the relevant research articles and blog posts, is available here.

Each student will conduct and present an individual project related to NLP. The topic descriptions can be found here. So that everyone can learn as much as possible from the other students' presentations, active participation is mandatory.

This seminar is mandatory for all students writing a bachelor's or master's thesis at the Research Center for Digital Sustainability. They will present the progress of their theses in the seminar.

The Natural Language Processing Seminar will end with this edition because of a lack of funding.

Time, Location, and Links

Schedule 2022

Date, Topic, Mandatory Paper or Blog Post and Recording

25 February 2022
Overview and introduction, NRP77 project on reidentification of Swiss judgments, presentation of topics for a thesis project
Deep Learning Quiz:

4 March 2022
Bag-of-words (BoW) and term frequency-inverse document frequency (TF-IDF)
Starting Presentations: David Bucher, Ronja Stern
A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques:
Dominic Schweizer, University of Bern

11 March 2022
Intro to FDN-Sandbox by Adrian
Intermediate Presentation: Marco Buchholz
Prof. Dr. Tobias Hodel, Digital Humanities University of Bern

18 March 2022
Presentation of student project proposals

25 March 2022
Recurrent Neural Networks
Presentation of student project proposals
Lecture Materials:
Exercise Solutions:
Dr. Mathias Müller, Postdoc and Lecturer at University of Zürich

1 April 2022
ML and NLP in industry
Starting Presentation: Tobias Brugger
Technical Debt:
Compressible Subspace:
Siddhartha Singh

8 April 2022
Building Knowledge Graphs using NLP
Starting Presentation: Roman Martinez
Knowledge Graphs:
Prof. Dr. Patrizio Collovà, Bern University of Applied Sciences

15 April 2022
Good Friday!

22 April 2022
Spring Break!

29 April 2022
Text Analysis with Contextualized Topic Models
Nice introduction about topic modeling:
Blog post about contextualized topic models for zero-shot cross-lingual prediction:
Colab Notebook:
Silvia Terragni, PhD student at University of Milano-Bicocca

6 May 2022
Transformers
Joel Niklaus, University of Bern

13 May 2022
GPT-3: Lessons from Generative Pre-Training and AI Marketing
Dr. Simon Clematide, Academic Associate at University of Zurich

20 May 2022
BERT
Dr. Ilias Chalkidis, NLP Postdoctoral Researcher at University of Copenhagen

27 May 2022
Feedback for Posters
Come with a first draft of the poster prepared!

3 June 2022
Poster Session for final presentation of projects