Course 2024-2025 a.y.

20879 - LANGUAGE TECHNOLOGY

Department of Computing Sciences

Course taught in English
AI (8 credits - II sem. - OB  |  ING-INF/05)
Course Director:
DIRK HOVY

Classi: 29 (I/II sem.)
Docenti responsabili delle classi:
Classe 29: DIRK HOVY


Conoscenze pregresse consigliate

To feel comfortable in this course, you should have good knowledge of programming in Python, as well as simple linear algebra (what are vectors and matrices, how are they multiplied) and probability theory (what is a probability distribution, what is conditional probability). Additional knowledge of data structures (classes, Counter, defaultdict) make many of the applications easier to solve.

Mission e Programma sintetico

MISSION

Natural Language Processing and language technology tools are becoming ubiquitous: from everyday tools like machine translation or smart speakers, to industry applications for hiring, customer analysis, etc. Machine-learning based text analysis tools provide a range of possibilities and are a growing field of expertise. The advance of large language models like (chatGPT, etc) have changed and greatly expanded NLP capabilities. This course provides an overview and hands-on experience in all relevant techniques.

PROGRAMMA SINTETICO

Information theory, basics and history of NLP, language models, representations, topic models, classification, NLP applications, ethics of AI and NLP.


Risultati di Apprendimento Attesi (RAA)

CONOSCENZA E COMPRENSIONE

Al termine dell'insegnamento, lo studente sarà in grado di...

- understand the power of large langauge models

- reason about the risks and benefits of various approaches

- come up with an appropriate method for a given problem

CAPACITA' DI APPLICARE CONOSCENZA E COMPRENSIONE

Al termine dell'insegnamento, lo studente sarà in grado di...

- implement various NLP methods

- develop, run, and analyze various tools


Modalità didattiche

  • Testimonianze (in aula o a distanza)
  • Esercitazioni pratiche
  • Lavori/Assignment individuali
  • Lavori/Assignment di gruppo

DETTAGLI

The course has lectures, with slides and explanantions, and associated practice collab notebooks.


Each student completes individual assignments to get experience in implementation details, and students work together in groups to solve a joint task. If applicable/available, students have the option to participate in external competitions such as Kaggle competitions or shared tasks in natural language processing.


Metodi di valutazione dell'apprendimento

  Accertamento in itinere Prove parziali Prova generale
  • Lavori /Assignment individuale (relazione, esercizio, dimostrazione, progetto etc.)
x    
  • Lavori/Assignment di gruppo (relazione, esercizio, dimostrazione, progetto etc.)
x    
  • Partecipazione in aula (virtuale, fisica)
x    

STUDENTI FREQUENTANTI E NON FREQUENTANTI

Best two out of three individual assignments (50%)
Final project (50%)

 

Projects are graded based on the performance of the system and the quality of the report. Assessment of projects will include their clarity of presentation and performance of models used, as well as ambitiousness of the project.

 

Additional points can be assigned for attendance and quiz performance throughout the semester.


Materiali didattici


STUDENTI FREQUENTANTI E NON FREQUENTANTI

Colab notebooks are provided for each class, as well as class notes for required reading.
 

OPTIONAL READING

  • Hovy, Dirk. Text Analysis in Python for Social Scientists, Discovery and Exploration. Cambridge University Press, 2020.
  • Jurafsky, Dan, and James H. Martin. Speech and language processing. Vol. 3. London: Pearson, 2014.
  • Manning, Christopher D., and Hinrich Schütze. Foundations of statistical natural language processing. MIT press, 1999.
  • Marsland, Stephen. Machine learning: an algorithmic perspective. CRC press, 2015.
  • Chollet, Francois. Deep learning with Python. Manning Publications Co., 2017.
  • Goldberg, Yoav. A Primer on Neural Network Models for Natural Language Processing. ArXiv, 2015.
  • Eisenstein, Jacob. Introduction to Natural Language Processing. MIT Press, 2019.
     
Modificato il 13/05/2024 15:27