Course 2024-2025 a.y.

30412 - MACHINE LEARNING

Department of Computing Sciences

Course taught in English

Student consultation hours
Class timetable
Exam timetable

BEMACS (6 credits - II sem. - OB | INF/01)

Course Director:
ANDREA CELLI

Classes: 25 (II sem.)

Instructors:
Class 25: ANDREA CELLI

Suggested background knowledge

For an effective learning experience, it is recommended a basic knowledge of Python programming, elementary calculus and linear algebra, probability and statistics.

Mission & Content Summary

MISSION

Machine learning is an exciting and fast-moving field of computer science with many applications such as computer vision, speech recognition, biology and medicine (e.g., predicting protein-protein interactions, predicting proteins’ 3D structure), finance, and many consumer applications such as recommending music and movies, and drive autonomous cars. This course provides an introduction to the theoretical foundations and practical algorithms of modern machine learning.

CONTENT SUMMARY

Basics of optimization
Supervised learning: linear regression, logistic regression, kernel methods, SVM, decision trees, ensemble methods
Neural Networks and backpropagation
Bias-variance tradeoff, model selection
Unsupervised learning: Principal Component Analysis, hierarchical clustering, k-means
Basics of Reinforcement learning
Implementing Machine Learning Pipelines with sklearn

Intended Learning Outcomes (ILO)

KNOWLEDGE AND UNDERSTANDING

At the end of the course student will be able to...

State and use the basic results of statistical inference for data analysis.
Distinguish and characterize the following basic machine learning problems: regression, classification, clustering, dimensionality reduction.
Describe the basic conceptual ideas, strengths, and limitations of the different learning algorithms for the settings above.
Evaluate the main trade-offs such as overfitting, and computational cost vs accuracy.

APPLYING KNOWLEDGE AND UNDERSTANDING

At the end of the course student will be able to...

Implement end-to-end machine learning pipelines and rigorously evaluate their performance.
Demonstrate the ability to cope with real-world data analysis and problem solving (managing, preprocessing and analyze real datasets).

Teaching methods

Lectures
Practical Exercises
Collaborative Works / Assignments

DETAILS

Face-to-face lectures focus on the theoretical, methodological, and computational aspects of the topics covered by the course.
In hands-on exercise sessions students will work on their laptops to implement solutions to relevant case studies. Students will use Python and common libraries such as scikit-learn.
Individual/group assignments: each student is required to solve a machine learning problem and provide a written report and the related code.

Assessment methods

	Continuous assessment	Partial exams	General exam
Written individual exam (traditional/online)			x

ATTENDING AND NOT ATTENDING STUDENTS

The final exam consists of a written general exam, with an optional hands-on project.

The written exam is mainly aimed at testing the learning of the concepts, methods and techniques illustrated in the course and their correct understanding. The hands-on project is mainly functional to assess the ability to apply such techniques to real data.

There are two options to pass the exam:

Option 1: final exam for the whole 31 points. It will consist of questions (closed & open answer) and exercises to be answered on paper. The final exam is closed-book.
Option 2: final exam (same rules as above) + ML project

The theory part consists in exercises and questions to be answered on paper. This contributes to 70% of the final grade.
The project consists in programming code to solve a real ML problem, to be developed individually or in small groups and described through a written report, which is evaluated by the instructors. This contributes to 30% of the final grade.
In order to pass the exam with Option 2, students must achieve a passing grade in both the theory and the project part (that is, at least 18 points out of 31 in each part). If the student is sufficient in both parts, then the grade is computed as 0.7 X + 0.3 Y, where X and Y are the scores obtained in the final written exam and in the project, respectively.
The written exam is the same regardless of whether or not you participate in the challenge
There is going to be one ML project for each edition of the course. If you participate in the project and obtain a passing grade, you can keep such grade and cumulate it with a final exam done in later sessions (it can be used up to the September session of the same year in which you did the project)

The written exam is used to asses the "knowledge and understanding" learning objectives. The project is used to asses the "applying knowledge and understanding" learning objectives.

Teaching materials

ATTENDING AND NOT ATTENDING STUDENTS

The course mostly relies on the following two books (both available online for free):

Lindholm, A., Wahlström, N., Lindsten, F., & Schön, T. B. (2022). Machine learning: a first course for engineers and scientists.
James, G., Witten, D., Hastie, T., Tibshirani, R., & Taylor, J. (2023). An introduction to statistical learning: With applications in Python.

Other useful references:

C.M. BISHOP (2006). Pattern Recognition and Machine Learning.
T. HASTIE, R. TIBSHIRANI, J. FRIEDMAN (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction.

Slides and/or notes summarizing the contents presented in class are provided.

Last change 18/11/2024 11:55