Course 2025-2026 a.y.

20867 - BUSINESS ECONOMICS - MODULE I (EMPIRICAL METHODS)

Department of Management and Technology

Course taught in English

27 - 28

EMIT (8 credits - I sem. - OB | SECS-P/06)

Course Director:
STEFANO BRESCHI

Classes: 27 (I sem.) - 28 (I sem.)

Instructors:
Class 27: STEFANO BRESCHI, Class 28: STEFANO BRESCHI

Mission & Content Summary

MISSION

In today’s data-driven landscape, understanding the dynamics of high-tech ventures requires mastery of complex, multi-source datasets and advanced analytical tools. This course empowers students to streamline and reshape raw startup ecosystem data—from fundraising rounds to performance metrics—into robust, analysis-ready formats. Through hands-on labs in Python and Stata, students will develop reproducible pipelines for data cleaning, integration, and visualization, then apply core econometric techniques (OLS, IV, panel models) to test hypotheses on growth, valuation, and investment trends. Throughout the lab, we emphasize critical thinking and problem-solving as essential skills for navigating complex data workflows and econometric challenges. By the end of the course, students will be fully prepared to carry out robust data transformations and econometric analyses across a range of academic and professional settings.

CONTENT SUMMARY

Part I – Data Engineering Lab

A. Python Foundations & Environment

IDEs and notebook workflows (Jupyter, VS Code, remote servers)
Core Python syntax: variables, data types (numeric, boolean, string), containers (lists, dicts, tuples, sets)
Control flow: if/elif/else, for and while loops, error handling (try/except)
Functions and modules: defining, scoping, lambdas, package management

B. Data Acquisition & Import

Reading and writing CSV, JSON, Excel, and Stata files
Managing large datasets with chunked loading and memory optimization
Ensuring data quality at import: type conversions and error reporting

C. Data Wrangling & Cleaning

Pandas Series and DataFrame fundamentals: indexing, slicing, filtering
Detecting and treating missing values, duplicates, and outliers
String operations and date–time parsing
Grouped aggregations and transformations (groupby, agg, transform)

D. Data Integration & Reshaping

Merging, joining, concatenating multiple tables (e.g., rounds, valuations, company profiles)
Pivoting and melting for “wide” vs. “long” formats
Building event‐time panels and longitudinal structures
Chaining operations into reproducible pipelines

E. Exploratory Visualization

Plotting with Matplotlib (line, bar, scatter, box)
Customizing plots: titles, labels, legends, and annotations
Exporting figures for reports and presentations

F. Introduction to Text Data

Regular expressions for parsing term sheets and news feeds
Tokenization and basic NLP workflows (TF-IDF)

Part II – Econometric Methods

A. Econometric Data Structures

Cross-section vs. time series vs. panel data
Constructing balanced and unbalanced panels from transaction logs

B. Simple Linear Regression

Economic vs. econometric model specification
Ordinary least squares estimation and interpretation
Incorporating non-linear terms and interaction dummies

C. Inference & Hypothesis Testing

Confidence intervals and p-values
Robust standard errors and heteroskedasticity tests

D. Multiple Regression & Diagnostics

Multicollinearity, specification tests, omitted-variable bias
Model selection and goodness-of-fit measures

E. Binary Outcome Models

Linear probability model, Logit, and Probit for exit or follow-on rounds
Marginal effects and interpretation

F. Panel Data Techniques

Pooled OLS, fixed effects, random effects

Hausman test and model choice

Intended Learning Outcomes (ILO)

KNOWLEDGE AND UNDERSTANDING

At the end of the course student will be able to...

Describe how to structure and harmonize data from multiple sources into tidy, analysis-ready formats.
Demonstrate proficiency in Python and Stata for data ingestion, cleaning, and management.
Apply advanced data‐wrangling techniques to detect and correct missing values, outliers, and inconsistencies.
Transform and merge disparate tables into cross-sectional and panel datasets ready for analysis.
Interpret the output of regression models within the context of venture financing.
Evaluate model diagnostics—such as heteroskedasticity tests and endogeneity checks—to ensure valid inference.
Present empirical results effectively through concise written reports and visualizations.

APPLYING KNOWLEDGE AND UNDERSTANDING

At the end of the course student will be able to...

Construct end-to-end data workflows to ingest, clean, and prepare complex startup datasets for analysis.
Design and implement regression models in Python and Stata to test real-world hypotheses.
Communicate analytical results through clear visualizations and concise written reports.
Adapt analytical techniques to novel data challenges encountered in professional research, policy, or finance settings.

Teaching methods

Lectures
Guest speaker's talks (in class or in distance)
Practical Exercises
Individual works / Assignments
Collaborative Works / Assignments

DETAILS

The following methods will be employed:

Lectures
- Core concepts in Python, data handling, and econometrics
Guest Speaker Seminars
- Talks by industry practitioners on data sources, startup finance, and market insights
Hands-On Sessions
- Guided lab sessions focused on coding exercises, data cleaning tasks, and visualization
Problem-Solving Sessions
- Instructor-led exercises and self-assessment quizzes to reinforce key techniques
Group Project Work
- Team-based empirical research: hypothesis development, data pipeline construction, model estimation, and presentations

Assessment methods

	Continuous assessment	Partial exams	General exam
Written individual exam (traditional/online)			x
Collaborative Works / Assignment (report, exercise, presentation, project work etc.)	x

ATTENDING STUDENTS

Assessment of learning:

Written Examination (60%)

A 90-minute exam at the end of the course covering data engineering and econometric methods.
- Assesses theoretical understanding and practical problem-solving skills across both parts of the curriculum.

2. Group Project (40%)

Teams design and execute an empirical study, culminating in an in-class presentation.
- Evaluation criteria include research design, data pipeline robustness, econometric analysis and presentation effectiveness.
- Individual contributions and presentation skills are assessed for each team member.

NOT ATTENDING STUDENTS

Written Examination (100%)

A single, 2½-hour individual exam at the end of the course.
Covers all topics from data preparation through econometric modeling.

Teaching materials

ATTENDING STUDENTS

Course materials for Part I will consist of tutorials, code notebooks, and reference readings distributed at the start of the term. For Part II, students will receive lecture slides and a selection of research papers in advance. In addition, the econometrics module will draw on specified chapters from:
Hill, R. C., Griffiths, W. E., & Lim, G. C. (2018). Principles of Econometrics. John Wiley & Sons.

NOT ATTENDING STUDENTS

Hill, R. C., Griffiths, W. E., & Lim, G. C. (2018). Principles of Econometrics. John Wiley & Sons (selected chapters). Additional readings for part I and II will be specified at the beginning of the course.

Last change 27/05/2025 11:33