20564 - BIG DATA FOR BUSINESS DECISIONS
Course taught in English
Go to class group/s: 31
Class 31: FRANCESCO GROSSETTI
Some familiarity with basic statistics and programming in R and Python.
Today's world is constellated by interdisciplinary professional figures, like data scientists, who are able to successfully mix different technical skills to provide extremely powerful insights from data. The mission of this course is to teach students of business administration and related fields how to reduce the gap when interacting with more quantitative colleagues. The course provides some basic knowledge about statistical modeling, data visualization as well as computer programming which are all fundamental aspects when developing an impactful storytelling.
- Definition of Big Data.
- Parallel and distributed computing.
- Statistical modeling: from linear regression to clustering and classification.
- Model evaluation.
- Data visualization.
- Analyzing textual data.
- Artificial Neural Networks.
- Introduction to programming in R and Python.
- Statistical modeling in R and Python.
- Recognize business problems where Big Data can be applicable.
- Recognized the main statistical models generally adopted to extract insights.
- Understand the complexity of analyzing textual data.
- Understand what an Artificial Neural Network is and what are its main components.
- Solve business problems by data-analytic thinking.
- Use several tools and techniques to practically implement solution methods.
- Use R to carry out simple statistical analyses and visualizations.
- Face-to-face lectures
- Guest speaker's talks (in class or in distance)
- Company visits
- Exercises (exercises, database, software etc.)
- Case studies /Incidents (traditional, online)
- Group assignments
- Participation in external competitions
- Speakers from practice are brought to class to discuss about the challenges and costs involved in implementing Big Data solutions.
- Home programming exercises are given to students.
- A practical group assignment is presented in two sessions, at the half and end of the semester.
|Continuous assessment||Partial exams||General exam|
- One assignment representing 30% of final grade.
- One final multiple-choice written exam representing 70% of final grade.
- One final multiple-choice written exam representing 100% of final grade.
- Slides provided by the instructors.
- F. PROVOST, T. FAWCETT (edition by O'REALLY), Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking.
- G. RUBERA, F. GROSSETTI (edition by Egea BUP), Python for non-Pythonians: How to Win Over Programming Languages.
- J. SILGE, D. ROBINSON (edition by O'REALLY),Text Mining with R: A Tidy Approach.
- G. GROLEMUND, H. WICKHAM (edition by O'REALLY), R for Data Science.
- G. GROLEMUND (edition by O'Really), Hands-On Programming with R: Write Your Own Functions and Simulations.
- F. CHOLLET (edition by Manning Publications), Deep Learning with R.
- F. CHOLLET (edition by Manning Publications), Deep Learning with Python.