Data Science for Non-Life Insurance (B-KUL-D0N55A)
Aims
The goal of this course is to provide students with quantitative skills relevant for analytic tasks (in particular: pricing and reserving) in the non-life insurance business. Focus is on both theory and practice, including hands-on programming (in R or Python). Upon completion of the course the student
- can evaluate the pros and cons of wide range of analytic methods, both for supervised as well as unsupervised learning
- can build an insightful exploratory analysis of insurance data, through cleaning, summarizing and visualizing of the data
- knows how to deploy modern predictive modeling and machine learning techniques for insurance business decision making
- can evaluate the performance of these techniques and choose a suitable approach for the given problem, taking the context of the actuarial problem and the applicable frameworks (e.g. IT, regulation, society, ethics) into account
- can explain ongoing evolutions in the design of insurance products and the analytic tasks involved
- can translate the output of the analytic model to relevant business metrics.
Previous knowledge
At the beginning of this course the student has active knowledge of a basic course in probability theory and statistics (at bachelor level).
Is included in these courses of study
- Doctoral Programme in Business Economics (Leuven)
- Master handelsingenieur (Leuven) 120 ects.
- Master handelsingenieur (Leuven) (Minor: Actuariële en financiële wetenschappen) 120 ects.
- Master of Business Engineering (Leuven) 120 ects.
- Master of Business Engineering (Leuven) (Minor: Actuarial and Financial Engineering) 120 ects.
- Master of Actuarial and Financial Engineering (Leuven) 120 ects.
- Master in de actuariële en financiële wetenschappen (Leuven) 120 ects.
- Courses for Exchange Students Faculty of Economics and Business (Leuven)
- Master in de accountancy en het revisoraat (programma voor studenten gestart in de master in 2024-2025 of later, en voor studenten gestart in de master en/of het schakel- of voorbereidingsprogramma vóór 2024-2025 indien zij hiervoor kiezen) (Leuven) 60 ects.
Activities
6 ects. Data Science for Non-Life Insurance (B-KUL-D0N55a)
Content
Data science meets actuarial science
- The data science landscape
- Predictive modeling tasks in insurance for life, non-life (property & casualty, P&C) and health insurance products
- Features of insurance data: the frequency-severity setting, and beyond
- Applicable frameworks (IT, regulation, ethics and society).
Part I: data science methods for risk classification
Risk classification for frequencies and severities
- Generalized Linear Models (GLMs): theory, use in pricing, estimation and inference
- Building tarification models with GLMs
- From GLMs to GAMs: flexible effects of continuous and spatial risk factors
Machine learning basics for actuarial work
- Taxonomy of machine learning methods for supervised (regression and classification) and unsupervised learning
- Vanilla loss functions vs actuarial loss functions, model accuracy, bias-variance tradeoff
- Tuning parameters, hyperparameters, model tuning
- Resampling methods: k-fold cross validation, stratified k-fold cross validation
Tree-based machine learning methods
- Decision trees for regression and classification
- Ensembles of trees: bagging, random forests and gradient boosting machine
- Model interpretation: variable importance, partial dependence plot, individual conditional expectation, hunting for interaction effects
Lasso, friends of Lasso and the actuary
- Shrinkage, regularization and sparsity
- Ridge, Lasso, group and fused Lasso
- SMuRF: Sparsity with Multi-type Regularized Feature modelling.
Demistify neural networks
- The basics of neural networks: ANNs, CNNs, RNNs
- Non-insurance applications of (deep) neural networks
- Neural networks for analyzing claim frequencies and severities, Combined Actuarial Neural Networks (CANNs)
Part II: experience rating with credibility models and Bonus-Malus scales
Credibility methods for experience rating
- Aims and challenges
- Classical credibility models: an actuarial cornerstone dating back to the 60s-70s
- Bayesian credibility models
- A statistical approach to credibility models (with mixed models)
Bonus-malus scales
- The scale, Markov chains, transition rules
- Relativities: connecting credibility models and Bonus Malus scales
- BM scales incorporating a priori risk classification
Part III: claims reserving
- Aims and challenges
- Deterministic methods for claims reserving: chain-ladder
- Stochastic claims reserving models: best estimate, standard error, predictive distributions (bootstrapping and Bayesian approach)
- Contemporary topics: Solvency II, reserve risk, micro-level loss reserving.
Course material
- Hastie, T., Tibshirani, R., and Friedman, J. H. (2009, 2nd edition). The elements of statistical learning: data mining, inference, and prediction. Springer series in statistics. Springer.
- James, G., Witten, D., Hastie, T., and Tibshirani, R. (2023, 2nd edition). An introduction to statistical learning. Springer, New York.
- Denuit, M., Maréchal, X., Pitrebois, S. and Walhin, J.F. (2007). Actuarial modelling of claim counts. Wiley.
- Wuthrich, M. and Merz, M. (2008). Stochastic claims reserving methods in insurance. Wiley.
Articles, lecture sheets and R markdowns and Google Colabs will be distributed via TOLEDO.
Format: more information
Weekly lectures, regular computer labs and tutorials.
Evaluation
Evaluation: Data Science for Non-Life Insurance (B-KUL-D2N55a)
Explanation
The grades are determined by the lecturer as communicated via TOLEDO and stated in the examination schedule. The result is calculated and communicated as a number on a scale of 20.
Assignments are given throughout the semester: (small) coding assignments count for 2/20, a project + presentation + interaction with the teaching team counts for 4/20. If the student does not participate in one (or more) of the assignments, the grades for these assignments will be a 0-grade within the calculations of the final grade.
The final exam counts for 14/20.
The final grade is the sum of all parts.
For the second examination opportunity the features of the evaluation and determination of grades are similar to those of the first examination opportunity, as described above.
Information about retaking exams
See 'Explanation' for further information regarding the second examination opportunity.