UTTV

Video Player is loading.

Current Time 0:00

Duration -:-

Loaded: 0%

Stream Type LIVE

Remaining Time -:-

Machine Learning - Performance evaluation measures

Embed

Klipi teostus: Mirjam Paales 26.02.2013 4784 vaatamist Arvutiteadus

Given by Sven Laur

Brief summary: Principles of experiment design. Machine learning as minimisation of future costs. Overview of standard loss functions. Stochastic estimation of future costs by random sampling (Monte-Carlo integration). Theoretical limitations. Standard validation methods: holdout, randomised holdout, cross-validation, leave-one-out, bootstrapping. Advantages and drawbacks of standard validation methods

Slides: PDF slides Handwritten slides

Literature:

Davison and Hinkley: Bootstrap Methods and Their Application
Molinaro, Simon and Pfeiffer: Prediction Error Estimation: A Comparison of Resampling Methods
Arlot and Celisse: A survey of cross-validation procedures for model selection
Efron: Estimating the Error Rate of a Prediction Rule: Improvement on Cross-Validation
Efron and Tibshirani: Improvements on Cross-Validation: The .632+ Bootstrap Method
Wolfgang Härardle: Applied Nonparametric Regression: Choosing the smoothing parameter (Chapter 5)
Yang: Can the Strengths of AIC and BIC Be Shared?
van Erven, Grunwald and de Rooij:Catching Up Faster by Switching Sooner: A Prequential Solution to the AIC-BIC Dilemma

Complementary exercises:

Generate data form a simple linear or polynomial regression model and use various validation methods and report results:

Did a training method chose a correct model
Is there some differences when the correct model is not feasible?
Estimate bias and variance of a training method
Did a validation method correctly estimated expected losses

Try various classification and linear regression methods together with various validation methods report the results

Iris dataset
Computer Hardware Data Set
Housing Data Set
Datasets for testing linear regression models

Free implementations:

Boot package in R
Some methods in the rminer package in R

Machine Learning - Performance evaluation measures

Seotud videod

Machine Learning - Ia. Introduction to the course
12.02.13

Machine Learning - practice session (18.02)
18.02.13

Machine Learning - Linear models and polynomial interpolation
19.02.13

Machine Learning - practice session (25.02)
25.02.13

Machine Learning - Introduction to optimization
05.03.13

Machine Learning - practice session (11.03)
11.03.13

Machine Learning - Linear Classification
12.03.13

Machine Learning - practice session (18.03)
18.03.13

Machine Learning - Feed-forward neural networks for prediction tasks
19.03.13

Machine Learning - practice session (25.03)
25.03.13

Machine Learning - Basics of probabilistic modelling
26.03.13

Machine Learning practice session (1.04)
01.04.13

Machine Learning - Maximum likelihood and maximum a posteriori estimates
02.04.13

Machine Learning practice session (8.04)
08.04.13

Machine Learning - Model-based clustering techniques
09.04.13

Machine Learning - practice session (15.04)
15.04.13

Machine Learning - Expectation-maximisation and data augmentation algorithms
16.04.13

Machine Learning practice session (22.04)
22.04.13

Machine Learning - Principal Component Analysis
23.04.13

Machine Learning practice session (29.04)
29.04.13

Machine Learning - Statistical learning theory
30.04.13

Machine Learning practice session (6.05)
06.05.13

Machine Learning - Support Vector Machines
07.05.13

Machine Learning - Kernel Methods
14.05.13

Machine Learning practice session (21.05)
20.05.13

Machine Learning - Basics of ensemble methods
21.05.13

Machine Learning - Particle filters
28.05.13