Bayesian inference and Gaussian processes
author:Carl Edward Rasmussen,
Max Planck Institute
published: Aug. 20, 2007, recorded: August 2007, views: 3663
published: Aug. 20, 2007, recorded: August 2007, views: 3663
You might be experiencing some problems with Your Video player.
Slides
Related content
Visitors who watched this lecture also watched...
01:00:47
12624 views - David MacKay, 2006
01:34:49
6432 views - Yee Whye Teh, 2007
04:22:56
1343 views - Carl Edward Rasmussen, 2003
05:15:54
4699 views - Zoubin Ghahramani, 2007
04:59:19
18448 views - Sam Roweis, 2006
03:13:52
2374 views - Mike Tipping, 2003
01:24:49
3286 views - Zoubin Ghahramani, 1970
01:47:07
2772 views - Joaquin Quiñonero Candela, 2007
02:10:19
784 views - Carl Edward Rasmussen, 2008
01:05:42
4882 views - Michael I. Jordan, 2005
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity:
You need to login to cast your vote.
Watch videos: (click on thumbnail to launch)
See Also:
Download slides:
rasmussen_carl_edward_bigp_00.pdf (1.5 MB)
Launch in a standalone WM Player
Switch to Windows Media Player
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !




Reviews and comments:
The guy is actually cool. I like this talk. Thanks for uploading.
Slide 6.
"Notice: the likelihood function is a probability distribution over observations, not over parameters."
Likelihood function is a function (probability distribution) of a parameter, not the observations:
L(pi|D) or L(pi) is the likelihood function
p(D|pi) is the condition probability
Ref:
1. R. Hogg, A. Craig Introduction to Mathematical Statistics, 4th Ed 1978, p 202.
2. E. Lehmann, G Cassela Theory of Point Estimation (Springer Texts in Statistics) , 2nd Ed, 2003, p. 238
3. http://en.wikipedia.org/wiki/Likelihood
Slide 10.
Usage of the Beta distribution with alpha=beta=1 is not correct description of the Leaner B case: probabilities p(pi=0) and p(pi=1) will never be obtained. Therefore, instead of informative prior (Beta distribution with alpha=beta=1) the non-informative prior (Beta distribution with alpha=beta=0) has to be used.
Reader #2 is right!!! Come on, how this guy can say such a blunder!?
Readers #2 and #4 have a misunderstanding here: the likelihood function really takes two arguments, observed data and model parameters. It will then give you the probability (up to a proportionality constant) of the observed data given the model parameters, i.e. you obtain a probability distribution _over observations_ given the parameters. You do _not_ get a probability distribution over parameters. This is exactly what the slides say and is perfectly consistent with the references that reader #2 provides. Reader #2 conflates "function of a parameter" and "probability distribution of a parameter", which is clearly wrong here.
Write your own review or comment: