WelcometotheKernel-Class



《WelcometotheKernel-Class》由会员分享,可在线阅读,更多相关《WelcometotheKernel-Class(11页珍藏版)》请在装配图网上搜索。
1、Click to edit Master title style,,Click to edit Master text styles,,Second level,,Third level,,Fourth level,,Fifth level,,,,*,Welcome to the Kernel-Class,My name,: Max (Welling),,Book:,,There will be class-notes/slides.,,Homework,: reading material, some exercises,,,some MATLAB implementations.,,I l
2、ike,: an active attitude in class.,,ask questions! respond to my questions.,,Enjoy.,,1,,Primary Goal,,What is the primary goal of:,,,- Machine Learning,,- Data Mining,,- Pattern Recognition,,- Data Analysis,,- Statistics,,,Automatic detection of non-coincidental structure in data.,,,2,,Desiderata,,,
3、Robust algorithms,are insensitive to outliers and wrong,,model assumptions.,,,,Stable algorithms,: generalize well to unseen data.,,,,Computationally efficient algorithms,are necessary to handle,,large datasets.,,3,,Supervised & Unsupervised Learning,,,supervised,: classification, regression,,,,uns
4、upervised,: clustering, dimensionality reduction, ranking,,,outlier detection.,,,primal vs. dual problems: generalized linear models vs.,,kernel classification.,this is like nearest neighbor,,classification.,4,,Feature Spaces,,non-linear mapping to F,,1. high-D space,,2. infinite-D countable space :
5、,,3. function space (Hilbert space),example:,5,,Kernel Trick,,Note: In the dual representation we used the Gram matrix,,to express the solution.,,,Kernel Trick:,,Replace :,kernel,If we use algorithms that only depend on the Gram-matrix, G,,,then we never have to know (compute) the actual features,T
6、his is the crucial point of kernel methods,6,,Properties of a Kernel,,Definition:,,A finitely,positive semi-definite,function,,is a,symmetric,function of its arguments for which matrices formed,,by restriction on any finite subset of points is positive semi-definite.,Theorem:,,A function
7、 can be written,,as where is a feature map,,iff k(x,y) satisfies the semi-definiteness property.,Relevance:,We can now check if k(x,y) is a proper kernel using,,only properties of k(x,y) itself,,,i.e. without the need to know the featu
8、re map!,7,,Modularity,,Kernel methods consist of two modules:,,,1) The choice of kernel (this is non-trivial),,2) The algorithm which takes kernels as input,,,Modularity: Any kernel can be used with any kernel-algorithm,.,,some kernels:,some kernel algorithms:,,- support vector machine,,- Fisher dis
9、criminant analysis,,- kernel regression,,- kernel PCA,,- kernel CCA,8,,Goodies and Baddies,,Goodies:,,Kernel algorithms are typically constrained convex optimization,,problems,,solved with either spectral methods or convex optimization tools,.,,Efficient algorithms do exist in most cases.,,The simi
10、larity to linear methods facilitates analysis. There are strong,,generalization bounds on test error.,,,Baddies:,,You need to choose the appropriate kernel,,Kernel learning is prone to over-fitting,,All information must go through the kernel-bottleneck.,,9,,Regularization,,Demo Trevor Hastie.,regula
11、rization is very important!,,,,regularization parameters determined by out of sample.,,measures (cross-validation, leave-one-out).,10,,Learning Kernels,,All information is tunneled through the Gram-matrix information,,bottleneck.,,The real art is to pick an appropriate kernel.,,e.g. take the RBF kernel:,if c is very small: G=I (all data are dissimilar): over-fitting,,if c is very large: G=1 (all data are very similar): under-fitting,,,We need to,learn,the kernel. Here is some ways to combine kernels to improve them:,,k1,k2,cone,any positive,,polynomial,11,,
- 温馨提示:
1: 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
2: 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
3.本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 装配图网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。