CSAIL Event Calendar
Transportation Distances and their Application in Machine Learning: New ProblemsSpeaker: Marco Cuturi, University of Kyoto Date: Wednesday, December 12 2012 Time: 4:00PM to 5:00PM Location: Patil/ Kiva Seminar Room 32-G449 Host: Tomaso Poggio, Lorenzo Rosasco, Laboratory for Computational and Statistical Learn Contact: Kathleen Sullivan, 617-253-0551, kdsulliv@mit.edu Relevant URL: http://lcsl.mit.edu/cml-seminars.htmlAbstract: I will present in this talk two new research topics related to the optimal transportation distance (also known as Earth Mover's or Wasserstein) and its application in machine learning to compare histograms of features. I discuss first the ground metric learning problem, which is the problem of tuning automatically the parameters of transportation distances using labeled histogram data. After providing some reminders on optimal transportation, I will argue that learning transportation distances is akin to learning an L1 distance on the simplex, namely a distance with polyhedral level sets, and I will draw some parallels with Mahalanobis distances, the L2 distance and elliptic level sets. I will then introduce our algorithm (arXiv:1110.2306) and more recent extensions. In the second part of my talk, I address the fact that transportation distances are not Hilbertian by showing that they can be cast as positive definite kernels through the "generating function trick". We prove that the trick, which uses the generating function of the transportation polytope to define a similarity - rather than focusing exclusively on the optimal transport to define a distance - leads to a positive definite kernel between histograms (arXiv:1209.2655). See other events that are part of Cambridge Machine Learning Colloquium and Seminar Series 2012/2013
|







