Skip to content

A list of topics for a google summer of code (gsoc) 2012

paolo-losi edited this page Mar 13, 2012 · 37 revisions

A list of topics for a Google summer of code (GSOC) 2012

Online Low Rank Matrix Completion

Possible mentor: Olivier Grisel

Possible candidate: Vlad Niculae, ?

Goal: Online or Minibatch SGD or similar on a squared l2 reconstruction loss + low rank penalty (nuclear norm) on scipy.sparse matrix: the implicit components of the sparse input representation would be interpreted by the algorithms as missing values rather that zero values.

Application: Build a scalable recommender system example, e.g. on the movielens dataset.

TODO: find references in the literature.

Online Non Negative Matrix Factorization

Possible mentor: Olivier Grisel

Possible candidate: Vlad Niculae, ?

Goal: Online or Minibatch NMF using SGD + positive projections (or any other out-of-core algorithms) accepting both dense and sparse matrix as input (decomposition components can be dense array only).

Application: Build a scalable topic model e.g. on million of Wikipedia abstracts for instance using this script.

TODO: find references in the literature.

Robust PCA

Algorithms for decomposing a design matrix into a low rank + sparse components.

Possible mentor: ?

Possible candidate: Kerui Min (Minibio: "I'm a graduate student at UIUC who is currently pursuing the research work related to low-rank matrices recovery & Robust PCA.")

Applications: ?

References:

Multilayer Perceptron / Neural Network

Possible mentor: Andreas Mueller

Goal: Implement a stochastic gradient descent algorithm to learn a multi-layer perceptron.

References:

SVM with low rank kernel approximation

Possible mentor: Andreas Mueller

Goal: Implement a stochastic gradient descent SVM using a low-rank kernel approximation.

References:

Generalized Additive Models

Possible mentor: Paolo Losi (others?)

Goal: Implement one of the state of art methods for Generalized Additive Models

References:

Clone this wiki locally