This code compares four distributed algorithms for training machine learning models on Apache Spark. The implemented algorithms are (a sketch of their shared round structure follows the list):
- CoCoA
- mini-batch stochastic dual coordinate ascent (mini-batch SDCA)
- stochastic subgradient descent with local updates (local SGD)
- mini-batch stochastic subgradient descent (mini-batch SGD)
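The four methods differ mainly in how much local work happens between communication rounds. The sketch below is hypothetical, not the repository's actual API, and uses a local-SGD inner step for concreteness: each Spark partition takes several steps on its own data, and the driver averages the resulting model increments. CoCoA keeps the same communication pattern but runs coordinate ascent on local dual variables instead; the mini-batch methods sit at the other extreme, aggregating after every (mini-batch) step.

```scala
import org.apache.spark.rdd.RDD

object LocalUpdateSketch {

  case class Example(label: Double, features: Array[Double])

  def dot(a: Array[Double], b: Array[Double]): Double =
    a.zip(b).map { case (x, y) => x * y }.sum

  /** One outer round: every partition takes `localIters` steps on its own
    * examples and returns a model increment; the driver averages the
    * increments into the shared model `w`. All names are illustrative. */
  def localUpdateRound(
      data: RDD[Example],
      w: Array[Double],   // current model, captured by the closure
      localIters: Int,    // H: number of local steps per round
      lambda: Double,     // L2 regularization strength
      eta: Double         // fixed step size, purely illustrative
  ): Array[Double] = {
    val numParts = data.getNumPartitions
    val sumDeltaW = data
      .mapPartitions { part =>
        val examples = part.toArray
        val dw = new Array[Double](w.length)
        if (examples.nonEmpty) {
          val rng = new scala.util.Random()
          for (_ <- 1 to localIters) {
            val ex = examples(rng.nextInt(examples.length))
            // Locally corrected model: global w plus this partition's updates.
            val wLocal = w.zip(dw).map { case (a, b) => a + b }
            val margin = ex.label * dot(wLocal, ex.features)
            // Subgradient of (lambda/2)||w||^2 + hinge at the sampled point:
            // lambda * w, minus y_i * x_i when the margin is violated.
            var j = 0
            while (j < dw.length) {
              var g = lambda * wLocal(j)
              if (margin < 1.0) g -= ex.label * ex.features(j)
              dw(j) -= eta * g
              j += 1
            }
          }
        }
        Iterator.single(dw)
      }
      .reduce((a, b) => a.zip(b).map { case (x, y) => x + y })
    // Averaging the per-partition increments gives local SGD. CoCoA keeps this
    // structure but replaces the inner loop with dual coordinate ascent, which
    // is what yields the duality gap certificate; the mini-batch methods are
    // roughly the localIters = 1 case, communicating after every step.
    w.zip(sumDeltaW).map { case (wi, di) => wi + di / numParts }
  }
}
```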
The code trains a standard SVM (hinge loss, L2-regularized) and reports training and test error, as well as the duality gap certificate when the method is primal-dual.
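For reference, the objective and certificate take the standard form below; the scaling conventions in the code may differ by constant factors, so this is a sketch rather than the exact implemented formulas.

```latex
% Primal: L2-regularized hinge loss over n examples (x_i, y_i), y_i in {-1,+1}
\min_{w \in \mathbb{R}^d} \; P(w) \;=\; \frac{\lambda}{2}\,\lVert w\rVert^2
  \;+\; \frac{1}{n}\sum_{i=1}^{n} \max\bigl\{0,\; 1 - y_i\, w^{\top} x_i\bigr\}

% Dual: box-constrained, with the primal-dual map w(\alpha)
\max_{\alpha \in [0,1]^n} \; D(\alpha) \;=\; \frac{1}{n}\sum_{i=1}^{n}\alpha_i
  \;-\; \frac{\lambda}{2}\,\lVert w(\alpha)\rVert^2,
\qquad w(\alpha) \;=\; \frac{1}{\lambda n}\sum_{i=1}^{n} \alpha_i\, y_i\, x_i

% Duality gap: computable at any iterate, and by weak duality it upper-bounds
% the suboptimality  P(w(\alpha)) - P(w^\star) \le G(\alpha)
G(\alpha) \;=\; P\bigl(w(\alpha)\bigr) - D(\alpha) \;\ge\; 0
```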
How to run the code locally:

```
sbt/sbt assembly
./run-demo-local.sh
```

(For the sbt script to run, make sure you have downloaded CoCoA into a directory whose path contains no spaces.)
The CoCoA algorithmic framework is described in more detail in the following paper:
Jaggi, M., Smith, V., Takac, M., Terhorst, J., Krishnan, S., Hofmann, T., & Jordan, M. I. (2014). Communication-Efficient Distributed Dual Coordinate Ascent. In Advances in Neural Information Processing Systems 27 (NIPS 2014), pp. 3068–3076.