mlbench · ehoelzl · Dec 7, 2020 · Feb 25, 2021 · martinjaggi · Feb 26, 2021
diff --git a/_layouts/page.html b/_layouts/page.html
@@ -3,6 +3,8 @@
 ---
 
 <div class="page">
-  <h1 class="page-title">{{ page.title }}</h1>
+  {% if page.notitle != true %}
+    <h1 class="page-title">{{ page.title }}</h1>
+  {% endif %}
   {{ content }}
 </div>
diff --git a/index.md b/index.md
@@ -5,9 +5,10 @@ title: Home
 <h1>MLBench: Distributed Machine Learning Benchmark</h1>
 
 <hr>
-<a style="display: inline-block" href="https://travis-ci.com/mlbench/mlbench-dashboard"><img src="https://travis-ci.com/mlbench/mlbench-dashboard.svg?branch=develop"></a>
-<a style="display: inline-block" href="https://travis-ci.com/mlbench/mlbench-core"><img src="https://travis-ci.com/mlbench/mlbench-core.svg?branch=develop"></a>
-<a style="display: inline-block" href="https://travis-ci.com/mlbench/mlbench-benchmarks"><img src="https://travis-ci.com/mlbench/mlbench-benchmarks.svg?branch=develop"></a>
+<a style="display: inline-block" href="https://github.com/mlbench/mlbench-core/actions?query=workflow%3Amlbench-core+branch%3Adevelop"><img src="https://github.com/mlbench/mlbench-core/workflows/mlbench-core/badge.svg?branch=develop"></a>
+<a style="display: inline-block" href="https://github.com/mlbench/mlbench-dashboard/actions?query=workflow%3Amlbench-dashboard+branch%3Adevelop"><img src="https://github.com/mlbench/mlbench-dashboard/workflows/mlbench-dashboard/badge.svg?branch=develop"></a>
+<a style="display: inline-block" href="https://github.com/mlbench/mlbench-benchmarks/actions?query=workflow%3Amlbench-benchmarks+branch%3Adevelop"><img src="https://github.com/mlbench/mlbench-benchmarks/workflows/mlbench-benchmarks/badge.svg?branch=develop"></a>
+<a style="display: inline-block" href="https://github.com/mlbench/mlbench-helm/actions?query=workflow%3Amlbench-helm+branch%3Adevelop"><img src="https://github.com/mlbench/mlbench-helm/workflows/mlbench-helm/badge.svg?branch=develop"></a>
 <a style="display: inline-block" href="https://mlbench.readthedocs.io/en/latest/?badge=latest"><img src="https://readthedocs.org/projects/mlbench/badge/?version=latest" alt="Documentation Status"></a>
 
 

diff --git a/public/css/hyde.css b/public/css/hyde.css
@@ -248,3 +248,10 @@ a.sidebar-nav-item:focus {
 .theme-base-0f .related-posts li a:hover {
   color: #8f5536;
 }
+
+
+table.wide {
+  max-width: 200%;
+  width: 200%;
+  font-size: 15px;
+}
diff --git a/results/index.md b/results/index.md
@@ -0,0 +1,39 @@
+---
+layout: page
+notitle: true
+title: Results
+datatable: true
+---
+
+# MLBench Official Training Results
+---
+
+<p>
+Here we present the official training results obtained by running the benchmark tasks.
+It provides a reference and comparison point for different implementations.
+</p>
+
+
+<p>
+The table below provides a comparison table for results. Times are in seconds. The baseline represents the 1 worker case, and we
+only present the results for the best obtained speedups.
+
+For more detailed results for each run, please check the documentation.
+
+</p>
+
+---
+
+|Task                 | Dataset  | Model                 | Aggregation scheme |  Metric Name           | Metric Goal | Framework    |    Baseline    | Best Speedup   | Best Speedup Workers | Description | Implementation | Results |
+|:-------------------:|:--------:|:---------------------:|:------------------:|:----------------------:|:-----------:|:------------:|:--------------:|:--------------:|:--------------------:|:-----------:|:--------------:|:-------:|
+| Image Recognition   | CIFAR10  | ResNet20              | All-Reduce         | Validation Accuracy    |  80%        |  Torch 1.7.0 |    171.15s     |   6.6 (total), 11.9 (compute) |  16   |   [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#task-1-image-classification)|   [code](https://github.com/mlbench/mlbench-benchmarks/tree/develop/pytorch/imagerecognition/cifar10-resnet20-all-reduce)              |   [results](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#id17)      |
+|                     |          |                       | PyTorch DDP        |                        |             |  Torch 1.7.0 |    182.81s     | 4.4            |           8          |  [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#task-1-image-classification)| [code](https://github.com/mlbench/mlbench-benchmarks/tree/develop/pytorch/imagerecognition/cifar10-resnet20-distributed-data-parallel)                |         |
+|---------------------+----------+-----------------------+--------------------+------------------------+-------------+--------------+----------------+----------------+----------------------+-------------+---------|
+| Language Modelling  | Wikitext2| AWD-LSTM              | All-Reduce         | Validation Perplexity  | 70          |  Torch 1.7.0 |    87,401.33s  |                |                      | [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#task-3-language-modelling) |  [code]()               |         |
+| Language Modelling  |          | BERT
+|---------------------+----------+-----------------------+--------------------+------------------------+-------------+--------------+----------------+----------------+----------------------+----------------+---------|
+| Machine Translation | WMT16    | LSTM (GNMT)           | All-Reduce         | Validation BLEU-score  | 24          |  Torch 1.7.0 |    65,206.62s   | 2.8 (total), 15.8 (compute)          |      16              | [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#a-lstm-wmt16-en-de) | [code](https://github.com/mlbench/mlbench-benchmarks/tree/develop/pytorch/nlp/translation/wmt16-gnmt-all-reduce)                |   [results](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#id30)      |
+|---------------------+----------+-----------------------+--------------------+------------------------+-------------+--------------+----------------+----------------+----------------------+----------------+---------|
+| Machine Translation | WMT17    | Transformer           | All-Reduce         | Validation BLEU-score  | 25          |  Torch 1.7.0 |    37,594.21s   | 3.0 (total), 18.3 (compute)          |      16              | [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#b-transformer-wmt17-en-de) | [code](https://github.com/mlbench/mlbench-benchmarks/tree/develop/pytorch/nlp/translation/wmt17-transformer-all-reduce)                |    [results](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#id32)     |
+{:.wide}
+