Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add results table #24

Draft
wants to merge 2 commits into
base: master
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 3 additions & 1 deletion _layouts/page.html
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,8 @@
---

<div class="page">
<h1 class="page-title">{{ page.title }}</h1>
{% if page.notitle != true %}
<h1 class="page-title">{{ page.title }}</h1>
{% endif %}
{{ content }}
</div>
7 changes: 4 additions & 3 deletions index.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,9 +5,10 @@ title: Home
<h1>MLBench: Distributed Machine Learning Benchmark</h1>

<hr>
<a style="display: inline-block" href="https://travis-ci.com/mlbench/mlbench-dashboard"><img src="https://travis-ci.com/mlbench/mlbench-dashboard.svg?branch=develop"></a>
<a style="display: inline-block" href="https://travis-ci.com/mlbench/mlbench-core"><img src="https://travis-ci.com/mlbench/mlbench-core.svg?branch=develop"></a>
<a style="display: inline-block" href="https://travis-ci.com/mlbench/mlbench-benchmarks"><img src="https://travis-ci.com/mlbench/mlbench-benchmarks.svg?branch=develop"></a>
<a style="display: inline-block" href="https://github.com/mlbench/mlbench-core/actions?query=workflow%3Amlbench-core+branch%3Adevelop"><img src="https://github.com/mlbench/mlbench-core/workflows/mlbench-core/badge.svg?branch=develop"></a>
<a style="display: inline-block" href="https://github.com/mlbench/mlbench-dashboard/actions?query=workflow%3Amlbench-dashboard+branch%3Adevelop"><img src="https://github.com/mlbench/mlbench-dashboard/workflows/mlbench-dashboard/badge.svg?branch=develop"></a>
<a style="display: inline-block" href="https://github.com/mlbench/mlbench-benchmarks/actions?query=workflow%3Amlbench-benchmarks+branch%3Adevelop"><img src="https://github.com/mlbench/mlbench-benchmarks/workflows/mlbench-benchmarks/badge.svg?branch=develop"></a>
<a style="display: inline-block" href="https://github.com/mlbench/mlbench-helm/actions?query=workflow%3Amlbench-helm+branch%3Adevelop"><img src="https://github.com/mlbench/mlbench-helm/workflows/mlbench-helm/badge.svg?branch=develop"></a>
<a style="display: inline-block" href="https://mlbench.readthedocs.io/en/latest/?badge=latest"><img src="https://readthedocs.org/projects/mlbench/badge/?version=latest" alt="Documentation Status"></a>


Expand Down
7 changes: 7 additions & 0 deletions public/css/hyde.css
Original file line number Diff line number Diff line change
Expand Up @@ -248,3 +248,10 @@ a.sidebar-nav-item:focus {
.theme-base-0f .related-posts li a:hover {
color: #8f5536;
}


table.wide {
max-width: 200%;
width: 200%;
font-size: 15px;
}
39 changes: 39 additions & 0 deletions results/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
---
layout: page
notitle: true
title: Results
datatable: true
---

# MLBench Official Training Results
---

<p>
Here we present the official training results obtained by running the benchmark tasks.
It provides a reference and comparison point for different implementations.
</p>


<p>
The table below provides a comparison table for results. Times are in seconds. The baseline represents the 1 worker case, and we
only present the results for the best obtained speedups.

For more detailed results for each run, please check the documentation.

</p>

---

|Task | Dataset | Model | Aggregation scheme | Metric Name | Metric Goal | Framework | Baseline | Best Speedup | Best Speedup Workers | Description | Implementation | Results |
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

instead of best speedup, let's only put the speedup at the max (or reasonable, fixed) number of workers, so 16?
then you can have 2 columns, one for total speedup (most important), and if you want a second one for compute only speedup. (and a column to say that #workers = 16)

and Bleu scores can have a decimal for clarity. 24.0 and 25.0 (minor nitpick :))

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

if other experiments have a different max # workers that's ok too

|:-------------------:|:--------:|:---------------------:|:------------------:|:----------------------:|:-----------:|:------------:|:--------------:|:--------------:|:--------------------:|:-----------:|:--------------:|:-------:|
| Image Recognition | CIFAR10 | ResNet20 | All-Reduce | Validation Accuracy | 80% | Torch 1.7.0 | 171.15s | 6.6 (total), 11.9 (compute) | 16 | [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#task-1-image-classification)| [code](https://github.com/mlbench/mlbench-benchmarks/tree/develop/pytorch/imagerecognition/cifar10-resnet20-all-reduce) | [results](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#id17) |
| | | | PyTorch DDP | | | Torch 1.7.0 | 182.81s | 4.4 | 8 | [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#task-1-image-classification)| [code](https://github.com/mlbench/mlbench-benchmarks/tree/develop/pytorch/imagerecognition/cifar10-resnet20-distributed-data-parallel) | |
|---------------------+----------+-----------------------+--------------------+------------------------+-------------+--------------+----------------+----------------+----------------------+-------------+---------|
| Language Modelling | Wikitext2| AWD-LSTM | All-Reduce | Validation Perplexity | 70 | Torch 1.7.0 | 87,401.33s | | | [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#task-3-language-modelling) | [code]() | |
| Language Modelling | | BERT
|---------------------+----------+-----------------------+--------------------+------------------------+-------------+--------------+----------------+----------------+----------------------+----------------+---------|
| Machine Translation | WMT16 | LSTM (GNMT) | All-Reduce | Validation BLEU-score | 24 | Torch 1.7.0 | 65,206.62s | 2.8 (total), 15.8 (compute) | 16 | [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#a-lstm-wmt16-en-de) | [code](https://github.com/mlbench/mlbench-benchmarks/tree/develop/pytorch/nlp/translation/wmt16-gnmt-all-reduce) | [results](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#id30) |
|---------------------+----------+-----------------------+--------------------+------------------------+-------------+--------------+----------------+----------------+----------------------+----------------+---------|
| Machine Translation | WMT17 | Transformer | All-Reduce | Validation BLEU-score | 25 | Torch 1.7.0 | 37,594.21s | 3.0 (total), 18.3 (compute) | 16 | [details](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#b-transformer-wmt17-en-de) | [code](https://github.com/mlbench/mlbench-benchmarks/tree/develop/pytorch/nlp/translation/wmt17-transformer-all-reduce) | [results](https://mlbench.readthedocs.io/en/latest/benchmark-tasks.html#id32) |
{:.wide}