Fold output of each benchmark #10914
Conversation
@Override
public void startBenchmark(BenchmarkParams benchParams) {
  // Open a GitHub Actions log group named after the benchmark.
  output.println("::group::" + benchParams.getBenchmark());
}
The ::group:: format seems to be mentioned in this discussion.
It is documented here: https://docs.github.com/en/actions/writing-workflows/choosing-what-your-workflow-does/workflow-commands-for-github-actions#grouping-log-lines, although the documentation is not very detailed.
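For illustration, here is a minimal, self-contained sketch of the grouping technique (the class, method, and benchmark names are illustrative, not the PR's actual JMH listener):

```java
// Sketch: wrap a benchmark's console output in GitHub Actions log-group
// workflow commands so the CI log viewer renders it as a collapsible section.
public final class FoldedBenchmarkRunner {

    /** Runs the given benchmark body with its output folded under its name. */
    public static void runFolded(String benchmarkName, Runnable benchmark) {
        // The text after "::group::" becomes the title of the collapsed section.
        System.out.println("::group::" + benchmarkName);
        try {
            benchmark.run();
        } finally {
            // Always close the group, otherwise subsequent log lines get folded into it.
            System.out.println("::endgroup::");
        }
    }

    public static void main(String[] args) {
        runFolded("org.example.bench.SumOverProxy",
            () -> System.out.println("Iteration 1: 102.137 ms/op"));
    }
}
```

A real JMH integration would typically hook this into the startBenchmark/endBenchmark callbacks of a delegating output format, as the diff above does for startBenchmark.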
Force-pushed from 37d0385 to b1ef6fb.
Update: solved in 9ba4428. I don't know how to print a plain line without any prefix in our Rust build system. If I use …
This is a good idea. I am interested in how the end result will look. I had no idea that there is output grouping for GH Actions. We could probably integrate this functionality for other actions as well.
Locally the command lines seem to be emitted fine. If I execute
enso$ ./run backend benchmark runtime --minimal-run
I see:
INFO ide_ci::program::command: sbt ℹ️ [info] # *** WARNING: Use non-forked runs only for debugging purposes, not for actual performance runs. ***
INFO ide_ci::program::command: sbt ℹ️ [info] Iteration 1: 102.137 ms/op
::endgroup::
::group::org.enso.interpreter.bench.benchmarks.semantic.ArrayProxyBenchmarks.sumOverComputingProxy
INFO ide_ci::program::command: sbt ℹ️ [info] # Run progress: 12.68% complete, ETA 00:01:29
INFO ide_ci::program::command: sbt ℹ️ [info] # Fork: N/A, test runs in the host VM
Finally it does something reasonable!
Certain parts of the stdlibs bench run output look nice as well. Some messages "escape" the grouping, though; I'm not sure why, or whether that matters enough to prevent integration. Hypothesis:
Let's try to switch to stdout - the stdlib benchmark is running. Update: the latest stdlibs output looks pretty decent. Looks like c698ef7 helped.
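One possible explanation for the "escaping" lines (my assumption; the thread does not confirm it): stdout and stderr are separate, independently buffered streams, so when the CI runner merges them, lines written to stderr can land outside the group that was opened on stdout. A tiny illustration:

```java
// Illustration: the group is opened and closed on stdout, but a line written to
// stderr may be flushed and merged into the CI log outside of that group.
public final class InterleavingDemo {
    public static void main(String[] args) {
        System.out.println("::group::my.Benchmark");
        System.err.println("warning emitted on stderr");   // may appear before or after the group
        System.out.println("Iteration 1: 102.137 ms/op");  // reliably inside the group
        System.out.println("::endgroup::");
    }
}
```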
No longer convinced this is needed at all.
After some thinking, I believe that these changes are unnecessary, and the current folded output is very messy - the group for benchGenerateList, for example, clearly contains output from some other benchmark.
May I ask you again to explain the use case this folding will solve, and why you think it is needed?
Note that at the end of each benchmark run (at the very end of the whole job output) there is a table-like structure with all the scores. In your case it is at https://github.com/enso-org/enso/actions/runs/10627416248/job/29460495368#step:7:1886. If you want to see the output for a single benchmark, you can just type its name into the search field. You would need to do that even if the output were folded.
EDIT: If you still think the folding will be beneficial, I will revoke my rejection of the PR after I see that the folding works and that no output is assigned to a wrong group.
I am sad you have changed your mind. I believe that with a few more finishing touches we can make this folding work well and increase overall satisfaction.
Right now the log stops at 22% of the stdlibs benchmark run, so the table isn't really visible.
I've just tried it, and folding has no negative effect on searching. These …
…e benchOnly to see whole output
The newest idea, c36fdad, to solve the too-huge-log issue is to differentiate between execution of all benchmarks and execution of selected benchmark(s). As a result, the CI run will be less verbose, but one will have a chance to re-run an individual benchmark locally and see the whole output.
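A sketch of the kind of switch being described (the flag and its detection are illustrative; c36fdad may implement this differently): fold per-benchmark output only when the whole suite runs, and keep the full, unfolded output when a single benchmark is selected, e.g. for a local re-run.

```java
// Sketch: fold per-benchmark output only for full-suite runs; a single
// selected benchmark keeps its complete, unfolded output.
public final class ConditionalFolding {

    private final boolean wholeSuite;  // true when all benchmarks are executed

    public ConditionalFolding(boolean wholeSuite) {
        this.wholeSuite = wholeSuite;
    }

    public void run(String benchmarkName, Runnable benchmark) {
        if (wholeSuite) {
            System.out.println("::group::" + benchmarkName);
        }
        try {
            benchmark.run();
        } finally {
            if (wholeSuite) {
                System.out.println("::endgroup::");
            }
        }
    }
}
```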
Yes, with the latest run the table is finally visible.
That is still possible even when the output is folded.
Yes, consider revoking your rejection.
The latest output seems reasonable. The fact that the output is less verbose when all the benchmarks are run is good - it helps to be able to interactively browse the output on the CI.
(cherry picked from commit 89c5b31)
Pull Request Description
This PR improves the output of benchmarks run on GitHub Actions CI by grouping and folding the output of each benchmark.
Checklist
Please ensure that the following checklist has been satisfied before submitting the PR:
Java