Product shape #101

axch · 2017-04-19T03:06:50Z

Thorough rewrite of the Aronnax Python package, carrying out the packaging plan outlined in Issue #95 sufficiently that the remaining items are decoupled and incrementalizable.

Effects on issues:

Closes Implement product shape #95, by construction, except for the parts not implemented. These will be filed as separate tickets.
Closes Discrepancy in dx values invalidates "known good" test outputs #96.
Makes progress toward Create a pip-installable aronnax PyPI package #87, in that it is now much more clear what the UI of that package will be.
Does not needlessly impede Restartability #67 (restartability), in that the full configuration of a run is written to disk before the run begins.
Obsoletes and thus closes Do we want to allow command-line overriding of parameters from parameters.in? #62 (command-line overrides). Its content is implemented in the interface of driver.simulate.
Lays ground work for Output model data in a portable and friendly format #30 (NetCDF output), in that there is now a workflow on which to hang any such output conversion.
Unblocks Make the user-facing examples complete #23 (user-facing examples), by defining the interface to make examples of.
Closes Ship helpers for preparing common patterns of input #22 (input generation helpers), by defining them.

… I want to reuse in the driver.

…d polite output.

…or now) data.

…ctangular pool".

…generated raw files, not their specs.

Also suppress ConfigParser's default behavior of case-normalizing the options.

… no input data was specified.

…f output_preservation_test.run_experiment.

…tput_preservation_test.run_experiment.

… up to edoddridge#96.

Turns out the driver was ignoring the botDrag parameter.

…h its config file name to its new Python name.

… of the right values.

Fixes edoddridge#96.

…line. This whole thing looks like something of a mess, but now it should at least be a navigable mess.

edoddridge · 2017-04-19T12:34:35Z

You weren't kidding - this is a serious rewrite!

I think I like the structure, but I'm struggling to get it running. After running python setup.py install the python files get copied over to a new directory under Anaconda, but they don't take the fortran core or the makefile with them. This means that compile_core tries to run in a directory without these files, and hence it fails. From some online sluething, it seems that we can use a MANIFEST.in file to make sure these files get copied across when the package is installed.

Unless I've missed something obvious?

axch · 2017-04-19T13:11:34Z

Travis seems to agree with you. This is actually an old bug, but I think I know why it is manifesting now and didn't manifest earlier. To wit, before, the .f90 file and the Makefile were, in effect, being searched for relative to the test suite rather than relative to the installed package.

I also know why I didn't experience this problem during development: I had install Aronnax with pip install -e ., which creates symbolic links in the installation directory that point back at the source files. This arrangement is convenient for iterating, because I don't have to reinstall every time I edit the source, but masks the problem that not enough files are, in fact, installed. If that is a sufficient workaround for you to be able to continue the review, I'd like to think a bit on the best way to resolve this issue more permanently.

edoddridge · 2017-04-19T13:39:43Z

Thanks - using pip install -e . means that the test suite runs, and will let me continue the review.

…This is a work around and needs a better long term fix.

… component - it is a scalar field, not a vector field

…cstring

edoddridge · 2017-04-19T14:53:17Z

Comments:

I like the implementation of an input configuration file, that then gets altered and written to a new file to become the version that is used for the simulation and the basis of the parameters.in file for the fortran core.
As we discussed, this PR introduces lots of undocumented features and code, but that documentation can be incrementally included as the code and interface are refined. I don't think there is any point delaying this PR while we make the documentation more complete. As we use this new interface I expect we'll find friction points and refine the code, thus necessitating changes to the documentation.

Things that I've changed:

rename the beta_plane Coriolis functions to include an f in the name. Otherwise it seems like they refer to the velocities, rather than the grid locations.
likewise, rename the f_plane Coriolis functions.
make the .travis.yml file use the pip install -e . command so that it can run the test suite. Once this PR is merged, one of us should file a ticket to fix that issue.
split the Coriolis functions and the wetmask function into separate sections in the documentation
edit the docstring of driver.simulate() to indicate that aronnax-merged.confandparameters.in` are generated automatically.
added a section to the documentation called "Running Aronnax"

Questions:

There is a function named default_configuration, but it doesn't seem to actually create a configuration file with the defaults, apart from setting the compile prefixes to false. Am I missing something?
- My conclusion from this is that it still requires the aronnax.conf file to exist and contain the defaults, which then get transferred to aronnax-merged.conf unless they are explicitly overwritten by the call to driver.simulate()`. I'm happy with that choice, but want to confirm that I've understood what is happening.
Would it be polite to have the driver notice if the output directories already exist, and write to new ones with an incremented number appended? This might be a pain to implement in the fortran core, but could certainly be done with the python generated NetCDF output. A fairly major downside of this is that it would break the correspondence between the config files and the output - it would be possible to edit the config, rerun the simulation and end up with output and config files that came from different simulations.
The aronnax-merged.conf files contain memory addresses for inputs that were generated from functions specified in aronnax.conf. Is that likely to be an issue for restarting simulations?

axch · 2017-04-19T15:22:11Z

Your changes look good to me. To your questions:

Yes, the default_configuration function is not a complete configuration, and aronnax.conf is required. Its purpose is to set default values for parameters required by the driver program, so that the driver does not crash if aronnax.conf is missing some fields that have reasonable defaults. It would be reasonable to extend this to provide sensible defaults for more parameters, especially "administrative" ones like DumpWind, but I actually think that forcing the user to look at the physics variables (e.g., by copying and modifying an example aronnax.conf) is better than trying to set defaults for them. You also correctly noticed that default_configuration does not save a default configuration file; it just operates in memory.
I don't think auto-incrementing names of directories is polite. Instead, it would be reasonable to do any subset of
- status quo, and warn users in the documentation to save expensive run outputs; or
- abort before starting the core if the output directory exists and is not empty, expecting users to move old runs out of the way; or
- teach the core to abort instead of overwriting a file that already exists; or
- offer the user runtime control of the output directory, e.g. by adding one more configuration parameter for it, perhaps with some combination of the above; or
- actually, offering the user runtime control of the input directory accomplishes the same effect, because then they become free to control the output directory by controlling the working directory of the process; or
- offering control of both directories; or
- finally, as an additional option above and beyond setting the output directory, we can add an auto-increment feature; or possibly some other sort of autogeneration of the name, like a content hash of the configuration file and/or the inputs.
If one provided an in-memory function to driver.simulate, one should not expect to be able to automatically rerun it from the merged configuration file alone. (Actually, it is possible to try to provide that capability by saving that function in a Python pickle file, or some such, but that's a pain to think about.) However, specifically restarting, specifically on the same system, can presumably be coded (as an additional feature) to skip the input specification and reuse the generated .bin input files. Then it doesn't matter that the custom function cannot be directly recovered. The same code can also cover the "tweak a parameter and rerun" use case.

Which of the above items are actionable enough to pull out as their own tickets, or as additions to tickets 102-106?

axch · 2017-04-19T15:23:30Z

Speaking of which, do you find that tickets 102-106 cover the gaps you see in this PR? You specifically mentioned documentation, which is #103.

edoddridge · 2017-04-19T15:31:21Z

Cool. Thanks for the explanations.

I agree that forcing the user to think, at least briefly, about the physical parameters is a good idea. I was just a little thrown that a function called default_configuration doesn't actually create a default configuration. Expanding this function to cover the housekeeping parameters is probably worth its own ticket.

Honestly, I think the status quo with a warning in the documentation is probably the best option. So let's leave it as is and I'll open a ticket to add a warning to the docs.

I should have mentioned that 102-106 look good. Thanks for opening them.

axch added 30 commits April 17, 2017 15:41

Start a utils module for stuff from the output preservation test that…

6cc8839

… I want to reuse in the driver.

Draft end-to-end driver program, with enough room for flexible use an…

8e9ae4d

…d polite output.

Draft schema for generation of data files.

5699ef8

Draft dumping the parameter file.

42e6f69

Interpret existing compile and run conventions in the driver framework.

cf4d9d7

Draft reading input data from raw Fortran files.

8fd8288

Fiddles from trying to actually run the draft driver.

890158a

Do what it takes to permit programmatic generation of layer height (f…

d3b0484

…or now) data.

Do what it takes to allow a config file to specify the wetmask as "re…

2007a93

…ctangular pool".

Add the u and v beta plane Coriolis components as generators.

a901e5b

More fiddles from trying to run the draft driver.

1624067

Format boolean values as Fortran expects, and write the names of the …

1a2f1cf

…generated raw files, not their specs.

Force all the sections to be present, since Fortran looks for them.

c1de42a

Also suppress ConfigParser's default behavior of case-normalizing the options.

It's spelled CONDITIONS, with two letters 'I'.

de76fa2

Oops, accidentally broke write_rectangular_pool.

eb6cb69

Force blank file names to be written for the core's benefit even when…

13b979a

… no input data was specified.

Rewrite the reduced gravity benchmark to use the new driver instead o…

d54a54e

…f output_preservation_test.run_experiment.

Rewrite the 2-layer benchmark to use the new driver as well.

7a1f14c

The benchmark script no longer depends on the test suite.

0416279

Apparently ConfigParser.getboolean doesn't understand Python booleans :(

fc011ef

Rewrite the profiling script in terms of the new driver instead of ou…

64400b1

…tput_preservation_test.run_experiment.

Flush broken comment.

a20f46e

Rewrite test_gaussian_bump_red_grav to use the new driver. It passes,…

1dfe4ea

… up to edoddridge#96.

Explicit variable name makes it a bit easier to add debug prints.

2895d11

Port the beta plane bump test to the new driver.

b892b97

Turns out the driver was ignoring the botDrag parameter.

Flush another mis-copied comment.

cca22fc

More useful debugging output for the agreement tests.

5ffc229

Don't need these test input generation helpers anymore.

54efda7

Ignore generated configuration files.

e22d511

Rewrite test_f_plane_red_grav in terms of the new driver.

ac3c5b5

axch added 11 commits April 18, 2017 08:29

Port test_beta_plane_gyre_red_grav to the new driver.

1e963a0

Port test_beta_plane_gyre to new driver.

a9170b6

This helper code has been superseded by the new driver.

2b467dd

All of these writer helpers are subsumed by the new driver too.

4b3ac7a

Update documentation references to new input generators.

f22c4e3

Rename "interpret_initial_heights" to the less awkward "depths"; matc…

9c8371f

…h its config file name to its new Python name.

Try to guess type annotations for some input files, but I am not sure…

c54b7b9

… of the right values.

Stop working around edoddridge#96, and update known-good outputs.

91c1a60

Fixes edoddridge#96.

Document the Aronnax driver functions.

dbdabd2

Marginally better documentation of the idealized data generation pipe…

997816a

…line. This whole thing looks like something of a mess, but now it should at least be a navigable mess.

Merge remote-tracking branch 'origin/master'

83ae6d5

This was referenced Apr 19, 2017

Incrementally complete the packaging plan #102

Open

Clean up the implementation of the new Aronnax driver and environs #106

Open

edoddridge added 8 commits April 19, 2017 09:52

rename Coriolis field generators to include "f"

6150c82

make Travis install package in such a way that the test suite works. …

392d68b

…This is a work around and needs a better long term fix.

change Coriolis function names in documentation too.

8456a68

edit Coriolis function docstrings to say u or v location, rather than…

f7dd2a7

… component - it is a scalar field, not a vector field

direcotry -> directly

9748b2a

put Coriolis functions in separate section from wetmask function

0a3b8d8

state which configuration files are automatically generated.

c5b97f0

add "running Aronnax" section to docs and modify driver.simulate() do…

04e6727

…cstring

rename coriolis functions in benchmark simulations

3a113eb

edoddridge merged commit 2e839e0 into edoddridge:master Apr 19, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Product shape #101

Product shape #101

axch commented Apr 19, 2017

edoddridge commented Apr 19, 2017

axch commented Apr 19, 2017

edoddridge commented Apr 19, 2017

edoddridge commented Apr 19, 2017

axch commented Apr 19, 2017

axch commented Apr 19, 2017

edoddridge commented Apr 19, 2017

Product shape #101

Product shape #101

Conversation

axch commented Apr 19, 2017

edoddridge commented Apr 19, 2017

axch commented Apr 19, 2017

edoddridge commented Apr 19, 2017

edoddridge commented Apr 19, 2017

axch commented Apr 19, 2017

axch commented Apr 19, 2017

edoddridge commented Apr 19, 2017