Running the rest of the test suite is pointless if the general features don't pass, yet those features are currently queued at the end. We should either run behave twice (first for the common Docker features, then for the other tests) or make sure the common features run first.
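A rough sketch of the "run behave twice" option, in case it helps the discussion. It assumes the common features live under a directory named `common`, which is a guess at the layout, and the helper names (`split_features`, `run_in_two_phases`) are hypothetical. It relies only on `behave` accepting feature paths as positional arguments and exiting non-zero on failure.

```python
import subprocess
from pathlib import PurePosixPath

# Assumption: common features sit in a directory with this name.
COMMON_DIR = "common"


def is_common(path, common_dir=COMMON_DIR):
    """True if any directory component of the path is the common dir."""
    return common_dir in PurePosixPath(path).parts


def split_features(paths, common_dir=COMMON_DIR):
    """Return (common, others), each sorted for a stable run order."""
    common = sorted(p for p in paths if is_common(p, common_dir))
    others = sorted(p for p in paths if not is_common(p, common_dir))
    return common, others


def run_behave(paths):
    """Invoke behave on the given feature paths and return its exit code."""
    return subprocess.run(["behave", *paths]).returncode


def run_in_two_phases(paths, runner=run_behave):
    """Run common features first; skip everything else if they fail."""
    common, others = split_features(paths)
    if common:
        status = runner(common)
        if status != 0:
            # Common features failed, so the remaining tests are not started.
            return status
    if others:
        return runner(others)
    return 0
```

The `runner` parameter exists only so the ordering logic can be exercised without invoking behave. The other option, keeping a single run and passing the paths as `common + others`, would reuse `split_features` unchanged.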