Wimpy could execute a shell script right after deploying the new version, and before deleting the old one.
This script could check error ratio in /metrics endpoint (or whatever logic is needed) to determine if the new version is good, and the old one can be removed.
The default script could just exit 0