A cost-based parameter tuning tool for the Spark framework
- compile and package code/spark-1.4.0
- install code/spark-1.4.0 on your cluster
- shrink the input data and submit an application to the Spark cluster
- feed the application's runtime logs to code/WhatIf to get the best config
- rerun the application with the best config on the whole input data
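
The "shrink the input data" step is not spelled out here. One possible way to do it for line-oriented text input is uniform random sampling, sketched below. This is only an illustration under stated assumptions: the helper names `shrink_lines` and `shrink_file`, the sampling fraction, and the seed are hypothetical and are not part of this repository, and other input formats would need a different approach.

```python
import random


def shrink_lines(lines, fraction, seed=0):
    """Keep each line independently with probability `fraction`.

    Hypothetical helper, not part of this repository.
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    rng = random.Random(seed)  # fixed seed makes the sample reproducible
    return [line for line in lines if rng.random() < fraction]


def shrink_file(src_path, dst_path, fraction, seed=0):
    """Stream src_path to dst_path, keeping roughly `fraction` of its lines.

    Returns the number of lines written. Assumes one record per line.
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    rng = random.Random(seed)
    kept = 0
    with open(src_path) as src, open(dst_path, "w") as dst:
        for line in src:
            if rng.random() < fraction:
                dst.write(line)
                kept += 1
    return kept
```

For example, `shrink_file("input.txt", "input_small.txt", 0.05)` would write about 5% of the lines of a hypothetical `input.txt` to `input_small.txt`, which could then serve as the input for the profiling run.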