Model hyperparameter tuning #933
Comments
I've started a notebook here: nn_hyperparameters. It uses Optuna optimisation with 50 trials.
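For readers unfamiliar with Optuna, a 50-trial search might look roughly like the sketch below. This is not the notebook's code: the search space (`hidden_dim`, `learning_rate`, `dropout`), the ranges, and the `toy_validation_score` function are all hypothetical stand-ins, and a real objective would train the network and return a validation metric.

```python
import math


def toy_validation_score(hidden_dim, learning_rate, dropout):
    """Hypothetical placeholder for 'train the ANN and return a validation metric'.

    It peaks (at 0) for hidden_dim=128, learning_rate=1e-3, dropout=0.2 and is
    only here so the sketch runs without any training data.
    """
    return -(
        (math.log10(learning_rate) + 3.0) ** 2
        + ((hidden_dim - 128) / 128.0) ** 2
        + (dropout - 0.2) ** 2
    )


def objective(trial):
    # Assumed search space; the notebook's actual parameters may differ.
    hidden_dim = trial.suggest_int("hidden_dim", 32, 512, log=True)
    learning_rate = trial.suggest_float("learning_rate", 1e-5, 1e-1, log=True)
    dropout = trial.suggest_float("dropout", 0.0, 0.5)
    return toy_validation_score(hidden_dim, learning_rate, dropout)


def run_study(n_trials=50):
    # Imported here so the helpers above can be reused without Optuna installed.
    import optuna

    study = optuna.create_study(direction="maximize")
    study.optimize(objective, n_trials=n_trials)
    return study
```

Calling `study = run_study()` and then reading `study.best_params` and `study.best_value` gives the best configuration found across the 50 trials.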
See the most up-to-date tuning notebook here, which uses the LlamaCPP MiniLM version for consistency with CodeGate: nn_hyperparameters-llamacpp.ipynb

Training is on Linux and macOS command data only (n=6087), and the test dataset has been extended with further synthetic commands to n=699.

Training set:

Test set:

Note: the test set is not labelled by OS. This is something I should do, but for now treating all of the commands as Linux/macOS should be a reasonable approximation due to their similarity.

The best trial hyperparameters were:
🔍 Model Evaluation on Test Set:
I've added this model to the repository in ONNX (nn-0503.onnx) and PT (nn-0503.pt) formats. I'll close this out for now.
We need to explore the dimensions and learning parameters of the ANN and optimise it.