Skip to content

feat: FMA actuation path classification, hit rates, and per-path timing#1429

Draft
rubambiza wants to merge 1 commit into
llm-d:mainfrom
rubambiza:feature/fma-metrics-phases-1-2
Draft

feat: FMA actuation path classification, hit rates, and per-path timing#1429
rubambiza wants to merge 1 commit into
llm-d:mainfrom
rubambiza:feature/fma-metrics-phases-1-2

Conversation

@rubambiza
Copy link
Copy Markdown

Summary

  • Replace pod-name heuristic for actuation path classification with timestamp-based logic (launcher creationTimestamp vs requester creationTimestamp)
  • Rename T_LUKE_WARM to T_COLD_LAUNCHER to align with updated FMA terminology
  • Add Hot_hit_rate, Warm_hit_rate, cold_launcher_rate per iteration
  • Add upper-bound per-path timing: T_wake (hot), T_instance_create (warm), T_cold_launcher (cold)

Test plan

  • Run FMA scenario with hot-start config and verify T_hot classification + t_wake populated
  • Run FMA scenario with warm-start config and verify T_warm classification + t_instance_create populated
  • Run FMA scenario with cold-start config (no pre-existing launcher) and verify T_cold_launcher classification + t_cold_launcher populated
  • Verify hit rates sum to 1.0 per iteration
  • Verify JSON output includes new fields without breaking existing consumers

Related

…timing

Phase 1: Replace pod-name heuristic with timestamp-based classification.
Launcher creationTimestamp vs requester creationTimestamp determines
warm (pre-existing launcher) vs cold-with-launcher (DPC created new).
Add Hot_hit_rate, Warm_hit_rate, cold_launcher_rate per iteration.

Phase 2: Compute upper-bound per-path timing using Kube timestamps:
T_wake (hot), T_instance_create (warm), T_cold_launcher (cold).

Rename T_LUKE_WARM to T_COLD_LAUNCHER across harness and analysis.

Closes llm-d#1422

Assisted-By: Claude Opus 4.6 <noreply@anthropic.com>
Signed-off-by: Gloire Rubambiza <gloire@ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Feature]: FMA actuation path classification, hit rates, and per-path timing (Phases 1-2)

1 participant