Record rollout start time and show rollout latency in UI #398
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Motivation
created_attimestamps refer to the invocation and cannot be used to compute per-rollout start times.Description
rollout_start_timetoExecutionMetadataineval_protocol/models.pyand extended the TypeScript schema invite-app/src/types/eval-protocol.tswithrollout_start_time,rollout_duration_seconds, andeval_duration_seconds.rollout_start_timeat rollout start in the main rollout entry points by setting it before the processing timer in processors such asdefault_single_turn_rollout_process.py,default_pydantic_ai_rollout_processor.py,remote_rollout_processor.py,github_action_rollout_processor.py,openenv_rollout_processor.py,default_klavis_sandbox_rollout_processor.py,default_agent_rollout_processor.py,tinker_rollout_processor.py,priority_scheduler.py, andmcp/execution/manager.py.Rollout Latencycolumn invite-app/src/components/EvaluationTable.tsx, aRowRolloutDurationrenderer, and wiring the cell invite-app/src/components/EvaluationRow.tsxto displayexecution_metadata.rollout_duration_secondsformatted as seconds.rollout_duration_secondsassignments remain unchanged) while providing the start timestamp for future trace alignment.Testing
make pre-committo run linters/type checks but it failed because thepre-committool is not installed in the environment.npm installinvite-appto validate frontend dependencies but it failed withCannot read properties of null (reading 'matches')fromnpmin this environment.git commit) after the edits completed successfully.Codex Task
Note
Adds explicit rollout timing to enable accurate per-rollout latency and sorting.
rollout_start_timetoExecutionMetadataplusrollout_duration_secondsandeval_duration_secondsfields; keep existing duration plumbing; serialize inmodels.pyand TSExecutionMetadataSchema.execution_metadata.rollout_start_timeat rollout start and computerollout_duration_secondsintinker_rollout_processor.py,mcp/execution/manager.py,default_single_turn_rollout_process.py,default_pydantic_ai_rollout_processor.py,default_agent_rollout_processor.py,default_klavis_sandbox_rollout_processor.py,openenv_rollout_processor.py,remote_rollout_processor.py,github_action_rollout_processor.py, andpriority_scheduler.py.EvaluationTable.tsxand render withRowRolloutDurationinEvaluationRow.tsxusingexecution_metadata.rollout_duration_seconds.Written by Cursor Bugbot for commit 9171bc5. This will update automatically on new commits. Configure here.