
Cache _load_architecture by model_type #36

Merged: YWHyuk merged 1 commit into main from claude/cache-architecture-yaml on May 19, 2026
Conversation

@YWHyuk YWHyuk commented May 19, 2026

The architecture YAML was reloaded once per scheduler iteration when running with --analytical-modeling. The non-analytical path goes through _load_perf_db, which is cached, but the analytical branch in _build_trace_ctx calls _load_architecture directly with no cache. This change caches _load_architecture by model_type.

For a 33-iteration baseline run on Llama-3.1-8B, this accounted for ~12% of total wall-clock time (~10 ms per iteration spent in yaml.safe_load).

@YWHyuk YWHyuk merged commit ffa8d3c into main May 19, 2026
2 checks passed
@YWHyuk YWHyuk mentioned this pull request May 19, 2026
4 tasks