Skip to content

Update Tiny Llama example#161

Draft
marikurz-amd wants to merge 3 commits into
mainfrom
update.tiny.transformer
Draft

Update Tiny Llama example#161
marikurz-amd wants to merge 3 commits into
mainfrom
update.tiny.transformer

Conversation

@marikurz-amd
Copy link
Copy Markdown
Collaborator

WIP

…the memory advantages of flash attention and its linear scaling. Documentation needs to be updated to correctly show peak memory usage results.
…some current reference results, discussion of results and memory evolution fused vs. unfused.
@marikurz-amd marikurz-amd self-assigned this May 19, 2026
@marikurz-amd marikurz-amd marked this pull request as draft May 19, 2026 10:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant