[WAN] Use different sharding strategy for self and cross attention.#250
Closed
hyeygit wants to merge 1 commit intoAI-Hypercomputer:mainfrom
Closed
[WAN] Use different sharding strategy for self and cross attention.#250hyeygit wants to merge 1 commit intoAI-Hypercomputer:mainfrom
hyeygit wants to merge 1 commit intoAI-Hypercomputer:mainfrom