feat: Add Qwen3.5 support to Megatron backend #637

Draft
vivekkalyan wants to merge 3 commits into feat/merged-mode-megatron from feat/qwen35-megatron

Conversation


@vivekkalyan vivekkalyan commented Apr 2, 2026

Summary

This adds Qwen3.5 MoE support to the Megatron backend on top of #636.

With this stack, ART can now train Qwen/Qwen3.5-35B-A3B with Megatron while serving inference from the dedicated vLLM process using merged weight updates.
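As an illustration of the merged-weight update flow described above, here is a minimal, hedged sketch: the trainer holds base weights plus a LoRA adapter, merges the adapter into full-rank weights after a step, and ships the result to the separate inference process. All function and key names (`merge_lora`, `push_to_inference`, `"layer0.attn.qkv"`) are hypothetical stand-ins, not ART's or vLLM's actual APIs; the LoRA merge math itself is the standard `W' = W + (alpha/r) * B @ A`.

```python
import numpy as np

def merge_lora(base, lora_a, lora_b, alpha, rank):
    """Merge a LoRA adapter into a base weight: W' = W + (alpha/r) * B @ A."""
    return base + (alpha / rank) * (lora_b @ lora_a)

def push_to_inference(server_weights, name, merged):
    """Stand-in for the real transfer to the dedicated vLLM process."""
    server_weights[name] = merged

rng = np.random.default_rng(0)
d, r, alpha = 8, 2, 16
base = rng.standard_normal((d, d))
lora_a = rng.standard_normal((r, d))  # LoRA down-projection A
lora_b = np.zeros((d, r))             # LoRA up-projection B starts at zero

serving = {}
# Before training: B is zero, so the merged weight equals the base weight.
push_to_inference(serving, "layer0.attn.qkv",
                  merge_lora(base, lora_a, lora_b, alpha, r))
assert np.allclose(serving["layer0.attn.qkv"], base)

# After a "training step" updates B, re-merging yields new serving weights.
lora_b += 0.01
push_to_inference(serving, "layer0.attn.qkv",
                  merge_lora(base, lora_a, lora_b, alpha, r))
assert not np.allclose(serving["layer0.attn.qkv"], base)
```

The point of merging before export is that the inference process never needs LoRA-aware kernels; it only ever sees ordinary full-rank weights.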

What changed

  • Bump megatron-bridge to a revision that includes upstream Qwen3 support needed for this path
  • Teach the Megatron provider path to accept Qwen3.5 MoE Bridge models and patch the hybrid layer spec so ART's flex attention only applies to the standard attention layers
  • Add Qwen3.5 LoRA coverage for:
    • gated delta net in-projection
    • shared experts
    • the Qwen3.5 attention packing layout while preserving existing Qwen3 behavior
  • Replace the old ART-specific merged-weight math with a Bridge compatibility layer that converts ART adapters into Bridge AdapterWeights for export and merge
  • Add provider, wrapper, and helper coverage for the Qwen3.5 path
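The hybrid-layer-spec patch in the list above can be sketched as follows. Qwen3.5 MoE interleaves gated-delta-net (linear attention) layers with standard attention layers, so a flex-attention override must be applied only to the latter. The `LayerSpec` dataclass, the `kind` strings, and the 3:1 layer pattern here are illustrative assumptions, not megatron-bridge's real data structures.

```python
from dataclasses import dataclass

@dataclass
class LayerSpec:
    index: int
    kind: str                         # "attention" or "gated_delta_net"
    attention_impl: str = "default"

def patch_flex_attention(specs):
    """Swap in flex attention on standard attention layers only,
    leaving gated-delta-net layers untouched."""
    for spec in specs:
        if spec.kind == "attention":
            spec.attention_impl = "flex_attention"
    return specs

# A toy hybrid pattern: three delta-net layers, then one attention layer.
pattern = ["gated_delta_net"] * 3 + ["attention"]
specs = [LayerSpec(i, pattern[i % 4]) for i in range(8)]
patched = patch_flex_attention(specs)

assert all(s.attention_impl == "flex_attention"
           for s in patched if s.kind == "attention")
assert all(s.attention_impl == "default"
           for s in patched if s.kind == "gated_delta_net")
```

Filtering by layer kind rather than patching every layer is what keeps ART's flex attention from clobbering the linear-attention layers.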

Validation

  • Clean-cluster Qwen3.5 Megatron dedicated + merged smoke test completed through step 2
  • Clean-cluster 20-step yes/no/maybe run completed successfully for Qwen/Qwen3.5-35B-A3B

@vivekkalyan vivekkalyan force-pushed the feat/qwen35-megatron branch from 9fcc5e9 to 3ad7d06 on April 2, 2026 at 03:08