Skip to content

feat: add Qwen3.5 35B A3B model support on TRN2#48

Open
YantaoShen wants to merge 1 commit intoaws-neuron:mainfrom
YantaoShen:feat/qwen3.5-35b-a3b
Open

feat: add Qwen3.5 35B A3B model support on TRN2#48
YantaoShen wants to merge 1 commit intoaws-neuron:mainfrom
YantaoShen:feat/qwen3.5-35b-a3b

Conversation

@YantaoShen
Copy link
Copy Markdown

Issue #, if available:

Description of changes:

feat: add Qwen3.5 35B A3B model support on TRN2

Add inference support for Qwen3.5-35B-A3B (MoE) on AWS Trainium2.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@YantaoShen YantaoShen requested a review from a team March 30, 2026 23:07
all_logits = local_logits

# Argmax on CPU
next_id = all_logits.argmax(dim=-1, keepdim=True).to(dtype=torch.int) # (B, 1)
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's make sure this is properly fixed before merging

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants