[WIP]feat: support Speculative Decoding by Sglang Eagle algo by TaoZex · Pull Request #1176 · inclusionAI/AReaL

TaoZex · 2026-04-13T13:32:52Z

Description

Related Issue

Fixes #(issue)

Type of Change

Checklist

I have read the Contributing Guide
Pre-commit hooks pass (pre-commit run --all-files)
Relevant tests pass; new tests added for new functionality
Documentation updated (if applicable; built with ./docs/build_all.sh)
Branch is up to date with main
Self-reviewed via /review-pr command
This PR was created by a coding agent via /create-pr
This PR is a breaking change

Breaking Change Details (if applicable):

Additional Context

Need help? Check the Contributing Guide or ask in
GitHub Discussions!

gemini-code-assist

Code Review

This pull request introduces support for speculative decoding using EAGLE and Multi-Token Prediction (MTP) online training. Key changes include the addition of MTP and speculative decoding configuration fields across various API and engine components, implementation of MTP loss collection and gradient isolation in the Megatron engine, and the inclusion of speculative decoding statistics in model responses. Additionally, the PR provides comprehensive documentation, example configurations, and end-to-end tests. Feedback focuses on improving configuration consistency in SGLangConfig, removing redundant no-op string replacements in the MTP layer conversion utility, and simplifying logic in the RLVR workflow by removing a redundant conditional check.

gemini-code-assist · 2026-04-13T13:36:07Z

areal/api/cli_args.py

+            "help": "Attention mode for speculative decoding. E.g., 'full', 'sparse'."
+        },
+    )
+    enable_multi_layer_eagle: bool = False


For consistency with other configuration fields in this dataclass, enable_multi_layer_eagle should be defined using field(). This also provides an opportunity to add a help string in the metadata for better documentation and discoverability through CLI help messages.

enable_multi_layer_eagle: bool = field( default=False, metadata={"help": "Enable multi-layer EAGLE for speculative decoding."}, )

gemini-code-assist · 2026-04-13T13:36:08Z

areal/engine/megatron_utils/megatron.py

+    hf_remainder = hf_remainder.replace("enorm.weight", "enorm.weight")
+    hf_remainder = hf_remainder.replace("hnorm.weight", "hnorm.weight")


These lines are no-ops as they replace a string with itself. They should either be corrected to perform the intended name mapping as suggested by the comment on line 191, or removed if they are not needed.

gemini-code-assist · 2026-04-13T13:36:08Z

areal/workflow/rlvr.py

+            accept_rate = (
+                resp.spec_accept_token_num / resp.spec_draft_token_num
+                if resp.spec_draft_token_num > 0
+                else 0.0
+            )


The check if resp.spec_draft_token_num > 0 is redundant here, as it's already guaranteed by the outer if condition on line 135. You can simplify this to a direct division.

accept_rate = resp.spec_accept_token_num / resp.spec_draft_token_num

bingyechen added 3 commits April 13, 2026 21:12

feat: speculative decode init

4f7686d

feat: add config

71cecba

feat: fix log

ac145b8

gemini-code-assist bot reviewed Apr 13, 2026

View reviewed changes

bingyechen added 21 commits April 13, 2026 22:38

feat: fix

c82b174

feat: fix

3282366

feat: fix

fffe03e

feat: fix

76999a3

feat: fix

4682168

feat: fix code

8a9e8d1

feat: add log

e955708

fet: fix

985cb26

feat: fix

6670f3d

feat: fix code

6fdcbc7

feat: fix mtp

261c9c7

feat: remove

bd36e6a

feat: change config

4924ee4

feat: improve mtp_loss_scaling_factor

9f7796b

feat: fix config

e0471c3

feat: fix

0bc4616

feat: fix local for test

031fa89

feat: bug fix

b36ca69

feat: fix

a07dc22

feat: add base config

3a56483

feat: remove mtp keep

05bdf24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP]feat: support Speculative Decoding by Sglang Eagle algo#1176

[WIP]feat: support Speculative Decoding by Sglang Eagle algo#1176
TaoZex wants to merge 24 commits intoinclusionAI:mainfrom
TaoZex:spec_v1

TaoZex commented Apr 13, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Apr 13, 2026

Uh oh!

gemini-code-assist bot Apr 13, 2026

Uh oh!

gemini-code-assist bot Apr 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		hf_remainder = hf_remainder.replace("enorm.weight", "enorm.weight")
		hf_remainder = hf_remainder.replace("hnorm.weight", "hnorm.weight")

Conversation

TaoZex commented Apr 13, 2026

Description

Related Issue

Type of Change

Checklist

Additional Context

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant