-
Notifications
You must be signed in to change notification settings - Fork 486
[WIP]feat: support Speculative Decoding by Sglang Eagle algo #1176
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
TaoZex
wants to merge
104
commits into
inclusionAI:main
Choose a base branch
from
TaoZex:spec_v1
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Changes from 3 commits
Commits
Show all changes
104 commits
Select commit
Hold shift + click to select a range
4f7686d
feat: speculative decode init
71cecba
feat: add config
ac145b8
feat: fix log
c82b174
feat: fix
3282366
feat: fix
fffe03e
feat: fix
76999a3
feat: fix
4682168
feat: fix
8a9e8d1
feat: fix code
e955708
feat: add log
985cb26
fet: fix
6670f3d
feat: fix
6fdcbc7
feat: fix code
261c9c7
feat: fix mtp
bd36e6a
feat: remove
4924ee4
feat: change config
9f7796b
feat: improve mtp_loss_scaling_factor
e0471c3
feat: fix config
0bc4616
feat: fix
031fa89
feat: fix local for test
b36ca69
feat: bug fix
a07dc22
feat: fix
3a56483
feat: add base config
05bdf24
feat: remove mtp keep
41ac1ab
feat: fix mtp loss
dd4eb5e
feat: fix ckpt
7c27544
feat: fix config oom
f719dad
feat: fix no mtp
1bcabea
feat: fix config
107b8f3
feat: fix
7cb5e49
feat: fix
7cbfe81
feat: add qwen
0a32066
feat: remove enable_draft_weights_cpu_backup
93baaf8
feat: fix mtp loss
cdeebcb
feat: fix OOM
f79b5c5
feat: revert
f553572
feat: add mem log
53a3b2d
feat: rm log
b4bbb1f
feat: sample log
df7d918
feat: fix
f50e604
feat: fix mtp gradient
b40a55b
feat: fix again
ca39b2e
featοΌ fix
1889b1a
feat(engine): add mtp weight update
bbc9deb
feat(mtp): fix mtp weight update
ee24e8d
fix(controller): fix callback
e031206
fix(controller): skip _NO_PROXY
aaa3aa5
fix(controller): fix update
4d04c35
feat(controller): add log
a1c3e82
fix(engine): cuda ipc sync
b23abd1
fix(megatron): add log
e7c3f7b
fix(engine): improve serialize
dd3eeea
fix(engine): skip NCCL broadcast
1e6a453
fix(engine): improve
e7a6b38
fix(engine): fix nccl block
7028020
refactor(rollout_controller): add log metric
cfd9115
fix(engine): fix CUDA stream
d373f03
feat(megatron): add log
8c070b7
fix(rollout_controller): add
a4b48c7
feat(megatron): fix
c802bee
refactor(megatron_engine): improve
3af6904
fix(mcore): deal eh_proj.weight
4b2e96a
fix(megatron_engine): remove code
57061fe
fix(megatron_engine): grad
fa88152
feat(megatron_engine): add mtp log
02dc326
fix: use _logger
f8c2dab
fix(engine): fix mtp gradient
7e4118a
feat(mtp): add mtp lr
2276771
fix(engine): add mtp clip
b4f5543
refactor(megatron_engine): mv
00e4497
feat(megatron_engine): ad
3117ccf
fix: h20 config
6dba807
perf: fix config
a9161e7
feat: add log
5feca78
fix(scheduler): worker checkγ
a5177cc
fix(infra): fix net
4e06ba9
fix(net): add callback(need rethink)
63497c7
fix(engine): double scale
5b84634
feat(actor): fix mtp_lr_scale
056724c
fix(engine): fix mtp gradient numbatch
8b90666
fix(engine): lr
be8c1b0
feat(engine): megatron log
b55f862
feat(engine): audit log
23ebf0e
fix(megatron_engine): mimo weight update
b0d9363
fix: scale up mtp_lr_scale
9c29945
fix(megatron_engine): add log
410fb90
feat(megatron_engine): fp32 weight update
8c3f60d
feat(engine): add full stage log
56e5e08
fix(engine): fix mtp
918fb3f
fix(megatron_engine): mtp nccl error
8913215
fix(engine): fix again
ba01036
fix(megatron_engine): fix
4933461
feat(engine): improve
4b9a8e7
feat(megatron_engine): verify fp32 weight
3ea975f
feat(engine): fix again
d76a4ff
feat(infra): read sglang weight for verify
866e6a1
feat(engine): fix
dc6571b
feat(megatron_engine): verify log
dcb2e44
fix(engine): mtp issue
3ee9a06
feat(controller): fix
59b319c
feat(controller): fix1
832025c
fix(engine): fix
2dff0d9
fix(engine): v36
17cf72e
fix(engine): fix
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
For consistency with other configuration fields in this dataclass,
enable_multi_layer_eagleshould be defined usingfield(). This also provides an opportunity to add ahelpstring in the metadata for better documentation and discoverability through CLI help messages.