@@ -7,16 +7,16 @@ ______________________________________________________________________

  ## CRITICAL Level (Must use Opus)

- | Change Type | File Path Pattern | Code Pattern |
- | ---------------------- | ------------------------------------------------------------- | ----------------------------------------------------------- |
- | **ARCHON_CORE** | `areal/experimental/models/archon/` | - |
- | **ARCHON_PARALLEL** | `parallel_dims.py` | `ArchonParallelDims`, `_build_mesh`, `DeviceMesh` |
- | **ARCHON_MOE** | `archon/moe/` | `router`, `grouped_experts`, `TokenReorderer`, `grouped_mm` |
- | **ARCHON_PARALLELIZE** | `qwen*/infra/parallelize.py` | `apply_moe_ep_tp`, `apply_tp`, `apply_cp` |
- | **ARCHON_ENGINE** | `areal/experimental/engine/archon_engine.py` | `ArchonEngine` |
- | **FSDP_CORE** | `areal/utils/fsdp/`, `areal/engine/fsdp_engine.py` | `FSDP`, `FullyShardedDataParallel`, `fully_shard` |
- | **MEGATRON_CORE** | `areal/engine/megatron_engine.py`, `areal/utils/megatron*.py` | `MegatronEngine` |
- | **DCP_CHECKPOINT** | - | `DCP`, `DistributedCheckpoint`, `dcp.save`, `dcp.load` |
+ | Change Type | File Path Pattern | Code Pattern |
+ | ---------------------- | ----------------------------------------------------------------- | ----------------------------------------------------------- |
+ | **ARCHON_CORE** | `areal/experimental/models/archon/` | - |
+ | **ARCHON_PARALLEL** | `parallel_dims.py` | `ArchonParallelDims`, `_build_mesh`, `DeviceMesh` |
+ | **ARCHON_MOE** | `archon/moe/` | `router`, `grouped_experts`, `TokenReorderer`, `grouped_mm` |
+ | **ARCHON_PARALLELIZE** | `qwen*/infra/parallelize.py` | `apply_moe_ep_tp`, `apply_tp`, `apply_cp` |
+ | **ARCHON_ENGINE** | `areal/experimental/engine/archon_engine.py` | `ArchonEngine` |
+ | **FSDP_CORE** | `areal/engine/fsdp_utils/`, `areal/engine/fsdp_engine.py` | `FSDP`, `FullyShardedDataParallel`, `fully_shard` |
+ | **MEGATRON_CORE** | `areal/engine/megatron_engine.py`, `areal/engine/megatron_utils/` | `MegatronEngine` |
+ | **DCP_CHECKPOINT** | - | `DCP`, `DistributedCheckpoint`, `dcp.save`, `dcp.load` |

  ## HIGH Level (Recommend Opus)

@@ -33,19 +33,19 @@ ______________________________________________________________________

  ## MEDIUM Level (Use Sonnet)

- | Change Type | File Path Pattern | Code Pattern |
- | ----------------------- | ---------------------------------------------------------------------------------- | ------------------------------------------------------------------------ |
- | **TENSOR_OPS** | - | `.view(`, `.reshape(`, `dtype=`, `.detach()`, `no_grad`, `.contiguous()` |
- | **NUMERICAL** | - | `log(`, `softmax`, `cross_entropy`, `eps=`, `.clamp(`, `nan`, `inf` |
- | **WORKFLOW_ENGINE** | `areal/workflow/`, `areal/engine/` | `arun_episode`, `agenerate`, `RolloutWorkflow` |
- | **API_CONFIG** | `areal/api/` | `@dataclass`, `__post_init__`, `field(` |
- | **COMPILE** | - | `torch.compile`, `_dynamo`, `mark_dynamic`, `fullgraph` |
- | **ACTIVATION_CKPT** | `activation_checkpoint.py` | `activation_checkpoint`, `checkpoint_wrapper`, `selective_checkpoint` |
- | **CHECKPOINT_RECOVERY** | `areal/utils/saver.py`, `areal/utils/recover.py`, `areal/utils/fsdp/checkpoint.py` | `state_dict`, `load_state_dict`, `checkpoint` |
- | **REWARD** | `areal/reward/` | `reward_fn`, `AsyncRewardWrapper`, `MathVerifyWorker` |
- | **DATASET** | `areal/dataset/` | `get_*_dataset`, `DataLoader`, `IterableDataset` |
- | **LAUNCHER_SCHEDULER** | `areal/infra/launcher/`, `areal/infra/scheduler/`, `areal/infra/rpc/` | `LaunchConfig`, `Scheduler`, `RayLauncher`, `SlurmLauncher` |
- | **ATTENTION** | `attention/`, `attention/sdpa.py`, `attention/varlen.py` | `flash_attn`, `sdpa`, `varlen`, `causal_mask` |
+ | Change Type | File Path Pattern | Code Pattern |
+ | ----------------------- | ----------------------------------------------------------------------------------------- | ------------------------------------------------------------------------ |
+ | **TENSOR_OPS** | - | `.view(`, `.reshape(`, `dtype=`, `.detach()`, `no_grad`, `.contiguous()` |
+ | **NUMERICAL** | - | `log(`, `softmax`, `cross_entropy`, `eps=`, `.clamp(`, `nan`, `inf` |
+ | **WORKFLOW_ENGINE** | `areal/workflow/`, `areal/engine/` | `arun_episode`, `agenerate`, `RolloutWorkflow` |
+ | **API_CONFIG** | `areal/api/` | `@dataclass`, `__post_init__`, `field(` |
+ | **COMPILE** | - | `torch.compile`, `_dynamo`, `mark_dynamic`, `fullgraph` |
+ | **ACTIVATION_CKPT** | `activation_checkpoint.py` | `activation_checkpoint`, `checkpoint_wrapper`, `selective_checkpoint` |
+ | **CHECKPOINT_RECOVERY** | `areal/utils/saver.py`, `areal/utils/recover.py`, `areal/engine/fsdp_utils/checkpoint.py` | `state_dict`, `load_state_dict`, `checkpoint` |
+ | **REWARD** | `areal/reward/` | `reward_fn`, `AsyncRewardWrapper`, `MathVerifyWorker` |
+ | **DATASET** | `areal/dataset/` | `get_*_dataset`, `DataLoader`, `IterableDataset` |
+ | **LAUNCHER_SCHEDULER** | `areal/infra/launcher/`, `areal/infra/scheduler/`, `areal/infra/rpc/` | `LaunchConfig`, `Scheduler`, `RayLauncher`, `SlurmLauncher` |
+ | **ATTENTION** | `attention/`, `attention/sdpa.py`, `attention/varlen.py` | `flash_attn`, `sdpa`, `varlen`, `causal_mask` |

  ## LOW Level (Use Haiku)

@@ -131,14 +131,14 @@ ______________________________________________________________________

  **FSDP Core**:

- - `areal/utils/fsdp/`
+ - `areal/engine/fsdp_utils/`
  - `areal/engine/fsdp_engine.py`

  **Megatron Core**:

  - `areal/engine/megatron_engine.py`
- - `areal/utils/megatron.py`
- - `areal/utils/megatron_checkpointer.py`
+ - `areal/engine/megatron_utils/megatron.py`
+ - `areal/engine/megatron_utils/checkpointer.py`

  **Trainer Core**:

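The path patterns updated in this diff can be sanity-checked with a small matcher. The following is a minimal sketch, not the project's actual tooling: `RULES`, `LEVEL_ORDER`, `path_matches`, `classify`, and `highest_level` are hypothetical names, the rule list is a hand-copied subset of the updated tables, and reading each pattern as "glob that may match anywhere in the path" is an assumption about how the patterns are meant to be applied.

```python
from fnmatch import fnmatchcase

# Hypothetical subset of the updated tables: (level, change type, path patterns).
RULES = [
    ("CRITICAL", "FSDP_CORE",
     ["areal/engine/fsdp_utils/", "areal/engine/fsdp_engine.py"]),
    ("CRITICAL", "MEGATRON_CORE",
     ["areal/engine/megatron_engine.py", "areal/engine/megatron_utils/"]),
    ("MEDIUM", "CHECKPOINT_RECOVERY",
     ["areal/utils/saver.py", "areal/utils/recover.py",
      "areal/engine/fsdp_utils/checkpoint.py"]),
    ("MEDIUM", "WORKFLOW_ENGINE", ["areal/workflow/", "areal/engine/"]),
]

# Most severe level first; used only for sorting matches.
LEVEL_ORDER = ["CRITICAL", "HIGH", "MEDIUM", "LOW"]


def path_matches(path, pattern):
    """Assumed semantics: the pattern may match anywhere in the path,
    and a trailing '/' means 'anything under this directory'."""
    glob = "*" + pattern + ("*" if pattern.endswith("/") else "")
    return fnmatchcase(path, glob)


def classify(path):
    """Return every (level, change type) whose patterns match, most severe first."""
    hits = [
        (level, change_type)
        for level, change_type, patterns in RULES
        if any(path_matches(path, p) for p in patterns)
    ]
    # sorted() is stable, so ties keep their order from RULES.
    return sorted(hits, key=lambda hit: LEVEL_ORDER.index(hit[0]))


def highest_level(path):
    """Return the most severe matching level, or None if no rule matches."""
    hits = classify(path)
    return hits[0][0] if hits else None
```

Under these assumptions, a file at the new location `areal/engine/fsdp_utils/checkpoint.py` matches `FSDP_CORE`, `CHECKPOINT_RECOVERY`, and `WORKFLOW_ENGINE`, so it is reviewed at the CRITICAL level, while the old location `areal/utils/fsdp/checkpoint.py` no longer matches any of the rules listed here.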