Skip to content

Commit 393491f

Browse files
committed
adapt glm quant
Signed-off-by: whx-sjtu <[email protected]>
1 parent 2d88586 commit 393491f

File tree

1 file changed

+14
-1
lines changed

1 file changed

+14
-1
lines changed

vllm_ascend/quantization/quant_config.py

Lines changed: 14 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -210,7 +210,20 @@ def get_scaled_act_names(self) -> List[str]:
210210
"gate_proj",
211211
"up_proj",
212212
],
213-
}
213+
},
214+
"glm4_moe": {
215+
"qkv_proj": [
216+
"q_proj",
217+
"k_proj",
218+
"v_proj",
219+
],
220+
"gate_up_proj": [
221+
"gate_proj",
222+
"up_proj",
223+
],
224+
"experts":
225+
["experts.0.gate_proj", "experts.0.up_proj", "experts.0.down_proj"]
226+
},
214227
}
215228

216229

0 commit comments

Comments
 (0)