Commit f0a3242
support w4a8(Decode)/C8/C8+TP4EP4/PD disaggregation + compatibility fixes
Squashed from 6 feature commits + 2 compatibility fix commits:
- support w4a8(Decode)
- support C8 KV cache quantization
- support C8+TP4EP4
- bugfix C8
- bugfix pd+C8
- bugfix pd+mtp
- fix: make weight_need_transpose conditional and remove hardcoded layer_id
- fix: comprehensive compatibility fixes (Iluvatar platform, moe cast bug,
  mutable default, hardcoded magic number, unconditional XPU import, etc.)
1 parent f02b138 commit f0a3242
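Two of the compatibility fixes named above, the mutable default and the unconditional XPU import, follow well-known Python patterns. The diff bodies are not reproduced here, so the following is only a generic sketch of those patterns, not the code from this commit; `append_layer`, `load_optional_backend`, and the module names are hypothetical.

```python
import importlib


def append_layer(layer_id, seen=None):
    """Append layer_id to a list, creating a fresh list when none is passed."""
    # Writing `seen=[]` in the signature would build one list at definition
    # time and share it across every call; the None sentinel avoids that.
    if seen is None:
        seen = []
    seen.append(layer_id)
    return seen


def load_optional_backend(module_name):
    """Import a platform-specific module, returning None when it is absent."""
    # Importing inside try/except keeps platforms without the backend from
    # failing at import time (hypothetical stand-in for an XPU-only module).
    try:
        return importlib.import_module(module_name)
    except ImportError:
        return None
```

With the sentinel in place, two successive calls such as `append_layer(0)` and `append_layer(1)` each return a one-element list instead of accumulating into a shared one, and callers of `load_optional_backend` can branch on `None` rather than crashing on platforms without the backend.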
9 files changed
Lines changed: 210 additions & 27 deletions
File tree
- fastdeploy
  - engine/sched
  - model_executor
    - layers
      - backends/xpu
        - moe
        - quantization
      - moe
      - quantization
    - models
Lines changed: 4 additions & 4 deletions
Lines changed: 41 additions & 0 deletions
Lines changed: 65 additions & 9 deletions