Commit e5022ce
committed
support w4a8(Decode)/C8/C8+TP4EP4/PD disaggregation + compatibility fixes
Squashed from 6 feature commits + 2 compatibility fix commits:
- support w4a8(Decode)
- support C8 KV cache quantization
- support C8+TP4EP4
- bugfix C8
- bugfix pd+C8
- bugfix pd+mtp
- fix: make weight_need_transpose conditional and remove hardcoded layer_id
- fix: comprehensive compatibility fixes (Iluvatar platform, moe cast bug,
mutable default, hardcoded magic number, unconditional XPU import, etc.)1 parent f02b138 commit e5022ce
10 files changed
Lines changed: 239 additions & 28 deletions
File tree
- fastdeploy
- engine/sched
- model_executor
- layers
- backends/xpu
- moe
- quantization
- moe
- quantization
- models
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
235 | 235 | | |
236 | 236 | | |
237 | 237 | | |
| 238 | + | |
| 239 | + | |
238 | 240 | | |
239 | | - | |
240 | | - | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
| 245 | + | |
241 | 246 | | |
242 | 247 | | |
243 | 248 | | |
| |||
800 | 805 | | |
801 | 806 | | |
802 | 807 | | |
| 808 | + | |
| 809 | + | |
| 810 | + | |
| 811 | + | |
| 812 | + | |
| 813 | + | |
803 | 814 | | |
804 | | - | |
| 815 | + | |
805 | 816 | | |
806 | 817 | | |
807 | 818 | | |
| |||
911 | 922 | | |
912 | 923 | | |
913 | 924 | | |
| 925 | + | |
| 926 | + | |
| 927 | + | |
| 928 | + | |
| 929 | + | |
| 930 | + | |
914 | 931 | | |
915 | 932 | | |
916 | 933 | | |
| |||
920 | 937 | | |
921 | 938 | | |
922 | 939 | | |
| 940 | + | |
| 941 | + | |
| 942 | + | |
| 943 | + | |
| 944 | + | |
923 | 945 | | |
924 | 946 | | |
925 | 947 | | |
| |||
1403 | 1425 | | |
1404 | 1426 | | |
1405 | 1427 | | |
1406 | | - | |
| 1428 | + | |
1407 | 1429 | | |
1408 | | - | |
| 1430 | + | |
| 1431 | + | |
1409 | 1432 | | |
1410 | 1433 | | |
1411 | 1434 | | |
| |||
1416 | 1439 | | |
1417 | 1440 | | |
1418 | 1441 | | |
1419 | | - | |
1420 | | - | |
1421 | | - | |
| 1442 | + | |
| 1443 | + | |
| 1444 | + | |
| 1445 | + | |
| 1446 | + | |
| 1447 | + | |
1422 | 1448 | | |
1423 | | - | |
| 1449 | + | |
1424 | 1450 | | |
1425 | 1451 | | |
1426 | 1452 | | |
| |||
1470 | 1496 | | |
1471 | 1497 | | |
1472 | 1498 | | |
| 1499 | + | |
| 1500 | + | |
| 1501 | + | |
| 1502 | + | |
| 1503 | + | |
| 1504 | + | |
1473 | 1505 | | |
1474 | 1506 | | |
1475 | 1507 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
210 | 210 | | |
211 | 211 | | |
212 | 212 | | |
| 213 | + | |
| 214 | + | |
213 | 215 | | |
214 | 216 | | |
215 | 217 | | |
| |||
Lines changed: 4 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
181 | 181 | | |
182 | 182 | | |
183 | 183 | | |
184 | | - | |
185 | | - | |
| 184 | + | |
| 185 | + | |
186 | 186 | | |
187 | 187 | | |
188 | 188 | | |
| |||
220 | 220 | | |
221 | 221 | | |
222 | 222 | | |
223 | | - | |
224 | | - | |
| 223 | + | |
| 224 | + | |
225 | 225 | | |
226 | 226 | | |
227 | 227 | | |
| |||
Lines changed: 52 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
268 | 268 | | |
269 | 269 | | |
270 | 270 | | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
271 | 279 | | |
272 | 280 | | |
273 | 281 | | |
| |||
277 | 285 | | |
278 | 286 | | |
279 | 287 | | |
| 288 | + | |
| 289 | + | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
| 293 | + | |
| 294 | + | |
| 295 | + | |
| 296 | + | |
| 297 | + | |
| 298 | + | |
| 299 | + | |
| 300 | + | |
| 301 | + | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
| 307 | + | |
| 308 | + | |
| 309 | + | |
| 310 | + | |
| 311 | + | |
| 312 | + | |
280 | 313 | | |
281 | 314 | | |
282 | 315 | | |
| |||
289 | 322 | | |
290 | 323 | | |
291 | 324 | | |
| 325 | + | |
| 326 | + | |
| 327 | + | |
| 328 | + | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
| 334 | + | |
| 335 | + | |
| 336 | + | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
292 | 344 | | |
293 | 345 | | |
294 | 346 | | |
| |||
Lines changed: 76 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
| 22 | + | |
22 | 23 | | |
23 | 24 | | |
24 | 25 | | |
| |||
42 | 43 | | |
43 | 44 | | |
44 | 45 | | |
| 46 | + | |
45 | 47 | | |
46 | 48 | | |
47 | 49 | | |
| |||
139 | 141 | | |
140 | 142 | | |
141 | 143 | | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
142 | 200 | | |
143 | 201 | | |
144 | 202 | | |
| |||
154 | 212 | | |
155 | 213 | | |
156 | 214 | | |
157 | | - | |
| 215 | + | |
158 | 216 | | |
159 | 217 | | |
160 | 218 | | |
161 | 219 | | |
162 | 220 | | |
163 | | - | |
| 221 | + | |
164 | 222 | | |
165 | 223 | | |
166 | 224 | | |
| |||
189 | 247 | | |
190 | 248 | | |
191 | 249 | | |
192 | | - | |
| 250 | + | |
193 | 251 | | |
194 | 252 | | |
195 | 253 | | |
196 | 254 | | |
197 | 255 | | |
198 | | - | |
| 256 | + | |
199 | 257 | | |
200 | 258 | | |
201 | 259 | | |
| |||
219 | 277 | | |
220 | 278 | | |
221 | 279 | | |
222 | | - | |
223 | | - | |
224 | | - | |
225 | | - | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
| 286 | + | |
| 287 | + | |
| 288 | + | |
| 289 | + | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
| 293 | + | |
226 | 294 | | |
227 | 295 | | |
228 | 296 | | |
| |||
0 commit comments