
Add The Pertensor Quant And Fix some BUGs #458

Open
Michael20070814 wants to merge 8 commits into ModelTC:main from Michael20070814:main

Conversation

@Michael20070814
Contributor

No description provided.

type: model_type
path: model path
type: Qwen3
path: /home/michael/Project/models/Qwen3-0.6B
Contributor


Do not use hardcoded paths and models; revert to the original content.

download: False
path: calib data path
path: /home/michael/Project/calib/pileval
n_sample: 128
Contributor


Use n_samples here.

Contributor Author


If n_sample is not added here, the following error occurs:
[rank0]: AttributeError: 'EasyDict' object has no attribute 'n_sample'. Did you mean: 'n_samples'?
It seems a spelling mistake crept in somewhere. Let me check it!
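The error above is the usual symptom of a config-key typo with attribute-style dicts: the config was written with one spelling while the code reads another. A minimal stand-in for EasyDict (used here so the sketch needs no third-party package) reproduces the behavior:

```python
# Minimal stand-in for EasyDict: dict keys also become attributes,
# so reading a misspelled key surfaces as an AttributeError.
class AttrDict(dict):
    def __init__(self, d=None):
        super().__init__()
        for k, v in (d or {}).items():
            self[k] = AttrDict(v) if isinstance(v, dict) else v
            setattr(self, k, self[k])

cfg = AttrDict({'calib': {'n_samples': 128}})

print(cfg.calib.n_samples)   # the key that actually exists
try:
    cfg.calib.n_sample       # misspelled: singular vs. plural
except AttributeError as e:
    print('AttributeError:', e)
```

Fixing the spelling on one side (config or code) removes the need to define both n_sample and n_samples.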

seed: *seed
eval:
eval_pos: [transformed, fake_quant, fake_quant_wo_kv] #long_ppl eval not support pretrain eval pos
eval_pos: [] #long_ppl eval not support pretrain eval pos
Contributor


Use the original content.

act_static_cfg.update(self.config.calib.bs)
# The KV cache constructor expects num_samples / bsz;
# map the calib config field names onto the parameter names it actually needs.
act_static_cfg['num_samples'] = self.config.calib.n_sample
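The mapping between calib-config field names and constructor parameter names can be sketched as a small helper; the field and parameter names here mirror the snippet above, but the function itself is illustrative, not code from the PR:

```python
# Hypothetical sketch: translate calib-config field names (n_samples, bs)
# into the parameter names a KV-cache constructor expects (num_samples, bsz).
def build_act_static_cfg(calib_cfg):
    return {
        'num_samples': calib_cfg['n_samples'],
        'bsz': calib_cfg['bs'],
    }

act_static_cfg = build_act_static_cfg({'n_samples': 128, 'bs': 8})
```

Centralizing the rename in one place avoids scattering `n_sample`/`n_samples` lookups through the calibration code.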
Contributor


Use self.config.calib.n_samples.

Contributor Author


That's the problem that forces me to set n_sample.


# Export calibration results following LightLLM's kv_cache_calib.json structure;
# only the per_tensor / per_head KV formats LightLLM supports are handled for now.
def collect_calib_json(self):
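A utils-level export helper along these lines would keep the LightLLM-specific logic out of the quantizer class. The function name, field names, and file layout below are assumptions for illustration, not LightLLM's actual kv_cache_calib.json schema:

```python
import json
import os
import tempfile

# Hypothetical sketch of a standalone export helper. Field names and
# layout are illustrative, not the real kv_cache_calib.json schema.
def export_kv_cache_calib(scales, quant_type, path):
    if quant_type not in ('per_tensor', 'per_head'):
        raise ValueError(f'unsupported KV quant type: {quant_type}')
    payload = {'quant_type': quant_type, 'scales': scales}
    with open(path, 'w') as f:
        json.dump(payload, f, indent=2)

# Usage: write per-tensor scales to a temporary file.
out = os.path.join(tempfile.mkdtemp(), 'kv_cache_calib.json')
export_kv_cache_calib({'model.layers.0.self_attn': 0.02}, 'per_tensor', out)
```

Rejecting unknown quant types early keeps the exporter honest about which formats the downstream consumer has actually wired up.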
Contributor


The process of exporting the calibration JSON file is tightly coupled to LightLLM. It would be better to move this part to the utils folder.

Contributor Author


Yes, I will make it more modular.

llmc/__main__.py Outdated
# Dynamically instantiate the model
model = MODEL_REGISTRY[config.model.type](config)

# Print the model and tokenizer
Contributor


Remove this type of comment, and avoid using Chinese.

llmc/__main__.py Outdated
eval_model(model, blockwise_opts, eval_list, eval_pos='transformed')
# Only rank 0 proceeds with saving and exporting
if int(os.environ['RANK']) == 0:
if 'save' in config and config.save.get('save_calib_json', False):
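The rank-0 guard above assumes a torchrun-style launch where each process gets a RANK environment variable. A small sketch of that pattern (the helper name is illustrative; falling back to 0 when RANK is unset keeps single-process runs working):

```python
import os

# Sketch of a rank-0-only guard for multi-process launches.
# torchrun and similar launchers set RANK per process; default
# to 0 so single-process runs without RANK still save/export.
def is_rank_zero():
    return int(os.environ.get('RANK', 0)) == 0

if is_rank_zero():
    pass  # perform saving and exporting here
```

Guarding only the save/export path (rather than returning early) lets all ranks still participate in any collective calls that follow.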
Contributor


save_calib_json is not a clear config option; modify it after discussion.

Contributor Author


Renamed the config to make the export target explicit:
save_calib_json -> save_lightllm_kv_cache_calib
calib_json_name -> lightllm_kv_cache_calib_name

This export is specifically for LightLLM KV cache calibration, so the new naming is more precise.
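With the rename, the save section of the config would look roughly like this (the file-name value is illustrative):

```yaml
save:
  save_lightllm_kv_cache_calib: True
  lightllm_kv_cache_calib_name: kv_cache_calib.json
```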

