Mango Tree Pruning Method

ChenhuiZhu41/prune_quant_llm

Compared to magnitude pruning, which removes weights solely based on their magnitudes, our pruning approach, Wanda, removes weights on a per-output basis, by the product of weight magnitudes and input ...
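
The snippet above is cut off after "input", so the second factor of the score is not stated here. As a rough illustration only, the sketch below assumes it is the L2 norm of each input feature over a small calibration batch, which is how the published Wanda method is usually described. All function names and the toy numbers are hypothetical and are not taken from the repository. To isolate the effect of the scoring rule, both criteria are applied per output row here, although magnitude pruning is often applied per layer or globally.

```python
import math

def column_norms(inputs):
    """L2 norm of each input feature across calibration samples (assumed second factor)."""
    n_features = len(inputs[0])
    return [math.sqrt(sum(x[j] ** 2 for x in inputs)) for j in range(n_features)]

def magnitude_scores(weights):
    """Baseline score: the absolute value of each weight."""
    return [[abs(w) for w in row] for row in weights]

def activation_weighted_scores(weights, inputs):
    """Assumed Wanda-style score: |weight| times the norm of its input feature."""
    norms = column_norms(inputs)
    return [[abs(w) * norms[j] for j, w in enumerate(row)] for row in weights]

def prune_per_output(weights, scores, sparsity):
    """Within each output row, zero the `sparsity` fraction of weights with the lowest score."""
    pruned = []
    for w_row, s_row in zip(weights, scores):
        k = int(len(w_row) * sparsity)
        drop = set(sorted(range(len(w_row)), key=lambda j: s_row[j])[:k])
        pruned.append([0.0 if j in drop else w for j, w in enumerate(w_row)])
    return pruned

# Toy layer: 2 outputs x 4 inputs. Input feature 0 carries large activations,
# input feature 1 carries tiny ones.
weights = [[0.1, -2.0, 0.5, 1.0],
           [1.5, 0.2, -0.3, 0.05]]
inputs = [[10.0, 0.01, 1.0, 0.1],
          [12.0, 0.02, 1.0, 0.1]]

by_magnitude = prune_per_output(weights, magnitude_scores(weights), 0.5)
by_product = prune_per_output(weights, activation_weighted_scores(weights, inputs), 0.5)

print(by_magnitude)
print(by_product)
```

In the first row of this toy layer, the magnitude criterion keeps the large weight -2.0 even though its input is nearly always close to zero, while the product criterion keeps the small weight 0.1 because its input is large. In the second row the two criteria happen to agree.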

IEEE

ICP: Immediate Compensation Pruning for Mid-to-high Sparsity

Abstract: The increasing adoption of large-scale models under 7 billion parameters in both language and vision domains enables inference tasks on a single consumer-grade GPU but makes fine-tuning ...
