Releases: VainF/Torch-Pruning

v1.5.1

01 Dec 08:16

What's Changed

  • Fix bug: moved handling of the case where imp is none before len is c… by @JamesKing76 in #440
  • Fixed some issues in Qwen-2 Pruning. Add examples for Qwen-2.5.

Full Changelog: v1.5.0...v1.5.1

v1.5.0: Add SliceOp; Support Phi-3 & Qwen-2

17 Nov 09:39
2a31ea2

What's Changed

Full Changelog: v1.4.3...v1.5.0

v1.4.3

07 Oct 10:37
52a2b87

What's Changed

  • Fixed some issues in GQA Pruning
  • [fix] Clarify variable naming in linear_scheduler function, add typing by @janumiko in #423

Full Changelog: v1.4.2...v1.4.3

v1.4.2

09 Aug 11:20
06c48e7

What's Changed

Full Changelog: v1.4.1...v1.4.2

v1.4.1

21 Jul 04:19

What's Changed

  • Add Isomorphic Pruning, an improved algorithm for global pruning.
  • Unify local/global/isomorphic pruning with Scope for importance ranking
  • Allow a user-defined scope for importance ranking. For example, the key-value pair (model.layer1, model.layer2): 0.4 restricts global ranking to layer1 and layer2 and prunes them with a ratio of 40%.
    pruner = tp.pruner.MetaPruner(
        ...
        global_pruning=True,
        pruning_ratio=0.5, # default pruning ratio
        pruning_ratio_dict = {(model.layer1, model.layer2): 0.4, model.layer3: 0.2}, 
        # Global pruning will be performed on layer1 and layer2
    )
  • Bug fixes
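
The scoped ranking described in the third item above can be illustrated with a small, library-independent sketch. It is not Torch-Pruning's implementation: the function name scoped_global_prune, the plain score lists, and the "lowest score is pruned first" rule are assumptions made only for illustration. The point it shows is that when several layers share one scope, their channels are ranked in a single pool, so the 40% budget may fall unevenly across the layers.

```python
from typing import Dict, List, Sequence, Tuple


def scoped_global_prune(
    scores: Dict[str, List[float]],
    scope: Sequence[str],
    ratio: float,
) -> Dict[str, List[int]]:
    """Pick channels to prune by ranking all layers in `scope` together.

    Hypothetical helper for illustration only; `scores` maps a layer name
    to one importance score per channel.
    """
    # Pool every (score, layer, channel) triple from the layers in the scope.
    pooled: List[Tuple[float, str, int]] = [
        (score, name, idx)
        for name in scope
        for idx, score in enumerate(scores[name])
    ]
    # The budget applies to the pooled channels, not to each layer separately.
    n_prune = int(len(pooled) * ratio)
    # Assumption: channels with the lowest importance are removed first.
    pooled.sort(key=lambda item: item[0])

    pruned: Dict[str, List[int]] = {name: [] for name in scope}
    for _, name, idx in pooled[:n_prune]:
        pruned[name].append(idx)
    for name in pruned:
        pruned[name].sort()
    return pruned


# Made-up scores: layer2 holds the four least important channels.
example_scores = {
    "layer1": [0.9, 0.8, 0.7, 0.6, 0.5],
    "layer2": [0.1, 0.2, 0.3, 0.4, 0.95],
}
plan = scoped_global_prune(example_scores, ("layer1", "layer2"), 0.4)
print(plan)
```

With these made-up scores, all four pruned channels come from layer2 and layer1 is left untouched, whereas a per-layer (local) ranking at the same ratio would remove two channels from each layer.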

Full Changelog: v1.4.0...v1.4.1

v1.4.0: Improved Support for Huggingface Transformers & LLMs

04 Jun 12:19
b0f0a7c

What's Changed

  • Add support for Grouped Query Attention (GQA) in Huggingface transformers.
  • Include minimal examples for Large Language Models (LLaMA-2 & LLaMA-3).

Full Changelog: v1.3.7...v1.4.0

v1.3.7

21 Feb 09:24
2736215
  • Add more docstrings and comments
  • Minor bug fixes

Full Changelog: v1.3.6...v1.3.7

v1.3.6

29 Dec 14:55
33a4160

What's Changed

  • Fix bug: inversed_idx does not match the original index by @pleb631 in #321
  • Remove ch_groups in tp.importance

Full Changelog: v1.3.5...v1.3.6

v1.3.5: bugfixing

15 Dec 12:59
35de01a

What's Changed

Full Changelog: v1.3.4...v1.3.5

v1.3.4

14 Dec 14:18
1813e7a

What's Changed

  • Fix NaN and Inf bug in sparse learning by @HollyLee2000 in #310
  • Fixed a bug in interactive pruning + iterative pruning + sparse training by @VainF in #311

Full Changelog: v1.3.3...v1.3.4