Skip to content

Releases: silveroxides/convert_to_quant

v1.2.8

09 Jun 18:09
b5f815d

Choose a tag to compare

What's Changed

Full Changelog: v1.2.7...v1.2.8

v1.2.7

07 Jun 20:15
644e1d4

Choose a tag to compare

What's Changed

Full Changelog: v1.2.6...v1.2.7

v1.2.6

27 May 01:56

Choose a tag to compare

Full Changelog: v1.2.5...v1.2.6

v1.2.5

26 May 15:58
37d0eb2

Choose a tag to compare

What's Changed

  • Add quantize function as a callable function via imported module by @silveroxides in #40

Full Changelog: v1.2.4...v1.2.5

v1.2.4

22 May 18:06

Choose a tag to compare

Full Changelog: v1.2.3...v1.2.4

v1.2.3

18 May 19:38
349f778

Choose a tag to compare

What's Changed

  • Fix issue with learned rounding not calculating correctly for int8 ro… by @silveroxides in #39

Full Changelog: v1.2.2...v1.2.3

v1.2.2

16 May 06:31
df1ce85

Choose a tag to compare

What's Changed

Full Changelog: v1.2.1...v1.2.2

v1.2.1

13 May 20:28
a626769

Choose a tag to compare

What's Changed

  • Add support for row-wise int8 quantization using fused triton kernel by @silveroxides in #36

Full Changelog: v1.2.0...v1.2.1

v1.2.0

26 Apr 08:50
dffb602

Choose a tag to compare

What's Changed

Full Changelog: v1.1.5...v1.2.0

v1.1.5

03 Apr 23:43
ed2b577

Choose a tag to compare

What's Changed

  • add generic text encoder flag to trigger input scales addition by @silveroxides in #30

Full Changelog: v1.1.4...v1.1.5