[Feature] Enhance Hadamard transform #1569

@chensuyue

Status:

  1. support deterministic_hadamard_matrix and deterministic_hadamard_matrix
  2. support inference with Transformers via a Triton MXFP4 kernel
  3. support RTN and AutoRound tuning (iters > 0)
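
For readers new to the idea behind the status items, here is a minimal NumPy sketch of a deterministic Hadamard rotation applied to a linear layer. It uses the Sylvester construction, which is an assumption on my part and may differ from what `deterministic_hadamard_matrix` does internally; `sylvester_hadamard` and `rotate_linear` are hypothetical names used only for illustration.

```python
import numpy as np


def sylvester_hadamard(n: int) -> np.ndarray:
    """Return an n x n orthonormal Hadamard matrix (n must be a power of two).

    Hypothetical helper: Sylvester construction, scaled by 1/sqrt(n).
    """
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = np.ones((1, 1))
    while h.shape[0] < n:
        # H_{2k} = [[H_k, H_k], [H_k, -H_k]]
        h = np.block([[h, h], [h, -h]])
    return h / np.sqrt(n)


def rotate_linear(x: np.ndarray, w: np.ndarray, h: np.ndarray):
    """Rotate activations x (batch, in) and weights w (out, in) by h (in, in)."""
    return x @ h, w @ h


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 64))   # activations
w = rng.standard_normal((8, 64))   # weights of a linear layer
h = sylvester_hadamard(64)

x_rot, w_rot = rotate_linear(x, w, h)
y_ref = x @ w.T           # original layer output
y_rot = x_rot @ w_rot.T   # output computed from rotated tensors

print(np.allclose(h @ h.T, np.eye(64)))
print(np.allclose(y_ref, y_rot))
```

Because the matrix is orthonormal, the rotation cancels in the product, so the layer output is unchanged in full precision. Any benefit comes from quantizing the rotated tensors, whose values tend to be spread more evenly.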

Todo:

  1. align the implementation with SpinQuant ("SpinQuant: LLM Quantization with Learned Rotations"), especially the R2/R3 rotations, and implement structured fused
  2. support online transform for vLLM inference
  3. support NVFP4
  4. verify multi-card support
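
As a rough illustration of the online-transform item: an online transform has to rotate activations at inference time, and a fast Walsh-Hadamard transform does this in O(n log n) instead of a dense O(n^2) matmul. The sketch below is plain NumPy with hypothetical names (`fwht`, `dense_hadamard`); it is not vLLM's or this project's implementation, and a real kernel would run on the GPU.

```python
import numpy as np


def dense_hadamard(n: int) -> np.ndarray:
    """Orthonormal Sylvester Hadamard matrix, used here only as a reference."""
    h = np.ones((1, 1))
    base = np.array([[1.0, 1.0], [1.0, -1.0]])
    while h.shape[0] < n:
        h = np.kron(base, h)
    return h / np.sqrt(n)


def fwht(x: np.ndarray) -> np.ndarray:
    """Fast Walsh-Hadamard transform along the last axis (natural order).

    Equivalent to x @ dense_hadamard(n) for n = x.shape[-1], a power of two.
    """
    x = np.array(x, dtype=np.float64)  # work on a copy
    n = x.shape[-1]
    if n < 1 or n & (n - 1):
        raise ValueError("last dimension must be a power of two")
    lead = x.shape[:-1]
    step = 1
    while step < n:
        # Split into blocks of size 2*step and butterfly the two halves.
        blocks = x.reshape(*lead, n // (2 * step), 2, step)
        a, b = blocks[..., 0, :], blocks[..., 1, :]
        x = np.stack([a + b, a - b], axis=-2).reshape(*lead, n)
        step *= 2
    return x / np.sqrt(n)


acts = np.random.default_rng(1).standard_normal((3, 16))
print(np.allclose(fwht(acts), acts @ dense_hadamard(16)))
```

The dense matrix is built only to check the fast version; in an online setting only the butterfly loop would run per token.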

Originally posted by @lkk12014402 in #1323
