Feature Description
Currently, the export path doesn't support the llm_compressor (compressed-tensors) format when using the INT4 (W4A16) quantization scheme.
Motivation and Use Case
It would be good to be able to export to the llm_compressor format with the INT4 (W4A16) scheme. This already works when going through llm_compressor with AutoRoundModifier, but using the autoround library directly with the W4A16 scheme and the llm_compressor packing format does not work.
Alternatives Considered
No response
Definition of Done
No response
Additional Context
No response