add channel wise quantization option for QDQ, and opt for intel NPU by bopeng1234 · Pull Request #669 · intel/onnxruntime

@bopeng1234 mentioned this pull request

Apr 22, 2025

@bopeng1234

@bopeng1234

ankitm3k

ankitm3k pushed a commit that referenced this pull request

Jul 2, 2025
…669)

* add channel wise quantization option for QDQ, it optimize for intel NPU

* add channel_wised_quantize args to MatMulNBitsQuantizer