add channel wise quantization option for QDQ, and opt for intel NPU by bopeng1234 · Pull Request #669 · intel/onnxruntime
ankitm3k pushed a commit that referenced this pull request
Jul 2, 2025
…669)
* add channel wise quantization option for QDQ, it optimize for intel NPU
* add channel_wised_quantize args to MatMulNBitsQuantizer
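
For readers unfamiliar with the term, channel-wise quantization generally means computing one scale per output channel of a weight matrix rather than one scale per fixed-size block. The NumPy sketch below illustrates that idea for symmetric 4-bit weights. It is not the code from this pull request: the function names, the `(K, N)` weight layout, and the symmetric scheme are assumptions made for illustration only.

```python
import numpy as np


def quantize_channel_wise(weight: np.ndarray, bits: int = 4):
    """Symmetric quantization with one scale per column (output channel).

    Hypothetical helper for illustration; assumes `weight` has shape (K, N)
    and that each of the N columns is one output channel.
    """
    qmax = 2 ** (bits - 1) - 1      # 7 for 4 bits
    qmin = -(2 ** (bits - 1))       # -8 for 4 bits
    # Largest magnitude in each column, kept as shape (1, N) for broadcasting.
    max_abs = np.max(np.abs(weight), axis=0, keepdims=True)
    # Avoid division by zero for all-zero columns.
    scale = np.where(max_abs == 0, 1.0, max_abs / qmax)
    q = np.clip(np.round(weight / scale), qmin, qmax).astype(np.int8)
    return q, scale


def dequantize(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    """Map quantized integers back to floats using the per-column scales."""
    return q.astype(np.float32) * scale


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.standard_normal((8, 3)).astype(np.float32)
    q, scale = quantize_channel_wise(w)
    print("scales per channel:", scale.ravel())
    print("max abs error:", np.max(np.abs(dequantize(q, scale) - w)))
```

With a single scale per column, the scale tensor has `N` entries regardless of `K`, whereas a block-wise scheme stores one scale for every block of rows in each column.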