MT8893 NPU Gen7 — what is the MDLA version? Is GAI-Deployment-Toolkit-v2.0.x compatible? What backend string should I use in compile_generative.sh? Is LiteRT NeuroPilot Accelerator the recommended path for Qwen2.5 on MT8893?
For the MT8893 NPU Gen7 chip, which GAI Tools should I use and what are the specific configuration files needed to convert a model to run on the MT8893 NPU Gen7?
Quick note — v2.0.x is not the recommended path for this use case.
3. Regarding LiteRT NeuroPilot Accelerator
LiteRT is a framework maintained by Google. On mobile platforms, it does provide a path down to the NPU backend; however, the integration work (wiring LiteRT to the NPU backend) needs to be handled on your end.
As an alternative, if you prefer a more turnkey solution that works consistently across platforms, we recommend going through Neuron Runtime / Shim API, which supports NPU acceleration on all our platforms out of the box. For Qwen2.5 on MT8893, the GAI-Deployment-Toolkit flow (built on top of Neuron Runtime) is the officially recommended path.
It walks through the complete end-to-end flow — model conversion, quantization, compilation, and on-device deployment — which should cover most of what you need.