Environment:
- Platform: Genio-520
- Toolkit: GAI-Deployment-Toolkit-v2.0.8_qwen2.5-0.5b-1.5b-7b-v0.1
- Neuron SDK: 20250423_Neuron_SDK_v1.2517.03_neuron-8.0-release.tar.gz
Hi MediaTek team and community,
With Qwen 2.5, I’m still encountering the issue: “No .dla file generated when running compile_generative.sh”, even after applying the adjustments you suggested.
BACKEND=“mdla5.3,edma3.6”
L1_SIZE_KB=“256”
NUM_MDLA=“1”
Here are the logs/images when running compile_generative.sh:

After running compile_generative.sh, it does not return any logs or generate any .dla file. This issue only occurs with compile_generative.sh, while compile_prompt_qwen2.5_0.5B_7B.sh still successfully generates .dla files and produces some log output such as:
(py38_test) bkav@bkav-Super-Server:~/Downloads/GAI_Toolkits/GAI-Deployment-Toolkit-v2.0.8_qwen2.5-0.5b-1.5b-7b-v0.1/compile$ ./compile_prompt_qwen2.5_0.5B_7B.sh
/home/bkav/Downloads/GAI_Toolkits/GAI-Deployment-Toolkit-v2.0.8_qwen2.5-0.5b-1.5b-7b-v0.1/post_training_quantize/tflite/Qwen2.5-7B-Instruct_asym4W_sym16A_Overall_hessian_wgt_opt_cum_layer_error_rotate_ortho_0_128t2048c/Qwen2.5-7B-Instruct_asym4W_sym16A_Overall_hessian_wgt_opt_cum_layer_error_rotate_ortho_0_7layer_128t2048c_2.tflite
/home/bkav/Downloads/SDK/Neuropilot_SDK/20250423_Neuron_SDK_v1.2517.03_neuron-8.0-release/neuron_sdk
WARNING: SMP is skipped since all backend targets are single-core or unknown.
WARNING: SMP is skipped since all backend targets are single-core or unknown.
WARNING: SMP is skipped since all backend targets are single-core or unknown.
WARNING: SMP is skipped since all backend targets are single-core or unknown.
WARNING: SMP is skipped since all backend targets are single-core or unknown.
Patch done!
WARNING: 16A4W FC signed input activation but not has 128 offset may cause the input value of sw workaround saturated.
WARNING: 16A4W FC signed input activation but not has 128 offset may cause the input value of sw workaround saturated.
WARNING: 16A4W FC signed input activation but not has 128 offset may cause the input value of sw workaround saturated.
WARNING: 16A4W FC signed input activation but not has 128 offset may cause the input value of sw workaround saturated.
DRAM Usage:
Target Input Output Temp Static Code Total
[ 0]: MDLA 5.3 29M 2.6M 20M 781M 765K 834M
Total Memory:
DRAM: 834M + 0 (Shared) = 834M
L1: 256K
Thank you very much for your time and support.
Best regards,
dmd955