Where to get the Gen AI / LLM documentation and samples for Genio-720 on Yocto Linux

Hello,

I would like to run an LLM on the Genio-720 using a Yocto Linux platform via the Offline Inference Path, as described in the Gen AI Workflow below.

To optimize performance, I want to utilize Offline Inference to directly access and operate the NPU via the Neuro runtime, not the ONNX runtime and TFLite interpreter.

However, I have not been able to find any documentation or sample code specific to Yocto Linux. Could you please point me to any guides or tutorials for deploying Gen AI workflows on Yocto Linux?

Thank you in advance for your help.

Hi Jim,

Thanks for reaching out!
Please note that Gen AI support on Yocto Linux is expected to be available in Q2 2026.

For more details, you can refer to our AI Supporting Scope.