Do you have a document on measuring NPU performance?

re_roy · December 7, 2023, 4:06am

I'm working with a Khadas Edge2.
Is there any data comparing inference speed between models in other formats (pt, tflite, etc.) and the same models converted to rknn?

Electr1 · December 7, 2023, 5:14am

We have some TFLite benchmark comparisons across CPU, GPU, and NPU (RKNN) for the Edge2.

These are the examples compared.

CPU: Execution time: 0.29 ms
GPU: Execution time: 3.10 ms
NPU: Execution time: 0.64 ms

CPU: Execution time: 0.51 ms
GPU: Execution time: 1.46 ms
NPU: Execution time: 0.67 ms

CPU: Execution time: 44.03 ms
GPU: Execution time: 22.36 ms
NPU: Execution time: 5.10 ms

CPU: Execution time: 60.53 ms
GPU: Execution time: 21.30 ms
NPU: Execution time: 2.92 ms

CPU: Execution time: 206.60 ms
GPU: Execution time: 216.82 ms
NPU: Execution time: 106.84 ms
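
To make the numbers easier to compare, here is a minimal Python sketch that turns them into speedups relative to the CPU. The values are copied from the five groups above. The post does not name the models, so the groups are labelled by position only, and `results`, `speedup_vs_cpu`, and `speedups` are names made up for this illustration.

```python
# Execution times in ms, copied from the five result groups above (in order).
# The groups are unnamed in the post, so they are identified by position only.
results = [
    {"CPU": 0.29, "GPU": 3.10, "NPU": 0.64},
    {"CPU": 0.51, "GPU": 1.46, "NPU": 0.67},
    {"CPU": 44.03, "GPU": 22.36, "NPU": 5.10},
    {"CPU": 60.53, "GPU": 21.30, "NPU": 2.92},
    {"CPU": 206.60, "GPU": 216.82, "NPU": 106.84},
]


def speedup_vs_cpu(times):
    """Return CPU time divided by each backend's time (> 1 means faster than CPU)."""
    return {name: times["CPU"] / t for name, t in times.items()}


speedups = [speedup_vs_cpu(r) for r in results]

for i, s in enumerate(speedups, start=1):
    print(f"group {i}: GPU {s['GPU']:.2f}x, NPU {s['NPU']:.2f}x vs CPU")
```

On these figures the NPU is slower than the CPU for the first two groups, where every run is under a millisecond on CPU and NPU, and roughly 1.9x to 21x faster than the CPU for the other three.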

Regards.

re_roy · December 7, 2023, 7:28am

Thank you!!
You've been a great help.

re_roy · December 7, 2023, 8:02am

Did you apply quantization to the rknn models in this test?

Electr1 · December 7, 2023, 8:19am

@re_roy Except for the MobileNet model, none of the models are quantized.