Ksnn convert parameters documentation

johndoe · December 15, 2021, 5:13am

The khadas doc mentions the meaning of certain parameters but the significance of the below mentioned arguments are still not clear

  --batch-size BATCH_SIZE
  --iterations ITERATIONS
  --device DEVICE
  --hybrid HYBRID

What do batch_size and iterations denote? And how can they be initialised?
What does device argument mean? Does it mention the device (CPU/GPU) where the conversion will run or the device in the khadas board where the inference will run
I’m getting the following error on using either of the two options

What does hybrid quantisation mean? Are we allowed to select the qtypes to be considered during conversion or does the script select the best performing ones (based on the board’s architecture)?

Frank · December 15, 2021, 5:43am

@johndoe These parameters are used when mixing, but now KSNN does not support mixing, so you can ignore these parameters. No matter which platform you use, you don’t need to consider these parameters for the time being.

johndoe · December 15, 2021, 6:12am

@Frank On a similar note, would you recommend using the C/C++ scripts (NPU SDK) to convert rather than relying on ksnn?

Frank · December 15, 2021, 6:26am

@johndoe If you do not plan to use python but use C/C++, then you should directly use the NPU SDK

johndoe · December 15, 2021, 6:37am

@Frank I’m benchmarking variety of networks on different quantisation techniques. Will NPU SDK allow more choices?

Frank · December 15, 2021, 7:11am

@johndoe No, the quantitative options for the two are the same. Unless you use mixed quantization, but I do not recommend using mixed quantization, the data will appear abnormal jitter