PPOCR detection model conversion error

Which system do you use? Android, Ubuntu, OOWOW or others?

Ubuntu

Which version of system do you use? Please provide the version of the system here:

Ubuntu 22.04

Please describe your issue below:

I am converting a PPOCR detection model.
I’m trying to convert it for use from C code, based on a sample that converts a model for use in Python.

When I ran detection with the model here, the results were correct.

I also downloaded the multilingual detection model v3 and confirmed that it works fine on Windows.

However, when I converted it from ONNX to ADLA the same way as the other models, the resulting feature map has a mean that converges to zero.
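For reference, this is roughly how the feature-map statistics can be checked on the ONNX side. A minimal sketch, assuming onnxruntime is installed; the model filename, the input name "x", and the 736×736 input size are taken from the conversion settings and may need adjusting:

```python
# Minimal sketch: inspect the detection feature-map statistics of the ONNX model.
# Assumes onnxruntime and numpy are installed; the model path, input name "x",
# and 736x736 input size are assumptions based on this thread.
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("mul_ppocr_det.onnx")
dummy = np.random.rand(1, 3, 736, 736).astype(np.float32)
outs = sess.run(None, {"x": dummy})
out = outs[0]

# A healthy probability map should not collapse to zero everywhere.
print("mean:", out.mean(), "min:", out.min(), "max:", out.max())
```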

How can I fix this problem?

Post a console log of your issue below:



Hello @GHdevlog ,

Could you provide your model?

paddle to onnx

[screenshot of the paddle-to-onnx conversion]

onnx to adla

[screenshot of the onnx-to-adla conversion]

I’m using a VIM4 with the system below:

Linux Khadas 5.15.137 #1.7.3 SMP PREEMPT Fri Nov 29 09:53:55 UTC 2024 aarch64 aarch64 aarch64 GNU/Linux

Multilingual_PP-OCRv3_det_infer.zip (2.1 MB)

Hello @GHdevlog ,

I have received your model. The problem may be in our conversion tool. Our engineers are looking into the issue now.

@Louis-Cheng-Liu

Have you made any progress in resolving the issue so far?

Is there anything else I can check in the conversion or execution process?

Hello @GHdevlog ,

Sorry for the late reply. The problem is that the ADLA model loses too much precision. Add a parameter to quantize the model per channel.

--model-name mul_ppocr_det 
--model-type onnx 
--model ./mul_ppocr_det.onnx 
--inputs "x" 
--input-shapes "3,736,736" 
--dtypes "float32" 
--quantize-dtype int8 
--outdir onnx_output 
--channel-mean-value "123.675,116.28,103.53,57.375" 
--source-file ocr_det_dataset.txt 
--iterations 500 
--batch-size 1 
--kboard VIM4 
--inference-input-type "float32" 
--inference-output-type "float32" 
--disable-per-channel False 
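The --source-file list points the quantizer at calibration data. As a rough sketch of generating it, assuming the file takes one image path per line (please check the convert-tool documentation for the exact format) and using a made-up calib_images directory:

```python
# Minimal sketch: build a calibration-image list for quantization.
# ASSUMPTION: the tool expects one image path per line in the .txt file;
# the directory name "calib_images" is invented for this example.
from pathlib import Path

paths = sorted(Path("calib_images").glob("*.jpg"))
with open("ocr_det_dataset.txt", "w") as f:
    for p in paths:
        f.write(f"{p}\n")
print(f"wrote {len(paths)} calibration image paths")
```

Note also that --channel-mean-value holds three per-channel means plus a single shared scale, so inputs are normalized as (pixel - mean) / scale, which matches PP-OCR's ImageNet-style preprocessing.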

Thank you for your help.

I was wondering how the --disable-per-channel False option specifically affects the conversion.

Can you explain how this option changes the model so dramatically?

Hello @GHdevlog ,

Each layer has many filters. Per-layer quantization quantizes all the filters together, so they share scaling factors and zero points. Per-channel quantization quantizes each filter individually, so each filter gets its own scaling factor and zero point. That is why the per-channel quantized model has higher accuracy.
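As a generic illustration (plain NumPy, not our convert tool), quantizing the same weight tensor with one shared scale versus one scale per filter shows the accuracy difference, especially when filter magnitudes vary a lot:

```python
# Generic illustration of per-layer vs per-channel symmetric int8 quantization.
import numpy as np

rng = np.random.default_rng(0)
# Fake conv weights: 8 filters with very different magnitudes, which is
# exactly the situation where per-layer quantization loses precision.
w = rng.normal(size=(8, 3, 3, 3)) * rng.uniform(0.01, 2.0, size=(8, 1, 1, 1))

def quant_dequant(x, scale):
    # Round to int8 range, then map back to float with the same scale.
    q = np.clip(np.round(x / scale), -127, 127)
    return q * scale

# Per-layer: one scale shared by every filter.
s_layer = np.abs(w).max() / 127
err_layer = np.abs(w - quant_dequant(w, s_layer)).mean()

# Per-channel: one scale per filter (axis 0).
s_chan = np.abs(w).max(axis=(1, 2, 3), keepdims=True) / 127
err_chan = np.abs(w - quant_dequant(w, s_chan)).mean()

print(f"per-layer error:   {err_layer:.6f}")
print(f"per-channel error: {err_chan:.6f}")  # noticeably smaller
```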