6.1. 样例说明
ModelZoo提供的网络模型样例所展示的功能如下表所示:
注意
AArch64 架构不支持模型量化和编译操作。用户需要使用编译后二进制模型文件(.hmm或.hmms)在AArch64架环境中推理。
网络名称 |
PTQ量化 |
模型编译 |
模型推理 (C++) |
模型推理 (Python) |
精度评测 |
性能评测 |
模型转换与评估工具 |
|---|---|---|---|---|---|---|---|
ResNet50 |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
MobileNetV2 |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
EfficientNet |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
YOLOv3 |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
YOLOP |
✓ |
✓ |
X |
✓ |
X |
✓ |
✓ |
YOLOv5s |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
YOLOv8m |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
YOLO12m |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
YOLOv8m-pose |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
DeepSeek |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
Qwen3 |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
LPRNet |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
YOLOv8m_seg |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
ViT-B-16 |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
YOLOv5s_feature |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
Gemma-4-26B-A4B |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
Qwen2.5-vl |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
PP-OCRv3 |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
BGE |
✓ |
✓ |
X |
✓ |
X |
X |
X |
GTE |
✓ |
✓ |
X |
✓ |
X |
X |
X |
Whisper-Medium |
✓ |
✓ |
✓ |
✓ |
X |
X |
X |
yolov8m-cls |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
yolov5m_face |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
yolov7 |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
yolov9m |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
yolo11m |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
yolov10m |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
yolox |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
yolo26m |
✓ |
✓ |
X |
✓ |
✓ |
✓ |
✓ |
minicpmo |
✓ |
✓ |
X |
✓ |
X |
X |
X |
qwen3-vl |
✓ |
✓ |
✓ |
✓ |
X |
✓ |
X |
qwen3-30b-a3b |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
gpt-oss |
X |
✓ |
X |
✓ |
X |
✓ |
X |
sensevoice |
✓ |
✓ |
X |
✓ |
X |
X |
X |
glm-asr |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
qwen3-asr |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
whisper-turbo |
✓ |
✓ |
X |
✓ |
X |
X |
X |
qwen3-embedding |
✓ |
✓ |
X |
✓ |
X |
X |
X |
qwen2.5 |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
qwen3.5 |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
qwen3.6 |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
CoPaw-Flash |
✓ |
✓ |
X |
✓ |
X |
✓ |
X |
glm-ocr |
✓ |
✓ |
X |
✓ |
X |
X |
X |
cosyvoice3 |
✓ |
✓ |
✓ |
✓ |
X |
✓ |
X |
qwen3-reranker |
✓ |
✓ |
X |
✓ |
X |
X |
X |
ModelZoo只提供上表中的网络模型样例,其他网络模型,暂不提供样例。ResNet50、MobileNetV2、YOLOv5s、YOLOv8m和YOLOP等模型示例通过模型转换与评估工具展示PTQ量化、模型编译、模型推理、精度评测以及性能评测。
6.2. 环境准备
执行下面步骤完成运行环境部署,所有样例默认适配最新版本软件平台:
在Docker镜像中下载应用开发示例包。
登录后摩开发者社区。
在 请先选择板级类别 下拉列表中选择使用的后摩板级产品。
在版本列表中选择下载的版本号,再在 AI模型类别筛选器 、平台架构筛选器 、操作系统筛选器 下拉菜单中分别选择AI模型类型、平台架构和操作系统,找到资源名为示例代码的下载资源,选中该资源左边复选框。
点击 直接下载、wget链接、批量直接下载 或 wget批量下载 按钮。
ModelZoo模型库位于
houmo-examples-xh2/models目录下。检查
houmo-examples-xh2/env.sh中环境变量设置:根据实际情况修改环境变量的值,例如如果更换数据集路径,则修改 HOUMO_DATASETS_PATH 变量。
初始时脚本会自动检测当前是否有可用的后摩设备。如果有,则会自动设置使用该后摩设备评测模型,否则会设置使用模拟器评测模型。用户可通过 HDPL_PLATFORM环境变量修改评测平台。
环境变量详情参看 ModelZoo环境变量列表。
在
houmo-examples-xh2目录下,执行下面指令配置运行环境:source env.sh
如果在AArch64 架构:不支持模型量化和编译操作,用户可以直接使用提供的已编译模型进行推理。
6.2.1. 环境变量
环境变量名称 |
描述 |
默认值 |
|---|---|---|
HOUMO_TARGET |
ModelZoo中模型编译和推理使用的后摩设备。 |
|
HOUMO_DATASETS_PATH |
ModelZoo数据库存放目录。 |
|
HOUMO_MODELZOO_URL |
ModelZoo中模型下载根目录。 |
|
6.3. 模型获取
ModelZoo提供原始模型和量化模型。用户可根据需求对原始模型量化,或直接使用已量化模型做模型推理。
ModelZoo提供的各网络模型的类型及原始模型如下表所示:
网络名称 |
原始模型 |
量化后模型 |
预编译模型 |
原始模型来源 |
|---|---|---|---|---|
ResNet50 |
✓ |
X |
✓ |
|
MobileNetV2 |
✓ |
X |
✓ |
|
EfficientNet |
✓ |
X |
✓ |
|
YOLOP |
✓ |
X |
✓ |
|
YOLOv3 |
✓ |
X |
✓ |
|
YOLOv5s |
✓ |
X |
✓ |
|
YOLOv8m |
✓ |
X |
✓ |
|
YOLO12m |
✓ |
X |
✓ |
|
YOLOv8m-pose |
✓ |
X |
✓ |
|
DeepSeek |
✓ |
X |
✓ |
|
Qwen3-8B |
✓ |
X |
✓ |
|
Qwen3-14B |
✓ |
X |
✓ |
|
Qwen3-0.6B |
✓ |
X |
✓ |
|
Qwen3-1.7B |
✓ |
X |
✓ |
|
LPRNet |
✓ |
X |
✓ |
- |
ViT-B-16 |
✓ |
X |
✓ |
- |
YOLOv8m_seg |
✓ |
X |
✓ |
|
YOLOv5s_feature |
✓ |
X |
✓ |
- |
Gemma-4-26B-A4B |
✓ |
X |
✓ |
|
Qwen2.5-vl |
✓ |
X |
✓ |
|
Qwen3-vl |
✓ |
X |
✓ |
|
GTE |
✓ |
X |
✓ |
- |
Whisper-Medium |
✓ |
X |
✓ |
- |
YOLOv8m-cls |
✓ |
X |
✓ |
|
YOLOv5m_face |
✓ |
X |
✓ |
|
PP-OCRv3 |
✓ |
X |
✓ |
|
BGE |
✓ |
X |
✓ |
|
YOLOv7 |
✓ |
X |
✓ |
|
YOLOv9m |
✓ |
X |
✓ |
|
YOLOv10m |
✓ |
X |
✓ |
|
YOLO11m |
✓ |
X |
✓ |
|
YOLOX |
✓ |
X |
✓ |
|
YOLO26m |
✓ |
X |
✓ |
|
qwen3-30b-a3b |
✓ |
X |
✓ |
|
minicpmo |
✓ |
X |
✓ |
|
gpt-oss |
✓ |
X |
✓ |
- |
sensevoice |
✓ |
X |
✓ |
|
glm-asr |
✓ |
X |
✓ |
|
qwen3-asr |
✓ |
X |
✓ |
|
whisper-turbo |
✓ |
X |
✓ |
|
qwen3-embedding |
✓ |
X |
✓ |
|
qwen2.5 |
✓ |
X |
✓ |
|
qwen3.5-0.8B |
✓ |
X |
✓ |
|
qwen3.5-2B |
✓ |
X |
✓ |
|
qwen3.5-4B |
✓ |
X |
✓ |
|
qwen3.5-9B |
✓ |
X |
✓ |
|
qwen3.5-27B |
✓ |
X |
✓ |
|
qwen3.5-35B-A3B |
✓ |
X |
✓ |
|
Qwen3.6-35B-A3B |
✓ |
X |
✓ |
|
CoPaw-Flash |
✓ |
X |
✓ |
|
glm-ocr |
✓ |
X |
✓ |
|
cosyvoice3 |
✓ |
X |
✓ |
- |
qwen3-reranker |
✓ |
X |
✓ |
环境准备后,在 MODELZOO_PATH/models/network_type/network_name 目录下,运行下面指令下载模型。network_type 为网络模型类型,如 diffusion、llm。network_name 为网络名称,如 qwen3。
python3 get_model.py --type [options] --model_size <size> --source_type modelscope
其中参数取值如下:
options:raw:原始网络模型。hmm:后摩TCIM编译后生成的二进制模型文件(.hmm或.hmms)。
--model_size <size>:(可选)仅部分模型适用,用于指定待下载模型的参数规模。<size>的取值可为2b、4b、9b、27b等,不同模型支持的取值不同,具体以对应示例目录中的README.md说明为准。--source_type modelscope:(可选)从魔塔社区下载模型。当前仅支持下载后摩TCIM编译后生成的二进制模型文件。可参看示例目录中的README.md文件,了解示例是否支持该功能。
模型下载后默认存放在当前目录下。
6.4. 数据集准备
ModelZoo提供少量数据供简单验证,如果需要测试真实精度,需要下载完整数据集。
通过 HOUMO_DATASETS_PATH 环境变量设置数据集实际路径,默认为 MODELZOO_PATH/data/datasets。
网络名称 |
数据集名称 |
路径 |
|---|---|---|
ResNet50 |
ImageNet 2012 |
|
MobileNetV2 |
||
EfficientNet |
||
YOLOv8m-cls |
||
YOLOv3 |
COCO 2017 |
|
YOLOv5s |
||
YOLOv8m |
||
YOLO12m |
||
YOLOv8m-pose |
||
YOLOv5s_feature |
||
YOLOv8m_seg |
||
YOLOv7 |
||
YOLOv9m |
||
YOLOv10m |
||
YOLO11m |
||
YOLOX |
||
YOLO26m |
||
YOLOv5m_face |
WIDER FACE |
|
YOLOP |
BDD100K |
|
DeepSeek |
wikitext-2-raw-v1 |
|
Qwen3 |
||
Qwen2.5 |
||
Qwen3.5 |
||
Qwen3.6 |
||
gpt-oss |
||
qwen3-embedding |
||
qwen3-30b-a3b |
||
Gemma-4-26B-A4B |
- |
- |
Qwen2.5-vl |
- |
- |
Qwen3-vl |
- |
- |
ViT-B-16 |
ILSVRC2012 |
|
LPRNet |
CCPD2019Sub |
|
PP-OCRv |
CCPD2020_PPOCR v3_eval |
|
BGE |
- |
- |
GTE |
- |
- |
Whisper-Medium |
- |
- |
minicpmo |
- |
- |
sensevoice |
- |
- |
glm-asr |
- |
- |
qwen3-asr |
- |
- |
whisper-turbo |
- |
- |
glm-ocr |
- |
- |
cosyvoice3 |
- |
- |
CoPaw-Flash |
- |
- |
qwen3-reranker |
T2Reranking |