尧图网站建设 尧图网络
  • 首页
  • 关于我们
  • 服务项目
  • 案例展示
  • 建站流程
  • 资讯中心
  • 联系我们
首页/资讯中心/详情

CANN/mat-chem-sim-pred IPDT批量滚动评分基准测试

CANN/mat-chem-sim-pred IPDT批量滚动评分基准测试
📅 发布时间:2026/7/4 8:06:53

PidIpdtBatchRolloutScore Benchmark Report

【免费下载链接】mat-chem-sim-pred面向工业领域,聚焦计算仿真、预测两大核心场景,构建面向流程工业"机理+数据"双轮驱动的领域计算层,推动AI for Science在材料化学领域的深度应用。项目地址: https://gitcode.com/cann/mat-chem-sim-pred

This document records the measured CPU/NPU behavior ofPidIpdtBatchRolloutScore.

Environment

  • NPU host:node202
  • Device:Ascend910B3, device id0
  • CANN:/usr/local/Ascend/ascend-toolkit/latest
  • CPU baseline: benchmark program multi-thread mode
  • Build:-DCMAKE_BUILD_TYPE=Release -DSOC_VERSION=Ascend910B3 -DRUN_MODE=npu

Method

Thebenchmark_pid_ipdt_batch_rollout_score_aclnnprogram builds an in-process multi-thread CPU reference (ComputeRange, the same integrator recurrencey[k+1] = y[k] + b*u[k-delay]), runs the NPU operator on the same inputs and reportsmax_abs_err,max_quality_rel_errandbest_idx_diff_count. The pass conditions arenpu_zero_score_count == 0, per-candidate scores matching the CPU reference to float32 precision, and anybest_idxdifferences being near-ties (the chosen candidate's metric rel-err stays small), matching the behavior of the verified FOPDT operator.

Correctness

The IPDT kernel differs from the verified FOPDT kernel only in the state recurrence (thea*ydecay term is dropped). The candidate-axis SIMD width does not change the numerics (each tile is independent), so the accuracy profile matches FOPDT: NPU output equals the CPU reference within float32 rounding.

Measured onnode202 / Ascend910B3, B=128, sim_steps=1024, candidate_tile=C,npu_zero_score_count=0:

candidatesmax_abs_errmax_quality_rel_errbest_idx_diff_count
10242.4e-41.5e-60
40961.01.69e-31
163841.5e-33.3e-51

Themax_abs_err=1at 4096 is the discrete settling-time metric crossing the settle band one sample later on NPU than on CPU for a single near-tie loop (dt=1-> abs diff 1); the corresponding metric rel-err stays< 2e-3. The reference FOPDT operator shows the same behavior at this candidate count (max_abs_err=1, max_quality_rel_err=4.5e-3, best_idx_diff_count=1), so IPDT is within the accepted baseline.

Measured timing

node202 / Ascend910B3, B=128, sim_steps=1024, candidate_tile=C, CPU = 64-thread parallel reference.

candidatesCPU parallel msNPU kernel msNPU kernel vs CPU
102432.57.454.36x
4096122.124.74.95x
16384426.693.84.55x

Against a 192-thread CPU reference the speedup is 3.8-4.0x (the wider CPU pool narrows the gap).

Notes

The kernel reuses the FOPDT wide-lane (kLane=768) and fused inner-loop optimizations unchanged; the only algorithmic difference is the integrator recurrence, which removes one vector multiply per timestep.

【免费下载链接】mat-chem-sim-pred面向工业领域,聚焦计算仿真、预测两大核心场景,构建面向流程工业"机理+数据"双轮驱动的领域计算层,推动AI for Science在材料化学领域的深度应用。项目地址: https://gitcode.com/cann/mat-chem-sim-pred

创作声明:本文部分内容由AI辅助生成(AIGC),仅供参考

相关新闻

  • GB28181视频平台性能瓶颈深度解构:WVP-Pro高并发架构演进与优化策略
  • LunarBar macOS农历插件完整攻略:传统节日的数字守护者
  • 如何快速使用BIThesis:北京理工大学论文写作的终极解决方案

最新新闻

  • 告别歌词烦恼:163MusicLyrics一站式音乐歌词批量获取工具
  • 直流无刷电机双闭环自抗扰控制方案详解
  • 如何快速使用LTX2.3-ICEdit-Insight:3步搞定AI视频修复与增强
  • 从浏览器到硬盘:猫抓如何重新定义你的网络视频体验
  • HsMod:炉石传说终极开源增强插件完全指南
  • OpenCV 4.8.0 形态学操作实战:3种结构元素与5种算子组合效果对比

日新闻

  • STM32F745VG与MC6470 IMU的高性能姿态控制系统设计
  • 机器不消费,人何以生存
  • AI项目操作手册编写规范与最佳实践

周新闻

  • Windows字体自定义终极方案:No!! MeiryoUI完全指南
  • Deepin Boot Maker:告别命令行,3分钟制作Linux启动盘的智能解决方案
  • Plain Craft Launcher 2:重新定义你的Minecraft游戏体验

月新闻

  • 2026年6月公司网站搭建最新热门渠道测评:四大低成本/零代码平台对比+避坑
  • 【Linux】Linux arm 编译QT程序,出现expected “}“报错
  • 【MATLAB例程】四基站二维AOA定位与距离辅助增强对比仿真。基于角度观测和测距修正的固定目标平面定位精度分析

关于尧图

  • 公司简介
  • 团队介绍
  • 企业文化
  • 荣誉资质

服务项目

  • 定制开发
  • 电商建站
  • UI 设计
  • 运维服务

快速链接

  • 案例展示
  • 建站流程
  • 常见问题
  • 资讯中心

联系方式

  • 📍北京市朝阳区互联网产业园 A 座 10 层
  • 📞400-888-8888
  • ✉️contact@rkmt.cn
  • 🕐周一至周日 9:00-21:00

© 2024 北京尧图网络科技有限公司 版权所有 | 京 ICP 备 XXXXXXXX 号