一、故障现象
2025 Apr 15 16:07:15 MDS %VSHD-3-VSHD_SYSLOG_EOL_ERR: EOL function acl_cli_show_region_info from library libacl.so exited due to Signal 11
2025 Apr 15 16:09:12 MDS %SYSMGR-2-LAST_CORE_BASIC_TRACE: : PID 27393 with message non-sysmgr(non-sysmgr) crashed, core will be saved .
二、排错思路
2.1 查看设备版本
“show version”
Software
BIOS: version 3.1.0
kickstart:version 8.2(1)
system:version 8.2(1)
kickstart image file is: bootflash:///m9700-sf3ek9-kickstart-mz-npe.8.2.1.bin
system image file is: bootflash:///m9700-sf3ek9-mz-npe.8.2.1.bin
2.2 查看cpu使用率
`show processes cpu history`
111
88890105423222222233348823311 11 1111111111111111 11111 111
100
90
80
70
60
50
40
30
20
10 ######## ##
0....5....1....1....2....2....3....3....4....4....5....5....
0 5 0 5 0 5 0 5 0 5
CPU% per second (last 60 seconds)
# = average CPU%
1 1 1 1 1
187067754594368534395534749378436834371443674440654586544646
100
90
80
70
60
50
40
30
20
10 ******** ** *** *** * * ** ** ** ** *** **** ** *
0....5....1....1....2....2....3....3....4....4....5....5....
0 5 0 5 0 5 0 5 0 5
CPU% per minute (last 60 minutes)
* = maximum CPU% # = average CPU%
111111111111111 1111111111111111111 111111 111111111111111111 1111111 11
403340313340000900110021100100111219010003901060132103121000190010001940
100
90
80
70
60
50
40
30
20 *
10 ************************************************************************
0....5....1....1....2....2....3....3....4....4....5....5....6....6....7.
0 5 0 5 0 5 0 5 0 5 0 5 0
CPU% per hour (last 72 hours)
* = maximum CPU% # = average CPU%
2.3 查看系统内部访问控制列表区域信息
`show system internal acl region-info`
Module 1:
Ingress TCAM:
---------------------------------------
Region Region Loc Loc
ID Name Start End
---------------------------------------
1 Top Sys Region 0 19664
2 Security Region 19665 29504
3 Zoning Region 29505 78640
4 Bottom Region 78641 98304
Egress TCAM:
---------------------------------------
Region Region Loc Loc
ID Name Start End
---------------------------------------
1 Top Sys Region 0 4073
2 Security Region 4074 5714
3 Zoning Region 5715 17179
4 Bottom Region 17180 21252
5 FCC DIS Region 21253 22899
6 FCC ENA Region 22900 24573
… …
Module 18:
Ingress TCAM:
---------------------------------------
Region Region Loc Loc
ID Name Start End
---------------------------------------
1 Top Sys Region 0 19664
2 Security Region 19665 29504
3 Zoning Region 29505 78640
4 Bottom Region 78641 98304
Egress TCAM:
---------------------------------------
Region Region Loc Loc
ID Name Start End
---------------------------------------
1 Top Sys Region 0 4073
2 Security Region 4074 5714
3 Zoning Region 5715 17179
4 Bottom Region 17180 21252
5 FCC DIS Region 21253 22899
6 FCC ENA Region 22900 24573
Internal error during command execution (11 8b)
三、结论
通过以上查询的信息得知,当前设备版本为8.2(1),CPU使用率在20%以下。结合故障log日志显示内容来看,符合当前在8.2(1)版本中上面的一个已知的bug,BUG相关链接如下:
https://bst.cloudapps.cisco.com/bugsearch/bug/CSCvh99074
该问题会在8.3(1)以上版本会修复,当前并无业务影响。
鉴于当前交换机版本,推荐升级版本至8.4(2f)。
https://www.cisco.com/c/en/us/td/docs/switches/datacenter/mds9000/sw/b_MDS_NX-OS_Recommended_Releases.html
四、建议
- 如果在生产环境中不方便升级,则忽略此报警;
2. 如有版本升级计划,建议更新至8.3(1)版本及以上,目前推荐的版本为8.4(2f);
3. 有原厂维保时建议通过收集show tech-support信息反馈至TAC,由厂家二线工程师排查后进行相应处置动作;
4. 以上故障处置仅供参考学习。