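I have built four identical nodes on Supermicro 1025TC-TB hardware running OpenBSD 4.4. Two of the nodes are routers and two are load balancers. Each pair runs in an active/passive configuration, with CARP providing failover.
The active router node logs these errors at a rate of about two per hour: May 5 04:09:30 gw /bsd: ichiic0: abort failed, status 0x0
Occasionally this one pops up: May 5 03:59:41 gw /bsd: ichiic0: abort failed, status 0x40
I have not found a pattern to when these are logged. ichiic is the Intel SMBus driver. This motherboard also has I/OAT features, but OpenBSD does not support them. I am running the latest BIOS.
I suspect these are logged during particularly high interrupt spikes, but sysstat is slow enough that I can't correlate them directly. I also can't reproduce the problem manually. The load balancers are not affected.
Any ideas?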
dmesg:
OpenBSD 4.4 (GENERIC) #1021: Tue Aug 12 17:16:55 MDT 2008
[email protected]:/usr/src/sys/arch/i386/compile/GENERIC
cpu0: Intel(R) Xeon(R) CPU E5405 @ 2.00GHz ("GenuineIntel" 686-class) 2.01 GHz
cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS-CPL,VMX,TM2,CX16,xTPR
real mem = 3756871680 (3582MB)
avail mem = 3648520192 (3479MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 08/20/08, BIOS32 rev. 0 @ 0xfdb90, SMBIOS rev. 2.5 @ 0xdff5f000 (39 entries)
bios0: vendor Phoenix Technologies LTD version "1.1" date 08/20/2008
bios0: Supermicro X7DCT
acpi0 at bios0: rev 2
acpi0: tables DSDT FACP APIC MCFG BOOT SPCR ERST HEST BERT EINJ SLIC SSDT SSDT SSDT SSDT SSDT SSDT SSDT SSDT SSDT
acpi0: wakeup devices P0P2(S5) P0P4(S5) USB4(S1) USB5(S1) USB7(S1) ESB2(S1) EXP1(S5) EXP5(S5) EXP6(S5) USB1(S1) USB2(S1) USB3(S1) USB6(S1) ESB1(S1) PCIB(S5) COM1(S5) COM2(S5)
acpitimer0 at acpi0: 3579545 Hz, 24 bits
acpiprt0 at acpi0: bus 1 (P0P2)
acpiprt1 at acpi0: bus 2 (P0P4)
acpiprt2 at acpi0: bus 0 (PCI0)
acpiprt3 at acpi0: bus 3 (EXP1)
acpiprt4 at acpi0: bus 4 (EXP5)
acpiprt5 at acpi0: bus 5 (EXP6)
acpiprt6 at acpi0: bus 6 (PCIB)
acpicpu0 at acpi0
acpibtn0 at acpi0: PWRB
bios0: ROM list: 0xc0000/0xb000
ipmi at mainbus0 not configured
cpu0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel 5100 Host" rev 0x90
ppb0 at pci0 dev 2 function 0 "Intel 5100 PCIE" rev 0x90
pci1 at ppb0 bus 1
ppb1 at pci0 dev 4 function 0 "Intel 5100 PCIE" rev 0x90
pci2 at ppb1 bus 2
"Intel I/OAT SCNB" rev 0x90 at pci0 dev 8 function 0 not configured
pchb1 at pci0 dev 16 function 0 "Intel 5100 FSB" rev 0x90
pchb2 at pci0 dev 16 function 1 "Intel 5100 FSB" rev 0x90
pchb3 at pci0 dev 16 function 2 "Intel 5100 FSB" rev 0x90
pchb4 at pci0 dev 17 function 0 "Intel 5100 Reserved" rev 0x90
pchb5 at pci0 dev 19 function 0 "Intel 5100 Reserved" rev 0x90
pchb6 at pci0 dev 21 function 0 "Intel 5100 DDR" rev 0x90
pchb7 at pci0 dev 22 function 0 "Intel 5100 DDR" rev 0x90
uhci0 at pci0 dev 26 function 0 "Intel 82801I USB" rev 0x02: irq 5
ehci0 at pci0 dev 26 function 7 "Intel 82801I USB" rev 0x02: irq 11
usb0 at ehci0: USB revision 2.0
uhub0 at usb0 "Intel EHCI root hub" rev 2.00/1.00 addr 1
ppb2 at pci0 dev 28 function 0 "Intel 82801I PCIE" rev 0x02: irq 10
pci3 at ppb2 bus 3
ppb3 at pci0 dev 28 function 4 "Intel 82801I PCIE" rev 0x02: irq 10
pci4 at ppb3 bus 4
em0 at pci4 dev 0 function 0 "Intel PRO/1000MT (82573E)" rev 0x03: irq 10, address 00:30:48:d4:62:ea
ppb4 at pci0 dev 28 function 5 "Intel 82801I PCIE" rev 0x02: irq 10
pci5 at ppb4 bus 5
em1 at pci5 dev 0 function 0 "Intel PRO/1000MT (82573L)" rev 0x00: irq 10, address 00:30:48:d4:62:eb
uhci1 at pci0 dev 29 function 0 "Intel 82801I USB" rev 0x02: irq 7
uhci2 at pci0 dev 29 function 1 "Intel 82801I USB" rev 0x02: irq 11
uhci3 at pci0 dev 29 function 2 "Intel 82801I USB" rev 0x02: irq 10
ehci1 at pci0 dev 29 function 7 "Intel 82801I USB" rev 0x02: irq 7
usb1 at ehci1: USB revision 2.0
uhub1 at usb1 "Intel EHCI root hub" rev 2.00/1.00 addr 1
ppb5 at pci0 dev 30 function 0 "Intel 82801BA Hub-to-PCI" rev 0x92
pci6 at ppb5 bus 6
vga1 at pci6 dev 1 function 0 "ATI ES1000" rev 0x02
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
drm at vga1 unsupported
ichpcib0 at pci0 dev 31 function 0 "Intel 82801IR LPC" rev 0x02: PM disabled
pciide0 at pci0 dev 31 function 2 "Intel 82801I SATA" rev 0x02: DMA, channel 0 configured to native-PCI, channel 1 configured to native-PCI
pciide0: using irq 10 for native-PCI interrupt
wd0 at pciide0 channel 0 drive 0: <WDC WD1600BEKT-00F3T0>
wd0: 16-sector PIO, LBA48, 152627MB, 312581808 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5
ichiic0 at pci0 dev 31 function 3 "Intel 82801I SMBus" rev 0x02: irq 10
iic0 at ichiic0
lm1 at iic0 addr 0x2d: W83627HF
wbng0 at iic0 addr 0x2f: w83793g
pciide1 at pci0 dev 31 function 5 "Intel 82801I SATA" rev 0x02: DMA, channel 0 wired to native-PCI, channel 1 wired to native-PCI
pciide1: using irq 11 for native-PCI interrupt
usb2 at uhci0: USB revision 1.0
uhub2 at usb2 "Intel UHCI root hub" rev 1.00/1.00 addr 1
usb3 at uhci1: USB revision 1.0
uhub3 at usb3 "Intel UHCI root hub" rev 1.00/1.00 addr 1
usb4 at uhci2: USB revision 1.0
uhub4 at usb4 "Intel UHCI root hub" rev 1.00/1.00 addr 1
usb5 at uhci3: USB revision 1.0
uhub5 at usb5 "Intel UHCI root hub" rev 1.00/1.00 addr 1
isa0 at ichpcib0
isadma0 at isa0
com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
spkr0 at pcppi0
wbsio0 at isa0 port 0x2e/2: W83627HF rev 0x41
wbsio0 port 0x290/2 not configured
npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
biomask fde5 netmask fde5 ttymask ffff
mtrr: Pentium Pro MTRR support
softraid0 at root
root on wd0a swap on wd0b dump on wd0b
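Are you running the latest BIOS? That would be the first thing I'd check.
Personally, I've never seen this error (I run an OBSD 4.4 dev box). I did some research for you, and I can tell you that more than one person has run into the same problem (even before OBSD 4.4; I'm sure you've come across their posts too), but there is no definitive answer. Some attribute it to hyperthreading, others to a faulty motherboard.
I'll keep this on my to-discover/learn list. Sorry I can't be of more help.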
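The error comes from the ichiic_i2c_exec() function in ichiic.c and indicates a timeout while waiting for a transfer to complete. If it's very infrequent, I wouldn't worry about it. But I would keep a close eye on it in the logs (a log checker?). If it starts happening more often, I'd suspect the hardware. Could you try upgrading to a snapshot and see whether it still happens? If that doesn't fix it, it's worth reporting to the developers.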
By the way, there is a nasty DoS vulnerability in the version you are running.
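If you want to watch how often the message shows up, as suggested above, a small script along these lines might do. It is only a rough sketch: the function name count_aborts is made up for illustration, and the regular expression assumes the log lines look exactly like the two quoted in the first post.

```python
import re
from collections import Counter

# Assumed line format, taken from the two examples in the first post:
#   May 5 04:09:30 gw /bsd: ichiic0: abort failed, status 0x0
PATTERN = re.compile(
    r"^(?P<month>\w{3})\s+(?P<day>\d{1,2})\s+(?P<hour>\d{2}):\d{2}:\d{2}\s+"
    r"\S+\s+/bsd:\s+ichiic0: abort failed, status (?P<status>0x[0-9a-fA-F]+)"
)


def count_aborts(lines):
    """Count ichiic0 abort messages per hour and per status code.

    Hypothetical helper: `lines` is any iterable of syslog lines.
    Returns two Counters: (per_hour, per_status).
    """
    per_hour = Counter()
    per_status = Counter()
    for line in lines:
        m = PATTERN.match(line)
        if m is None:
            continue  # not an ichiic0 abort message
        # Bucket by month, day and hour, e.g. "May 5 04:00".
        hour_key = f"{m['month']} {int(m['day'])} {m['hour']}:00"
        per_hour[hour_key] += 1
        per_status[m["status"].lower()] += 1
    return per_hour, per_status
```

You would feed it an open log file, for example the lines of /var/log/messages (assuming kernel messages go there on your box), and compare the per-hour counts over a few days. A steady two per hour would match what you describe; a rising count would point more toward hardware.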