我的 FreeNAS 服务器 [build 9.10.2-U1 (86c7ef5)] 的 ZFS 卷上有一个降级磁盘,在尝试更换它之前,我重新启动了服务器。
以下是什么意思,我对那个磁盘有问题吗?
- 在启动时,即使所有磁盘都以卷状态重新联机,我也会得到以下信息:
- 在清理操作期间,一个新警报显示磁盘处于降级状态,校验和为 670(不确定这意味着什么):
- 擦洗结果:
The scrub operation is now finished. Here are the final results: state: DEGRADED status: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected. action: Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'. see: http://illumos.org/msg/ZFS-8000-9P scan: scrub repaired 66.7M in 16h55m with 0 errors on Sat Jan 2 13:32:13 2021 config: NAME STATE READ WRITE CKSUM storage DEGRADED 0 0 0 raidz1-0 DEGRADED 0 0 0 gptid/e0ef3f08-70b6-11e6-b8eb-1c98ec0f2cd4 ONLINE 0 0 0 gptid/e1b21671-70b6-11e6-b8eb-1c98ec0f2cd4 DEGRADED 0 0 1.29K too many errors gptid/e2841c02-70b6-11e6-b8eb-1c98ec0f2cd4 ONLINE 0 0 0 gptid/e3717f0c-70b6-11e6-b8eb-1c98ec0f2cd4 ONLINE 0 0 0 errors: No known data errors
smartctl -a
:SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 90% 39365 172825824 # 2 Extended offline Completed: read failure 90% 39365 172825825 # 3 Short offline Completed without error 00% 39364 - SMART Selective self-test log data structure revision number 1 SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Not_testing 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
正如输出
smartctl -a
所示,驱动器报告其板载测试中的读取错误。这消除了您的 RAID 控制器或软件问题。这真是太糟了。采购一个新驱动器并尽快更换它。该错误可能是暂时的,因为它似乎发生在磁盘上的同一物理位置周围 - FreeNAS/zfs 可能没有再次访问该确切位置,直到您发出清理并告诉它检查整个卷,这就是为什么驱动器在下次启动时重新联机。