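I'm running Ubuntu 20.04 on a desktop computer with an Intel Core i9, 16 GB of RAM, and an ASUS motherboard. Sometimes, while I'm running applications such as OBS Studio, Skype, or Chrome, the computer suddenly restarts. I don't know the cause, and I haven't found an article that helps solve the problem. Below I describe what I've tried in order to find out what might be wrong with my hardware.
The output of last reboot shows that earlier Ubuntu sessions that ended in an unexpected restart are listed as "still running":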
reboot system boot 5.4.0-42-generic Wed Aug 26 11:00 still running
reboot system boot 5.4.0-42-generic Tue Aug 25 06:20 still running
reboot system boot 5.4.0-42-generic Mon Aug 24 06:38 - 00:06 (17:28)
reboot system boot 5.4.0-42-generic Sun Aug 23 18:52 - 23:36 (04:44)
reboot system boot 5.4.0-42-generic Sun Aug 23 06:32 - 23:36 (17:04)
reboot system boot 5.4.0-42-generic Thu Aug 20 09:42 - 18:17 (2+08:35)
reboot system boot 5.4.0-42-generic Mon Aug 17 21:55 - 22:22 (00:26)
reboot system boot 5.4.0-42-generic Mon Aug 17 09:22 - 21:55 (12:33)
reboot system boot 5.4.0-42-generic Mon Aug 17 09:00 - 21:55 (12:54)
reboot system boot 5.4.0-42-generic Mon Aug 17 08:55 - 21:55 (12:59)
reboot system boot 5.4.0-42-generic Mon Aug 17 05:56 - 07:37 (01:40)
reboot system boot 5.4.0-42-generic Mon Aug 17 05:34 - 07:37 (02:02)
reboot system boot 5.4.0-42-generic Sun Aug 16 21:09 - 00:07 (02:58)
reboot system boot 5.4.0-42-generic Sun Aug 16 20:52 - 21:09 (00:17)
reboot system boot 5.4.0-42-generic Sun Aug 16 20:38 - 20:51 (00:12)
reboot system boot 5.4.0-42-generic Sun Aug 16 20:14 - 20:38 (00:23)
reboot system boot 5.4.0-42-generic Sun Aug 16 20:05 - 20:38 (00:33)
reboot system boot 5.4.0-42-generic Sun Aug 16 19:31 - 20:38 (01:07)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:39 - 19:30 (00:51)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:27 - 18:38 (00:11)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:22 - 18:27 (00:04)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:18 - 18:27 (00:08)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:16 - 18:27 (00:10)
reboot system boot 5.4.0-42-generic Sun Aug 16 18:11 - 18:27 (00:15)
reboot system boot 5.4.0-42-generic Sun Aug 16 16:42 - 18:11 (01:28)
reboot system boot 5.4.0-42-generic Sun Aug 16 16:30 - 16:42 (00:11)
reboot system boot 5.4.0-42-generic Sun Aug 16 16:22 - 16:30 (00:08)
reboot system boot 5.4.0-42-generic Sun Aug 16 16:13 - 16:22 (00:08)
reboot system boot 5.4.0-42-generic Sun Aug 16 15:50 - 16:13 (00:23)
reboot system boot 5.4.0-42-generic Sun Aug 16 15:46 - 16:13 (00:27)
reboot system boot 5.4.0-42-generic Sun Aug 16 14:01 - 15:42 (01:41)
reboot system boot 5.4.0-42-generic Sun Aug 16 13:50 - 14:00 (00:09)
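To pick the suspicious entries out of a long listing like the one above, a small script can help. This is only a sketch: it assumes the column layout shown here (other locales or versions of last may format lines differently), and the function name suspicious_boots is made up for illustration. The idea is that last prints the newest boot first, so any older entry still marked "still running" probably never recorded a clean shutdown.

```python
import re

# Matches `last reboot` lines in the layout shown above (an assumption:
# other locales or util-linux versions may print the columns differently).
LINE_RE = re.compile(
    r"^reboot\s+system\s+boot\s+(?P<kernel>\S+)\s+"
    r"(?P<start>\w{3} \w{3}\s+\d{1,2} \d{2}:\d{2})\s+"
    r"(?P<end>still running|-\s+\d{2}:\d{2}\s+\(.*\))\s*$"
)


def suspicious_boots(lines):
    """Return start times of boots marked 'still running' that are not the newest entry."""
    entries = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m:
            entries.append((m.group("start"), m.group("end")))
    # `last` lists the newest boot first, so only entries[0] can truly be running.
    return [start for start, end in entries[1:] if end == "still running"]


sample = [
    "reboot system boot 5.4.0-42-generic Wed Aug 26 11:00 still running",
    "reboot system boot 5.4.0-42-generic Tue Aug 25 06:20 still running",
    "reboot system boot 5.4.0-42-generic Mon Aug 24 06:38 - 00:06 (17:28)",
]
print(suspicious_boots(sample))
```

On a real system the lines could be fed in from the saved output of last reboot instead of the hard-coded sample.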
The computer's hardware configuration is as follows:
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor PCIe Controller (x16) (rev 0d)
00:02.0 VGA compatible controller: Intel Corporation UHD Graphics 630 (Desktop 9 Series) (rev 02)
00:14.0 USB controller: Intel Corporation 200 Series/Z370 Chipset Family USB 3.0 xHCI Controller
00:16.0 Communication controller: Intel Corporation 200 Series PCH CSME HECI #1
00:17.0 SATA controller: Intel Corporation 200 Series PCH SATA controller [AHCI mode]
00:1c.0 PCI bridge: Intel Corporation 200 Series PCH PCI Express Root Port #5 (rev f0)
00:1c.7 PCI bridge: Intel Corporation 200 Series PCH PCI Express Root Port #8 (rev f0)
00:1d.0 PCI bridge: Intel Corporation 200 Series PCH PCI Express Root Port #11 (rev f0)
00:1f.0 ISA bridge: Intel Corporation Device a2ca
00:1f.2 Memory controller: Intel Corporation 200 Series/Z370 Chipset Family Power Management Controller
00:1f.3 Audio device: Intel Corporation 200 Series PCH HD Audio
00:1f.4 SMBus: Intel Corporation 200 Series/Z370 Chipset Family SMBus Controller
01:00.0 VGA compatible controller: NVIDIA Corporation GK208 [GeForce GT 710] (rev a1)
01:00.1 Audio device: NVIDIA Corporation GF119 HDMI Audio Controller (rev a1)
03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
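When I first installed Ubuntu, I tried several times to get the Nvidia driver working, but none of the official Nvidia drivers recognized my Nvidia card. So I am currently running the Nouveau driver.
I stress-tested my CPU with the stress-ng tool and installed powertop to check the power consumption of my hardware. My computer is connected to an uninterruptible power supply (600 VA), and the maximum power consumption of my hardware during the stress test was 104 W. According to sensors, my CPU core temperatures during the stress test were: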
coretemp-isa-0000
Adapter: ISA adapter
Package id 0: +92.0°C (high = +86.0°C, crit = +100.0°C)
Core 0: +91.0°C (high = +86.0°C, crit = +100.0°C)
Core 1: +87.0°C (high = +86.0°C, crit = +100.0°C)
Core 2: +92.0°C (high = +86.0°C, crit = +100.0°C)
Core 3: +91.0°C (high = +86.0°C, crit = +100.0°C)
Core 4: +92.0°C (high = +86.0°C, crit = +100.0°C)
Core 5: +91.0°C (high = +86.0°C, crit = +100.0°C)
Core 6: +89.0°C (high = +86.0°C, crit = +100.0°C)
Core 7: +89.0°C (high = +86.0°C, crit = +100.0°C)
acpitz-acpi-0
Adapter: ACPI interface
temp1: +27.8°C (crit = +119.0°C)
temp2: +29.8°C (crit = +119.0°C)
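A quick way to see which readings exceed their own "high" limit is to parse the sensors text. The sketch below assumes the coretemp line format shown above (other chips or lm-sensors versions may print different fields), and over_high is a hypothetical helper name.

```python
import re

# Pattern for coretemp lines in the format shown above; lines without a
# "high" limit (such as the acpitz readings) are deliberately skipped.
TEMP_RE = re.compile(
    r"^(?P<label>[^:]+):\s+\+(?P<temp>[\d.]+)°C\s+"
    r"\(high = \+(?P<high>[\d.]+)°C, crit = \+(?P<crit>[\d.]+)°C\)"
)


def over_high(sensors_text):
    """Return {label: (temp, high)} for every reading above its 'high' limit."""
    result = {}
    for line in sensors_text.splitlines():
        m = TEMP_RE.match(line.strip())
        if m and float(m.group("temp")) > float(m.group("high")):
            result[m.group("label")] = (float(m.group("temp")), float(m.group("high")))
    return result


sample = """\
Package id 0: +92.0°C (high = +86.0°C, crit = +100.0°C)
Core 1: +87.0°C (high = +86.0°C, crit = +100.0°C)
temp1: +27.8°C (crit = +119.0°C)
"""
for label, (temp, high) in over_high(sample).items():
    print(f"{label}: {temp:.1f}°C exceeds high limit {high:.1f}°C")
```

Run against the full listing above, every core would be reported, since all eight readings are above the 86.0°C high limit.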
powertop output during the same stress test:
System baseline power is estimated at 104 W
Power est. Usage Device name
85.4 W 1065% CPU core
9.68 W 1065% CPU misc
1.01 W 1065% DRAM
100,0% PCI Device: NVIDIA Corporation GK208 [GeForce GT 710]
100,0% USB device: xHCI Host Controller
100,0% USB device: USB Optical Mouse (Logitech)
100,0% USB device: USB Keyboard (USB)
100,0% PCI Device: Intel Corporation 200 Series/Z370 Chipset Family Power Management Controller
100,0% PCI Device: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller
100,0% PCI Device: Intel Corporation 200 Series PCH SATA controller [AHCI mode]
100,0% PCI Device: Intel Corporation 200 Series PCH PCI Express Root Port #5
100,0% PCI Device: Intel Corporation Device a2ca
100,0% PCI Device: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor PCIe Controller (x16)
100,0% PCI Device: Intel Corporation 200 Series PCH PCI Express Root Port #8
100,0% PCI Device: Intel Corporation 200 Series PCH HD Audio
100,0% PCI Device: Intel Corporation 8th Gen Core 8-core Desktop Processor Host Bridge/DRAM Registers [Coffee
100,0% PCI Device: Intel Corporation 200 Series PCH PCI Express Root Port #11
100,0% PCI Device: Intel Corporation UHD Graphics 630 (Desktop 9 Series)
100,0% PCI Device: Intel Corporation 200 Series/Z370 Chipset Family USB 3.0 xHCI Controller
100,0% Audio codec hwC0D0: Realtek
18,6 pkts/s Network interface: enp3s0 (r8169)
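As a rough sanity check on the power numbers, the itemized estimates can be added up and compared with the UPS rating. This is a sketch under stated assumptions: the 0.6 power factor used to convert 600 VA into watts is a guess (the real figure is on the UPS datasheet), and powertop only itemizes the CPU and DRAM here, so the graphics card and other devices are not included in the sum.

```python
# Itemized powertop estimates from the listing above, in watts.
estimates_w = {"CPU core": 85.4, "CPU misc": 9.68, "DRAM": 1.01}
baseline_w = 104.0  # "System baseline power" reported by powertop

UPS_VA = 600.0
ASSUMED_POWER_FACTOR = 0.6  # assumption, not from the post: check the UPS datasheet

itemized_w = sum(estimates_w.values())
ups_capacity_w = UPS_VA * ASSUMED_POWER_FACTOR
load_fraction = baseline_w / ups_capacity_w

print(f"Itemized estimate: {itemized_w:.2f} W")
print(f"Assumed UPS capacity: {ups_capacity_w:.0f} W")
print(f"Estimated load: {load_fraction:.0%} of assumed capacity")
```

Under these assumptions the estimated load is well below the UPS capacity, which would point away from the UPS as the cause, though powertop's figures are estimates and not measurements at the wall.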
Can anyone give me a hint about what is happening to my computer? I'd appreciate any suggestions!
Thanks!
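CPU temperature
The stress-ng test shows CPU temperatures of 87.0°C to 92.0°C (almost 200°F) across all 8 cores. Those temps will destroy your machine. Check that your fans are correctly wired, connected, and running.
Check your BIOS for custom fan settings.
Get those temperatures down as soon as possible!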
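For reference, the Fahrenheit figure follows from the usual conversion F = C × 9/5 + 32. A minimal sketch, using the lowest and highest readings and the crit limit from the sensors output in the question (the helper name c_to_f is just for illustration):

```python
def c_to_f(celsius):
    """Convert degrees Celsius to degrees Fahrenheit."""
    return celsius * 9 / 5 + 32


CRIT_C = 100.0  # "crit" limit reported by sensors for every core

for reading_c in (87.0, 92.0):
    margin_c = CRIT_C - reading_c
    print(f"{reading_c:.1f}°C = {c_to_f(reading_c):.1f}°F, "
          f"{margin_c:.1f}°C below the critical limit")
```

The hottest cores are within a few degrees of the critical limit reported by sensors.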
Overclocking
If your CPU or RAM is overclocked, return them to their default values.
BIOS
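ASUS PRIME H310M-E R2.0/BR
You have BIOS version 1402, dated May 21, 2020.
A newer BIOS is available, version 1605, dated August 14, 2020, which can be downloaded here.
Note: Verify that I've given you the correct web page for your motherboard.
Note: Make good backups before updating the BIOS.
Nvidia
NVIDIA Corporation GK208 [GeForce GT 710]
Regarding the Nvidia problem... the current driver is version 450.66, which can be downloaded here.
Confirm that Secure Boot is disabled in the BIOS.
Purge all current Nvidia drivers, then install the new driver.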
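Update #1:
The message you got back from the Nvidia driver indicates that 450.66 doesn't support your video card, so it isn't suitable for your configuration. You'll need to contact Nvidia Support to ask which driver to use. Until then, select the Nouveau video driver and purge all of the Nvidia stuff again.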
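The output of ps auxc | grep therm is:
I successfully updated the BIOS and installed the Nvidia 450 driver, but the computer restarted by itself during the installation.
My computer's temperatures when idle are as follows:
After rebooting, I see that the Nvidia 450 driver is installed, but when I type nvidia-smi, I get the message: NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
P.S.: This computer is brand new... I bought it two weeks ago.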
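One way to narrow down the nvidia-smi message is to check which graphics kernel module is actually loaded. The sketch below reads text in the format of /proc/modules, where the first field on each line is the module name; the helper names loaded_modules and nvidia_status are made up for illustration, and the sample lines are invented, not taken from this machine.

```python
from pathlib import Path


def loaded_modules(proc_modules_text):
    """Return the set of module names from /proc/modules-style text."""
    return {line.split()[0] for line in proc_modules_text.splitlines() if line.strip()}


def nvidia_status(modules):
    """Summarize which graphics kernel module appears to be in use."""
    if "nvidia" in modules:
        return "nvidia module is loaded"
    if "nouveau" in modules:
        return "nouveau is loaded, nvidia is not"
    return "neither nvidia nor nouveau is loaded"


# Invented sample in /proc/modules format, for illustration only.
sample = "nouveau 1949696 1 - Live 0x0000000000000000\nr8169 90112 0 - Live 0x0000000000000000\n"
print(nvidia_status(loaded_modules(sample)))

# On a real Linux system, read the live module list instead.
proc_modules = Path("/proc/modules")
if proc_modules.exists():
    print(nvidia_status(loaded_modules(proc_modules.read_text())))
```

If the nvidia module is not loaded even though the driver package is installed, that would be consistent with the driver not supporting the card, as the answer's update suggests.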