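This is the 5th time this has happened. It occurs once a week (Tuesday or Wednesday, 03:00-07:00 UTC+0). In the console the instance shows as available, but it cannot be reached. We tried waiting to see whether the instance would recover on its own, but after about 30 minutes nothing had changed. So I rebooted it manually, and it came back online after the reboot (about 5 minutes).
It would be helpful to understand what actually went wrong. This is just a development server with few users and few records.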
Engine: Aurora MySQL 5.7.12
DB instance class: db.t2.small
Backup time: 16:00-16:30 UTC+0
Maintenance time: sun:17:00-sun:17:30 UTC+0
The following are the only logs available after rebooting the instance.
error/mysql-error-running.log.2018-07-24.03 Tue Jul 24 11:14:06 GMT+800 2018 11.8 kB
error/mysql-error-running.log.2018-07-24.04 Tue Jul 24 11:30:00 GMT+800 2018 285.5 kB
error/mysql-error-running.log.2018-07-24.05 Tue Jul 24 12:30:00 GMT+800 2018 31.1 kB
error/mysql-error-running.log.2018-07-24.06 Tue Jul 24 13:30:00 GMT+800 2018 31.8 kB
error/mysql-error-running.log.2018-07-24.07 Tue Jul 24 14:30:00 GMT+800 2018 32.9 kB
error/mysql-error-running.log.2018-07-24.08 Tue Jul 24 15:30:00 GMT+800 2018 29 kB
error/mysql-error-running.log.2018-07-24.09 Tue Jul 24 16:30:00 GMT+800 2018 32.1 kB
error/mysql-error-running.log.2018-07-24.10 Tue Jul 24 17:30:00 GMT+800 2018 27.5 kB
error/mysql-error-running.log.2018-07-24.11 Tue Jul 24 18:30:00 GMT+800 2018 31.7 kB
error/mysql-error-running.log.2018-07-24.12 Tue Jul 24 19:30:00 GMT+800 2018 27.1 kB
error/mysql-error-running.log.2018-07-24.13 Tue Jul 24 20:30:00 GMT+800 2018 22.4 kB
error/mysql-error-running.log.2018-07-24.14 Tue Jul 24 21:30:00 GMT+800 2018 22.8 kB
error/mysql-error-running.log.2018-07-24.15 Tue Jul 24 22:30:00 GMT+800 2018 24.7 kB
error/mysql-error-running.log.2018-07-24.16 Tue Jul 24 23:30:00 GMT+800 2018 24.7 kB
error/mysql-error.log Wed Jul 25 00:34:45 GMT+800 2018 2.6 kB
external/mysql-external.log Wed Jul 25 00:30:00 GMT+800 2018 7.6 kB
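The timestamps in this list are in GMT+8, while the outage window above is given in UTC+0. A minimal sketch for converting them, assuming the console offset is a fixed +08:00 (the helper name `to_utc` is made up for illustration):

```python
from datetime import datetime, timedelta, timezone

# Assumption: the console shows a fixed GMT+8 offset for every entry.
GMT8 = timezone(timedelta(hours=8))

def to_utc(stamp: str) -> datetime:
    """Parse a 'Tue Jul 24 11:14:06 GMT+800 2018' style stamp and return it in UTC."""
    parts = stamp.split()
    # Drop the 'GMT+800' token (index 4) and parse the rest as a naive time.
    naive = datetime.strptime(" ".join(parts[:4] + parts[5:]), "%a %b %d %H:%M:%S %Y")
    return naive.replace(tzinfo=GMT8).astimezone(timezone.utc)

print(to_utc("Tue Jul 24 11:14:06 GMT+800 2018"))
```

Converted this way, the first log file's timestamp falls at 03:14:06 UTC, which matches both the `.03` hour suffix in its file name and the 03:00-07:00 UTC+0 window.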
external/mysql-external.log
/rdsdbbin/oscar/bin/mysqld, Version: 5.7.12 (MySQL Community Server (GPL)). started with:
Tcp port: 3306 Unix socket: /tmp/mysql.sock
Time,ServerHost,User,UserHost,Command,Payload
/rdsdbbin/oscar/bin/mysqld, Version: 5.7.12 (MySQL Community Server (GPL)). started with:
Tcp port: 3306 Unix socket: /tmp/mysql.sock
Time,ServerHost,User,UserHost,Command,Payload
/rdsdbbin/oscar/bin/mysqld, Version: 5.7.12 (MySQL Community Server (GPL)). started with:
Tcp port: 3306 Unix socket: /tmp/mysql.sock
Time,ServerHost,User,UserHost,Command,Payload
----------------------- END OF LOG ----------------------
error/mysql-error-running.log.2018-07-24.03 shows: https://pastebin.com/ywmXLR5g
error/mysql-error-running.log.2018-07-24.04 shows: https://pastebin.com/g1dkR6rj
error/mysql-error-running.log.2018-07-24.18 shows: https://pastebin.com/g0aAXfaT
All other logs show nothing (see image).
Event log
July 24, 2018 at 11:14:14 AM UTC+8 DB instance restarted
July 24, 2018 at 11:13:31 AM UTC+8 Error restarting mysql: Engine bootstrap failed with no mysqld process running...
July 24, 2018 at 11:12:01 AM UTC+8 Recovery of the DB instance is complete.
July 24, 2018 at 11:04:26 AM UTC+8 Recovery of the DB instance has started. Recovery time will vary with the amount of data to be recovered.
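The events above are listed newest first. As a rough sketch, they can be put in chronological order with the elapsed time since recovery started (the `EVENTS` list and the helper names are illustrative only; the last message is shortened):

```python
from datetime import datetime

# Event lines copied from the event log above (newest first, all UTC+8).
EVENTS = [
    ("July 24, 2018 at 11:14:14 AM", "DB instance restarted"),
    ("July 24, 2018 at 11:13:31 AM", "Error restarting mysql: Engine bootstrap failed with no mysqld process running..."),
    ("July 24, 2018 at 11:12:01 AM", "Recovery of the DB instance is complete."),
    ("July 24, 2018 at 11:04:26 AM", "Recovery of the DB instance has started."),
]

def parse(stamp: str) -> datetime:
    """Parse the console's 'July 24, 2018 at 11:14:14 AM' format (offset ignored)."""
    return datetime.strptime(stamp, "%B %d, %Y at %I:%M:%S %p")

def timeline(events):
    """Return (seconds since first event, message) pairs, oldest first."""
    ordered = sorted((parse(s), msg) for s, msg in events)
    start = ordered[0][0]
    return [(int((t - start).total_seconds()), msg) for t, msg in ordered]

for offset, msg in timeline(EVENTS):
    print(f"+{offset:>4}s  {msg}")
```

Read this way, the whole sequence from the start of recovery to the restart spans 588 seconds, just under ten minutes.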
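Special thanks to @WilsonHauck. After 4 weeks of monitoring, I can confirm that manually upgrading Aurora to the latest version resolved the issue.
Version 2.01.1 already includes several bug fixes that address unexpected restarts. https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/AuroraMySQL.Updates.20Updates.html
To upgrade your Aurora manually: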