我的家族企业使用 GoDaddy 运营一个网站,用于促销目的。前几天我检查了日志,发现了一些我不太理解的非常奇怪的流量:
188.234.248.201 - - [31/Jan/2019:10:22:26 -0700] "GET / HTTP/1.1" 301 241 "http://reduslimofficial.ru/about/" "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/64.0.3282.140 Safari/537.36 Edge/17.17134"
185.127.17.253 - - [31/Jan/2019:10:23:15 -0700] "GET / HTTP/1.1" 301 237 "-" "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.11 (KHTML, like Gecko) Chrome/23.0.1271.64 Safari/537.11"
216.244.66.244 - - [31/Jan/2019:11:06:44 -0700] "GET /robots.txt HTTP/1.1" 301 251 "-" "Mozilla/5.0 (compatible; DotBot/1.1; http://www.opensiteexplorer.org/dotbot, [email protected])"
178.137.89.36 - - [31/Jan/2019:11:37:03 -0700] "GET /index.php/about/ HTTP/1.1" 301 257 "https://videotop.biz/" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; InfoPath.1"
178.137.89.36 - - [31/Jan/2019:11:37:03 -0700] "GET /index.php/about/ HTTP/1.1" 301 257 "https://videotop.biz/" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; InfoPath.1"
178.137.89.36 - - [31/Jan/2019:11:37:04 -0700] "GET /index.php/about/ HTTP/1.1" 301 257 "https://videotop.biz/" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; InfoPath.1"
178.137.89.36 - - [31/Jan/2019:11:37:04 -0700] "GET /index.php/about/ HTTP/1.1" 301 257 "https://videotop.biz/" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; InfoPath.1"
178.137.89.36 - - [31/Jan/2019:11:37:05 -0700] "GET /index.php/about/ HTTP/1.1" 301 257 "https://videotop.biz/" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; InfoPath.1"
178.137.89.36 - - [31/Jan/2019:11:37:05 -0700] "GET /index.php/about/ HTTP/1.1" 301 257 "https://videotop.biz/" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; InfoPath.1"
34.73.100.219 - - [31/Jan/2019:11:40:15 -0700] "GET /robots.txt HTTP/1.0" 301 247 "-" "ZoominfoBot (zoominfobot at zoominfo dot com)"
34.73.100.219 - - [31/Jan/2019:11:40:15 -0700] "GET / HTTP/1.0" 301 237 "-" "ZoominfoBot (zoominfobot at zoominfo dot com)"
180.76.15.9 - - [31/Jan/2019:11:45:18 -0700] "GET / HTTP/1.1" 301 237 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"
216.244.66.244 - - [31/Jan/2019:11:53:47 -0700] "GET /robots.txt HTTP/1.1" 301 247 "-" "Mozilla/5.0 (compatible; DotBot/1.1; http://www.opensiteexplorer.org/dotbot, [email protected])"
60.191.38.77 - - [31/Jan/2019:11:55:31 -0700] "GET / HTTP/1.1" 301 237 "-" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.11; rv:47.0) Gecko/20100101 Firefox/47.0"
159.138.5.222 - - [31/Jan/2019:12:01:57 -0700] "GET /administrator/ HTTP/1.0" 301 251 "http://www.alancoxonachip.com/wp-login.php" "Opera/9.80 (Windows NT 6.1) Presto/2.12.388 Version/12.12"
其中一些我理解,例如,ZoominfoBot 似乎很可能只是一个爬虫。百度我理解是中国的谷歌,我也看到俄罗斯的谷歌 Yandex 也为我的网站编制索引。但是,所有这些网站是什么,例如“videotop.biz/”和“reduslimofficial.ru/about/”。有大量流量似乎说“301”而不是“200”,这让我有些担心。
一种想法是我不了解日志的性质。那也许是一个引荐域。另一种可能性是服务器上有一些恶意软件用于代理。
注意:我不确定他们使用什么网络服务器。我很确定操作系统是 RHEL。但是 SSH 进入服务器沙箱并ps aux
提供非常有限的信息。鉴于公司的性质,我强烈怀疑它是 apache(404 页面没有说nginx
,尽管我不确定这是否能证明什么)。
我认为您猜对了,假设您使用的是相当常见的
combined
Apache 日志文件格式,在这种情况下,该字段是\"%{Referer}i\"
,它完全在请求客户端的控制之下。它要么是实际人类点击进入您网站的链接,要么是由某种类型的爬虫机器人中的代码填充的。网站管理员发现这些数据有助于确定其网站的使用模式并了解谁可能链接到他们的网站。如果数据对您的业务没有帮助,我的建议是忽略它。