今天在跑例行更新的爬蟲時
發現了這個問題
我用的是 chromedriver + selenium
Traceback (most recent call last):
File "/Users/yanzewei/Documents/PythonProjects/ig_parser/Updata_all_ig_img", line 463, in <module>
runInOneAccount(accObj.getAccount(), accObj.getLimitation())
File "/Users/yanzewei/Documents/PythonProjects/ig_parser/Updata_all_ig_img", line 354, in runInOneAccount
driver.get(source_url)
File "/Library/Python/2.7/site-packages/selenium/webdriver/remote/webdriver.py", line 320, in get
self.execute(Command.GET, {'url': url})
File "/Library/Python/2.7/site-packages/selenium/webdriver/remote/webdriver.py", line 306, in execute
response = self.command_executor.execute(driver_command, params)
File "/Library/Python/2.7/site-packages/selenium/webdriver/remote/remote_connection.py", line 460, in execute
return self._request(command_info[0], url, body=data)
File "/Library/Python/2.7/site-packages/selenium/webdriver/remote/remote_connection.py", line 484, in _request
resp = self._conn.getresponse()
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.py", line 1132, in getresponse
response.begin()
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.py", line 453, in begin
version, status, reason = self._read_status()
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.py", line 417, in _read_status
raise BadStatusLine(line)
httplib.BadStatusLine: ''
這時候就疑惑了
明明程式碼都沒修改,怎麼突然會發生這種事呢?
檢查了 web 的 element 有沒有變動,X-Path 是不是正確之後
才到 google 尋求解答
經歷一番「走錯路」後,才發現原因
其實是 Chromedriver 的版本太舊了
更新一下就順利 work 了。
Chromedriver 更新網址如下
http://chromedriver.chromium.org/
留言列表