list index out of range #31

hanyunxuan · 2017-11-09T15:15:52Z

Traceback (most recent call last):
  File "crawler.py", line 163, in <module>
    crawler.run()
  File "crawler.py", line 90, in run
    for index, url in enumerate(self.parse_menu(self.request(self.start_url))):
  File "crawler.py", line 116, in parse_menu
    menu_tag = soup.find_all(class_="uk-nav uk-nav-side")[1]


wzming · 2017-11-15T17:13:53Z

I hit the same index-out-of-range error:
Traceback (most recent call last):
  File "crawler.py", line 163, in <module>
    crawler.run()
  File "crawler.py", line 90, in run
    for index, url in enumerate(self.parse_menu(self.request(self.start_url))):
  File "crawler.py", line 116, in parse_menu
    menu_tag = soup.find_all(class_="uk-nav uk-nav-side")[1]
IndexError: list index out of range

daolanfler · 2017-11-17T15:19:24Z

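Set a breakpoint at `return response` in the `request` function. At that point `response.content` is "...503 Service Temporarily Unavailable...", which suggests the site is receiving too many requests, so the list returned by `find_all` is empty.
That is my understanding, anyway. But when I downloaded the page source and parsed it locally, `soup.find_all(class_="uk-nav uk-nav-side")[1]` still raised the error, and that part I don't understand.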
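
To tell those two cases apart (an error page from the server versus a page whose markup simply has fewer matches than expected), the indexing in `parse_menu` could be guarded. This is only a sketch: `pick_menu_tag` and `MenuNotFoundError` are hypothetical names, not part of crawler.py, and `candidates` is assumed to be the list returned by `soup.find_all(class_="uk-nav uk-nav-side")`.

```python
class MenuNotFoundError(RuntimeError):
    """Raised when the page does not contain the expected menu block."""


def pick_menu_tag(candidates, status_code, position=1):
    """Return candidates[position], or raise an error that says why it is missing.

    candidates  -- assumed to be the result of soup.find_all(...)
    status_code -- assumed to be response.status_code of the parsed page
    """
    # A non-200 answer (e.g. 503) means the body is an error page.
    if status_code != 200:
        raise MenuNotFoundError(
            f"server answered HTTP {status_code}; the body is not the menu page"
        )
    # The page loaded, but the selector matched too few elements.
    if len(candidates) <= position:
        raise MenuNotFoundError(
            f"expected at least {position + 1} matching elements, "
            f"found {len(candidates)}"
        )
    return candidates[position]
```

With this in place, a 503 and a selector mismatch produce different messages instead of the same bare `IndexError`.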

afetmin · 2017-12-15T08:24:39Z

Mr. Liao's website has anti-scraping measures: send too many requests and it responds with a 503.
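
If the 503 really is rate limiting, one common workaround is to slow down and retry. The following is a minimal sketch under that assumption; `fetch_with_backoff` is a hypothetical helper, and `fetch` stands for any callable that returns an object with a `status_code` attribute, such as `requests.get` or the crawler's own `request` method.

```python
import time


def fetch_with_backoff(fetch, url, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call fetch(url) until the answer is not a 503, waiting longer each time.

    Returns the last response, which may still be a 503 if every attempt failed.
    """
    for attempt in range(max_attempts):
        response = fetch(url)
        if response.status_code != 503:
            return response
        # Wait before the next try; no wait after the final attempt.
        if attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)  # base_delay, 2x, 4x, ...
    return response
```

The `sleep` parameter exists only so the delay can be replaced in tests; in normal use the default `time.sleep` applies. Whether the site lifts the 503 after a pause is not confirmed in this thread.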

fw6669998 · 2019-07-18T13:45:19Z

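> Mr. Liao's website has anti-scraping measures: send too many requests and it responds with a 503.

Adding a request header where the request is sent fixes it: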
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36 Core/1.70.3704.400 QQBrowser/10.4.3588.400'
}
response = requests.get(url, headers=headers, **kwargs)
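
One caveat with passing `headers=` alongside `**kwargs`: if a caller already puts `headers` into `kwargs`, Python raises a `TypeError` for the duplicate keyword argument. A small merge step avoids that. This is a sketch only; `with_default_headers` and `DEFAULT_HEADERS` are hypothetical names, not part of crawler.py.

```python
DEFAULT_HEADERS = {
    "User-Agent": (
        "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 "
        "(KHTML, like Gecko) Chrome/70.0.3538.25 Safari/537.36 "
        "Core/1.70.3704.400 QQBrowser/10.4.3588.400"
    ),
}


def with_default_headers(kwargs, defaults=DEFAULT_HEADERS):
    """Return a copy of kwargs whose 'headers' entry includes the defaults.

    Headers supplied by the caller win over the defaults. Keys are compared
    exactly, so 'user-agent' and 'User-Agent' count as different keys here.
    """
    merged = dict(kwargs)  # leave the caller's dict untouched
    merged["headers"] = {**defaults, **(kwargs.get("headers") or {})}
    return merged
```

The call then becomes `response = requests.get(url, **with_default_headers(kwargs))`.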
