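I am using BeautifulSoup to search for several elements in web pages.
I save the elements I find, but because my script may look for an element that does not exist on the particular page it is parsing, I wrap every lookup in a try/except statement: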
# go through a bunch of webpages
for soup in soups:
    try:  # look for HTML element
        data['val1'].append(soup.find('div', class_="something").text)
    except:  # add N/A if nothing found
        data['val1'].append("N/A")
    try:
        data['val2'].append(soup.find('span', class_="something else").text)
    except:
        data['val2'].append("N/A")
    # and more and more try/excepts for more elements of interest
Is there a cleaner or better way to write something like this?
Posted on 2018-06-03 07:28:28
Raising and catching an exception is expensive compared with a simple check. I would use an if/else statement instead.
v = soup.find('div', class_="something")
if v:
    data['val1'].append(v.text)
else:
    data['val1'].append("N/A")
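If the same check is needed for many elements, it could be pulled into a small helper. This is only a sketch: `text_or_default` is a name I made up, not part of BeautifulSoup, and it assumes the lookup returns either `None` or an object with a `.text` attribute, as `soup.find` does. The explicit `is not None` test is a stylistic choice.

```python
def text_or_default(node, default="N/A"):
    """Return node.text, or `default` when the lookup found nothing (None)."""
    return node.text if node is not None else default

# Usage, assuming `soup` and `data` exist as in the question:
# data['val1'].append(text_or_default(soup.find('div', class_="something")))
```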
Posted on 2018-06-03 07:39:12
This does what you want, and wrapping the lookups in a for loop further reduces the code duplication:
info = [("val1", "div", "something"),
        ("val2", "span", "something else")]
# go through a bunch of webpages
for soup in soups:
    for (val, element, class1) in info:
        query = soup.find(element, class_=class1)
        data[val].append(query.text if query else "N/A")
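The loop above assumes `data` already has a list for every key. One way to avoid setting that up by hand is to build the dict from the same list of tuples. A rough sketch, where `collect` and its parameter names are my own and `soups` is assumed to be an iterable of parsed pages whose `find` accepts a tag name and `class_`:

```python
def collect(soups, fields, default="N/A"):
    """Build {column: [text, ...]} with one entry per page for each field."""
    # Create an empty list per column up front, so no key is ever missing.
    data = {name: [] for name, _, _ in fields}
    for soup in soups:
        for name, tag, css_class in fields:
            found = soup.find(tag, class_=css_class)
            # find() returns None when nothing matches; fall back to default.
            data[name].append(found.text if found is not None else default)
    return data

# Usage, assuming `soups` holds BeautifulSoup objects as in the question:
# data = collect(soups, [("val1", "div", "something"),
#                        ("val2", "span", "something else")])
```

Every column ends up with exactly one entry per page, which keeps the lists aligned if they are later turned into a table.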
https://stackoverflow.com/questions/50661994