Error: "The result of the xpath expression is: [object Attr]. It should be an element"
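YouTube pages are loaded dynamically, so the crawler has to use selenium-webdriver. I first used
find_elements_by_xpath to select a batch of tags, and then
used find_element_by_xpath to narrow the selection down to individual tags. That second step produced the error above.
The cause is that WebDriver's locator methods do not behave like XPath in the browser: they must resolve to an element and cannot point directly at a tag's attribute (or at a text() node).
You need to locate the WebElement first, and then read the attribute from it.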
Correct:
    # Locate the element first, then read the attribute (or .text) from it
    try:
        temp['host_url'] = node.find_element_by_xpath('./div/div/div/ytd-video-meta-block/div/div/div/yt-formatted-string/a').get_attribute('href')
    except Exception as e:
        print(e)
    try:
        temp['show_url'] = node.find_element_by_xpath('./div/ytd-thumbnail/a').get_attribute('href')
    except Exception as e:
        print(e)
    try:
        temp['title'] = node.find_element_by_xpath('./div/div/div[1]/div/h3/a').get_attribute('title')
    except Exception as e:
        print(e)
    try:
        temp['user'] = node.find_element_by_xpath('./div/div/div/ytd-video-meta-block/div/div/div/yt-formatted-string/a').text
    except Exception as e:
        print(e)
Wrong:
    # Each XPath ends in /@href, /@title or /text(), so it does not resolve to an element
    try:
        temp['host_url'] = node.find_element_by_xpath('./div/div/div/ytd-video-meta-block/div/div/div/yt-formatted-string/a/@href')
    except Exception as e:
        print(e)
    try:
        temp['show_url'] = node.find_element_by_xpath('./div/ytd-thumbnail/a/@href')
    except Exception as e:
        print(e)
    try:
        temp['title'] = node.find_element_by_xpath('./div/div/div[1]/div/h3/a/@title')
    except Exception as e:
        print(e)
    try:
        temp['user'] = node.find_element_by_xpath('./div/div/div/ytd-video-meta-block/div/div/div/yt-formatted-string/a/text()')
    except Exception as e:
        print(e)
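If a lot of browser-style XPaths ending in /@attr or /text() already exist, the "locate the element, then read from it" rule can be applied mechanically. The following is only a minimal sketch: split_xpath and extract are hypothetical helper names, not part of Selenium, and the code assumes node is a WebElement (or driver) exposing find_element(by, value). It uses the generic find_element call because, as far as I know, newer Selenium 4 releases no longer ship the find_element_by_* shortcuts.

```python
import re

# Hypothetical helpers (not part of Selenium). They split a browser-style
# XPath ending in /@attr or /text() into the element path plus what to read.
_TAIL = re.compile(r"^(?P<path>.+?)/(?:@(?P<attr>[\w:-]+)|(?P<text>text\(\)))$")


def split_xpath(xpath):
    """Return (element_xpath, attribute_name_or_None, wants_text)."""
    m = _TAIL.match(xpath)
    if m is None:
        # Already points at an element; nothing to strip.
        return xpath, None, False
    return m.group("path"), m.group("attr"), m.group("text") is not None


def extract(node, xpath):
    """Locate the element first, then read the attribute or text from it."""
    path, attr, wants_text = split_xpath(xpath)
    # "xpath" is the string value of selenium's By.XPATH constant.
    element = node.find_element("xpath", path)
    if attr is not None:
        return element.get_attribute(attr)
    if wants_text:
        return element.text
    return element
```

With this, extract(node, './div/ytd-thumbnail/a/@href') would look up './div/ytd-thumbnail/a' and then call get_attribute('href') on the result, which is the same thing the correct version does by hand.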
Original article: https://www.cnblogs.com/chenxi188/p/11640729.html
Time: 2024-10-23 15:25:03