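As shown in the figure below, when scraping a page title that contains Chinese text, the printed result is garbled.
Solution: set the correct encoding on the response returned by the request before reading its text.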
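import requests
from lxml import etree

# url2 is the address of the page being scraped, defined earlier
response = requests.get(url=url2)
response.encoding = 'utf-8'  # the encoding fix: decode the body as UTF-8
print(url2)
wb_data = response.text
# Parse the page into a document tree
html = etree.HTML(wb_data)
b = html.xpath('//div[@class = "picmainer"]/h1/text()')
b = b[0]
print(b)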
The key line is: response.encoding = 'utf-8'
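Why that one line matters can be sketched without touching the network. The snippet below is only an illustration: the sample HTML string is made up, and it assumes the garbling came from UTF-8 bytes being decoded as ISO-8859-1, which is what requests falls back to for a text response whose headers declare no charset.

```python
# Simulate the raw bytes a server sends for a UTF-8 page (sample string is invented).
raw_body = "<h1>中文标题</h1>".encode("utf-8")

# Decoding those bytes with the wrong charset produces garbled text.
garbled = raw_body.decode("iso-8859-1")

# Decoding with the charset the page really uses restores the title.
fixed = raw_body.decode("utf-8")


def repair(text: str) -> str:
    """Undo a wrong ISO-8859-1 decode by going back to bytes and decoding as UTF-8."""
    return text.encode("iso-8859-1").decode("utf-8")


print(garbled)
print(fixed)
print(repair(garbled) == fixed)
```

Setting response.encoding before reading response.text has the same effect as the second decode above: the bytes are unchanged, only the charset used to interpret them is corrected.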
The content now displays correctly.
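Hard-coding 'utf-8' works for this page, but some Chinese sites are served as GBK. A slightly more general approach, sketched here with a hypothetical helper named decode_body (not part of requests or lxml), is to trust the declared encoding unless it is missing or the ISO-8859-1 default, and otherwise use a fallback you choose.

```python
from typing import Optional


def decode_body(content: bytes, declared: Optional[str], fallback: str = "utf-8") -> str:
    """Decode raw page bytes, distrusting a missing or ISO-8859-1 declaration.

    Hypothetical helper for illustration; the fallback value is an assumption
    about the site being scraped.
    """
    if declared is None or declared.lower() in ("iso-8859-1", "latin-1"):
        encoding = fallback
    else:
        encoding = declared
    # errors="replace" keeps the scraper running if a few bytes are invalid.
    return content.decode(encoding, errors="replace")


# Stand-in for response.content from a UTF-8 page with no charset in its headers.
page_bytes = "<h1>中文标题</h1>".encode("utf-8")
print(decode_body(page_bytes, "ISO-8859-1"))
```

With requests, this could be called as decode_body(response.content, response.encoding) and the result passed to etree.HTML in place of response.text.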
Original article: https://www.cnblogs.com/becks/p/12687064.html
Time: 2024-10-18 21:46:04