说明
一.项目介绍
对于足球竞猜网页的信息进行爬取并且对信息分析
二.部分代码展示
import requests
from lxml.html import etree
headers = {'Referer': 'http://www.okooo.com/jingcai/',
'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/75.0.3770.100 Safari/537.36'}
url = 'XXXXXXXXXXX'
response = requests.get(url, headers=headers)
response.encoding = response.apparent_encoding
response_html = etree.HTML(response.text)
id_xpath = '//*[@class="touzhu_1"]/@data-mid'
hname_xpath = '//*[@class="touzhu_1"]/@data-hname'
aname_xpath = '//*[@class="touzhu_1"]/@data-aname'
id_list = response_html.xpath(id_xpath)
hname_list = response_html.xpath(hname_xpath)
aname_list = response_html.xpath(aname_xpath)
三.完整代码至于压缩文件夹中
项目链接:https://github.com/a568972484/Crawl_for_football_infor
核心动态代码也至于压缩文件夹中
需要请联系作者
作者名称:a568972484
作者博客:小小咸鱼ywy
博客链接https://www.cnblogs.com/pythonywy
原文地址:https://www.cnblogs.com/pythonywy/p/11209323.html
时间: 2024-10-10 23:00:42