人人爬取

import urllib.request,http.cookiejar,urllib.parse,re

print(‘start to login‘)
url=‘http://www.renren.com/PLogin.do‘
logindomain=‘renren.com‘

user_agent=‘Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2062.44 Safari/537.36 OPR/24.0.1558.25 (Edition Next)‘
myheader={‘User-Agent‘:user_agent,
‘Host‘:‘www.renren.com‘,
‘GET‘:url}
#msg=input(‘please your msg:‘)
msg=‘hello,everyone,I come from pythonClient.This is just a test\n陈耀烨力量很强‘
cj=http.cookiejar.CookieJar()
opener=urllib.request.build_opener(urllib.request.HTTPCookieProcessor(cj))
urllib.request.install_opener(opener)

values={‘email‘:‘18782938709‘,
‘password‘:‘******‘,
‘domain‘:logindomain}
data=urllib.parse.urlencode(values)
data=data.encode(‘utf-8‘)
req=urllib.request.Request(url,data,headers=myheader)
response=urllib.request.urlopen(req)
#print(response)
responselate=response.read().decode(‘utf-8‘)
#print(responselate)
reInfo = re.compile(r"get_check:‘(.*?)‘,get_check_x:‘(.*?)‘.*?‘id‘:‘(.*?)‘", re.DOTALL)
info = reInfo.findall(responselate)
myid= info[0][2]
print(myid)
tok= info[0][0]
print(tok)
rtk= info[0][1]
print(rtk)

url1=‘http://shell.renren.com/‘+myid+‘/status‘
values1={‘content‘:msg,
‘hostid‘:myid,
‘requestToken‘:tok,
‘_rtk‘:rtk,
‘channel‘:‘renren‘
}
data1=urllib.parse.urlencode(values1)
data1=data1.encode(‘utf-8‘)
myheader1={‘User-Agent‘:user_agent,
‘Host‘:‘shell.renren.com‘,
‘Origin‘:‘http://shell.renren.com‘,
‘Referer‘:‘http://shell.renren.com/ajaxproxy.htm‘,
‘POST‘:url1}
req1=urllib.request.Request(url1,data1,headers=myheader1)
response1=urllib.request.urlopen(req1).read()
print(‘you have broadcasted a status using your renrenAcount‘)

时间： 2024-10-06 15:05:57

人人爬取

人人爬取的相关文章

Python爬取CSDN博客文章

Python爬虫新手教程：爬取了6574篇文章，告诉你产品经理在看什么！

Python Scrapy的QQ音乐爬虫音乐下载、爬取歌曲信息、歌词、精彩评论

使用 Chrome 浏览器插件 Web Scraper 10分钟轻松实现网页数据的爬取

python网络爬虫学习(六)利用Pyspider+Phantomjs爬取淘宝模特图片

Python爬虫实战（2）：爬取京东商品列表

简单爬取京东百万商品的缺货记录

[Python爬虫] Selenium爬取新浪微博客户端用户信息、热点话题及评论 (上)

福利贴——爬取美女图片的Java爬虫小程序代码