[Python] urllib2.HTTPError: HTTP Error 403: Forbidden

搬运自http://www.2cto.com/kf/201309/242273.html，感谢原作。

之所以出现上面的异常,是因为如果用 urllib.request.urlopen 方式打开一个URL，服务器端只会收到一个单纯的对于该页面访问的请求。
但是服务器并不知道发送这个请求使用的浏览器，操作系统，硬件平台等信息,而缺失这些信息的请求往往都是非正常的访问,例如爬虫。
有些网站为了防止这种非正常的访问,会验证请求信息中的UserAgent(它的信息包括硬件平台、系统软件、应用软件和用户个人偏好)。
如果UserAgent存在异常或者是不存在,那么这次请求将会被拒绝。

可行的解决方案是在请求中加入UserAgent的信息。
以下是一次成功的例子：

URL=‘TestURL‘　　#用真实的URL替代TestURL
HEADERS={‘User-Agent‘:‘Mozilla/5.0 (Windows NT 6.1; WOW64; rv:36.0) Gecko/20100101 Firefox/36.0‘}
req=urllib2.Request(url=URL,headers=HEADERS)htmlcode=urllib2.urlopen(req).read()

时间： 2024-10-24 01:13:16

[Python] urllib2.HTTPError: HTTP Error 403: Forbidden的相关文章

urllib2.HTTPError: HTTP Error 403: Forbidden

这个问题主要是没有headers,加入一些内容就可以了示例: # -*- coding: UTF-8 -*- import urllib2 site= "http://www.nseindia.com/live_market/dynaContent/live_watch/get_quote/getHistoricalData.jsp?symbol=JPASSOCIAT&fromDate=1-JAN-2012&toDate=1-AUG-2012&datePeriod=un

urllib2.HTTPError: HTTP Error 403: Forbidden 解决方法

参考: https://stackoverflow.com/questions/13303449/urllib2-httperror-http-error-403-forbidden https://segmentfault.com/q/1010000000470724 通过测试应该是request中header的问题. 1 class S0819MtimeTiantangPipeline(object): 2 def process_item(self, item, spider): 3 he

urllib.error.HTTPError: HTTP Error 403: Forbidden

问题: urllib.request.urlopen() 方法经常会被用来打开一个网页的源代码,然后会去分析这个页面源代码,但是对于有的网站使用这种方法时会抛出"HTTP Error 403: Forbidden"异常例如执行下面的语句时 [python] <span style="font-size:14px;"> urllib.request.urlopen("http://blog.csdn.net/eric_sunah/articl

爬虫403问题解决urllib.error.HTTPError: HTTP Error 403: Forbidden

一.爬虫时,出现urllib.error.HTTPError: HTTP Error 403: Forbidden Traceback (most recent call last): File "D:/访问web.py", line 75, in <module> downHtml(url=url) File "D:/urllib访问web.py", line 44, in downHtml html=request.urlre

Python 3.x HTTP Error 403: Forbidden

The Fobidden error often raised when using request.open to open some urls. such as: url_1 = 'https://movie.douban.com/subject/26363254/comments?status=P'url_2 = 'https://www.glassdoor.com/Interview/Texas-Instruments-Interview-Questions-E651_P4.htm' r

python urllib2导出elasticsearch数据时返回 "urllib2.HTTPError: HTTP Error 500: Internal Server Error"

0.业务场景将ES中某个index的某个字段的所有数据,导出到文件中 1.ES数据导出方法简述 ES数据导出方法,我主要找到了以下几个方面,欢迎大家补充: ES官方API:snapshot and restore module The snapshot and restore module allows to create snapshots of individual indices or an entire cluster into a remote repository like sha

解决git提交问题error: The requested URL returned error: 403 Forbidden while accessing

git提交代码时,出现这个错误"error: The requested URL returned error: 403 Forbidden while accessing https" 解决方法: 编辑.git文件夹下的config文件就可以. vim .git/config #改动对于的配置 #原来的url = https://github.com/elitecodegroovy/PhoenixC.git url = https://[email protected]/elitec

解决github push错误The requested URL returned error: 403 Forbidden while accessing

来源:http://blog.csdn.net/happyteafriends/article/details/11554043 github push错误: [html] view plaincopyprint? git push error: The requested URL returned error: 403 Forbidden while accessing https://github.com/wangz/future.git/info/refs git version 1.7.

Apache error: 403 Forbidden You don't have permission to access

CentOS 6 solution: chcon -t httpd_sys_content_t -R /directory refer to: https://www.centos.org/forums/viewtopic.php?f=19&t=15128&start=10#p70999 Apache error: 403 Forbidden You don't have permission to access

猜你喜欢

第一个React程序HelloWorld

一.程序步骤 1.用React.createClass生成组件 2.调用React.render把组件渲染到页面中,dom的操作由react自动完成二.代码 <!DOCTYPE html> ...

struts2官方中文教程系列七：消息资源文件

介绍在本教程中,我们将探索使用Struts 2消息资源功能(也称为 resource bundles 资源绑定).消息资源提供了一种简单的方法,可以将文本放在一个视图页面中,通过应用程序,创建表单字 ...

# include <stdio.h> # include <string.h> int main(void) { int i,n,q,r,s[100]; while(scan ...

python 之初学者的代码示例(短小精悍)(一)

学习Python也有个把月了,最近整理自己初学的代码示例,一个是为了增加自己对细节的把握,一个是让像我一样的初学者能够熟练地使用基础,基础的重要性就不说了,我希望自己能够把这些精巧的小而短的示例分享给 ...

viewWillLayoutSubView

当viewController的bounds又改变,调用这个方法来实现subview的位置.可重写这个方法来实现父视图变化subview跟着变化. > Lifecycle events orde ...

[知识点]A*搜索(启发式搜索)

// 此博文为迁移而来,写于2015年4月4日,不代表本人现在的观点与看法.原始地址:http://blog.sina.com.cn/s/blog_6022c4720102vwud.html 1.前言 ...

项目感悟之团队建设

有时候,估计很多人有同感:一件事情,从刚开始就可以预见结果.是的,抛开其他因素不谈,单纯从一个团队的人员组成上就能看出端倪一二来. 有时候,自己感觉自己都有点苛刻,是呀,与人无仇,可是为什么非要和人家 ...

WinCE: prefetch abort

prefetch abort 是一类比较难解决的问题,因为很难定位出错的位置. 更奇怪的是:程序单独运行就会出错,使用VS2008按 F5 运行就不会出错.更不用想单步调试了,也不会出错啦! 类似于 ...

Unity3D入门俄罗斯方块总结（一）

主要参考:http://blog.csdn.net/kobbbb/article/details/8900974 绘制了俄罗斯方块,进行unity的入门学习. 结果如下: 基本功能实现了. 总结如下: ...

堆排序-algorithms_3th

1 #include <iostream> 2 using namespace std; 3 4 int PARENT(const int &i){ 5 return (i> ...

8Python全栈之路系列之Django Cookie 与Sessi

Python全栈之路系列之Django Cookie与Sessi Cookies cookies是浏览器为Web服务器存储的一小段信息,每次浏览器从某个服务器请求页面时,它向服务器回送之前收到的coo ...

activemq集群的搭建

一.环境准备 1.上传 apache-activemq-5.11.1-bin.tar 和 zookeeper-3.4.5.tar.gz Linux服务器(/usr/local/install 目录下) ...

MVC获取文件路径

string ss = base.HttpContext.Request.RawUrl;//获取当前项目路径 string cc= Server.MapPath("DDTek.lic&quo ...

Oracle学习之DATAGUARD(十) 在同台机器上使用11g rman新特性创建DG

首先使用dbca建立一个数据库,db_name=primary . 2. 为两个数据库准备静态监听.及连接彼此的TNSNAME 11gdg1-> cat listener.ora tnsna ...

10046事件和tkprof命令

使用10046事件是在oralce数据库中查看目标sql的执行计划的另外一种方法.这种方法与使用explain plan命令,dbms_xplan包和autotrace开关的不同之处在于,所得到的执行 ...

Python 入门（四）List和Tuple类型

创建list Python内置的一种数据类型是列表:list.list是一种有序的集合,可以随时添加和删除其中的元素. 比如,列出班里所有同学的名字,就可以用一个list表示: >>> ...

实验五 04彭得源

#include"stdio.h" #include"stdlib.h" #include"time.h" struct wuli{ int ...

Java初级学习1

上午一整天都在做Java的在线聊天系统,遇到点困难,于是稍停一会,在51CTO上随便看看,偶然看到了一句话,正符合我的心情--" 现在,我能大胆的承认自己不知道的知识,因为,我经常告诫自己: ...

C# 多线程操作队列

using System;using System.Collections.Generic;using System.Linq;using System.Text;using System.Threa ...

全网VIP视频免费观看，从此不再买会员

大家好,下面给大家介绍一下免费看VIP电影及电视剧教程1,首先我们要有一部手机或者电脑.2,我们把手机安装上微信,在微信"搜索微信公众:ylzb00"点击关注.(这里涉及广告,但是 ...

专题

随机推荐

© 2024 憋错料 | info#biecuoliao.com | 10 q. 0.020 s.