Python 3.x HTTP Error 403: Forbidden

The Fobidden error often raised when using request.open to open some urls.

such as:

url_1 = ‘https://movie.douban.com/subject/26363254/comments?status=P‘
url_2 = ‘https://www.glassdoor.com/Interview/Texas-Instruments-Interview-Questions-E651_P4.htm‘

request.urlopen(url_1) # ** no error raised

request.urlopen(url_2) # ** Fobidden error raised

Here is the reason:

When using urllib.request.urlopen to visit a URL, the server will only receive a simple request for this webpage without knowing the hidden infos about exploer,operating system,platform, which are abnormal.

Some websited will vefify the UserAgent info to prevent the abnoraml visisting.

So the solution : Add these infos to the UserAgent to acts as using exploer to visit

req = request.Request(url,headers={‘User-Agent‘: ‘Mozilla/5.0‘}) # ** this would fix, also you can add other infos to User-Agent

时间： 2025-01-11 03:32:10

Python 3.x HTTP Error 403: Forbidden的相关文章

[Python] urllib2.HTTPError: HTTP Error 403: Forbidden

搬运自http://www.2cto.com/kf/201309/242273.html,感谢原作. 之所以出现上面的异常,是因为如果用 urllib.request.urlopen 方式打开一个URL,服务器端只会收到一个单纯的对于该页面访问的请求.但是服务器并不知道发送这个请求使用的浏览器,操作系统,硬件平台等信息,而缺失这些信息的请求往往都是非正常的访问,例如爬虫.有些网站为了防止这种非正常的访问,会验证请求信息中的UserAgent(它的信息包括硬件平台.系统软件.应用软件和用户个人偏好

urllib.error.HTTPError: HTTP Error 403: Forbidden

问题: urllib.request.urlopen() 方法经常会被用来打开一个网页的源代码,然后会去分析这个页面源代码,但是对于有的网站使用这种方法时会抛出"HTTP Error 403: Forbidden"异常例如执行下面的语句时 [python] <span style="font-size:14px;"> urllib.request.urlopen("http://blog.csdn.net/eric_sunah/articl

爬虫403问题解决urllib.error.HTTPError: HTTP Error 403: Forbidden

一.爬虫时,出现urllib.error.HTTPError: HTTP Error 403: Forbidden Traceback (most recent call last): File "D:/访问web.py", line 75, in <module> downHtml(url=url) File "D:/urllib访问web.py", line 44, in downHtml html=request.urlre

解决git提交问题error: The requested URL returned error: 403 Forbidden while accessing

git提交代码时,出现这个错误"error: The requested URL returned error: 403 Forbidden while accessing https" 解决方法: 编辑.git文件夹下的config文件就可以. vim .git/config #改动对于的配置 #原来的url = https://github.com/elitecodegroovy/PhoenixC.git url = https://[email protected]/elitec

解决github push错误The requested URL returned error: 403 Forbidden while accessing

来源:http://blog.csdn.net/happyteafriends/article/details/11554043 github push错误: [html] view plaincopyprint? git push error: The requested URL returned error: 403 Forbidden while accessing https://github.com/wangz/future.git/info/refs git version 1.7.

Apache error: 403 Forbidden You don't have permission to access

CentOS 6 solution: chcon -t httpd_sys_content_t -R /directory refer to: https://www.centos.org/forums/viewtopic.php?f=19&t=15128&start=10#p70999 Apache error: 403 Forbidden You don't have permission to access

git推送到github报错：error: The requested URL returned error: 403 Forbidden while accessing https://github.com

最近使用git命令从github克隆仓库到版本,然后进行提交到github时报错如下: [[email protected] git_test]# git push origin mastererror: The requested URL returned error: 403 Forbidden while accessing https://github.com/jsonhc/git_test.git/info/refs fatal: HTTP request failed 解决办法:参考

Python爬虫报错："HTTP Error 403: Forbidden"

错误原因:主要是由于该网站禁止爬虫导致的,可以在请求加上头信息,伪装成浏览器访问User-Agent. 新增user-agent信息: headers = {'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/71.0.3578.80 Safari/537.36'} req = request.Request(Spider.url, header

HTTP Error 403: Forbidden

在写网页爬虫的时候,有的网站会有反爬取措施,所以有可能出现上面所示bug 出现bug的地方可能有两处: 1. requests请求时 requests.get(url),返回结果是403. 解决方法: headers= { 'User-Ageent':'一些字符', 'Cookie':'一些字符' } requests.get(url, headers=headers), 此时返回结果应该就是200,正常.加入headers的目的是,模拟人的行为,让服务器认为是人在操作, User-Agent,

猜你喜欢

Python 字符串操作

转义字符 (\) 原始字符串(在字符串开始的引号之前加上r) 三重引号的多行字符串(''' ''') 多行注释(""" """) 字符串下标 ...

#Shaass and Lights:CodeForces - 294C

1 /************************************************************************************************* ...

Valid Sudoku leetcode

Determine if a Sudoku is valid, according to: Sudoku Puzzles - The Rules. The Sudoku board could be ...

郭正远博文列表

1.软件推介系列 1.1 ABBYY FineReader12安装破解 2.经验分享系列 2.1 获取资源的"手段" 2.2 快速启动程序几种方法

package jiexi; import javax.xml.parsers.DocumentBuilder; import javax.xml.parsers.DocumentBuilderFac ...

unity, 鼠标与场景交点

在鼠标与场景交点上放一个mark,并于1s后消失: 新建一个空GameObject,命名为moushHitTest,添加下面脚本: using UnityEngine;using System.Col ...

Android服务器端如何遍历MySql查询结果

============问题描述============ 如题,想做一个类似于微信朋友圈或者新浪微博的发状态功能.用户进入后显示所有人发的状态,可是在服务器端该怎么写遍历Sql呢?用Cursor遍历的 ...

自动下载快手视频

@echo off :: 自动下载快手视频 :: get kuaishou video _GKV :: _kslists.txt - https://www.kuaishou.com/live/use ...

homework3:课本习题练习

首先,书上给的代码如下: /******************************************************* * Finds and prints n prime int ...

理解Js的parseInt(转)

parseInt() 方法首先查看位置 0 处的字符,判断它是否是个有效数字:如果不是,该方法将返回 NaN,不再继续执行其他操作.但如果该字符是有效数字,该方法将查看位置 1 处的字符,进行同样的测 ...

一.ubuntu常见的命令

所有命令按字母顺序排列,只介绍最常用参数,相信等你看完之后,就有能力man更详细的用法了此前own也曾发表过几篇文章,详细的介绍了几个命令比如ls,sudo,chmod等等,看不懂man的,请自行查 ...

linux 安装配置JDK

从jdk官网下载 jdk1.7 以我下载的安装包为例子,我下载的是jdk-7u79-linux-x64.tar.gz 1.解压压缩包:tar zxvf jdk-7u79-linux-x64.tar.g ...

.net学习之进程外Session的配置

转载地址:http://www.cnblogs.com/rohelm/archive/2012/05/13/2498465.html 人人都知道怎么去使用session,但是初学者,尤其是自学的学生可 ...

epoll IO多路复用（异步阻塞AIO）

epoll的异步阻塞(AIO): 用户线程创建epoll后,其实是内核线程负责扫描 fd 列表(在网络服务器上可以是socket,socket在创建后返回的也是文件描述符),并填充事件链表.但是,并不 ...

【转】关于动态库和静态库

参考:http://blog.jobbole.com/86852/ 由于我只在windows下使用,linux部分就不多说了,总结一下windows下面的相关知识好了: 静态库之所以成为[静态库], ...

HDU 3268 Columbus’s bargain（最短路 Spfa）

题目链接:http://acm.hdu.edu.cn/showproblem.php?pid=3268 Problem Description On the evening of 3 August 1 ...

POJ 2251-Dungeon Master (三维空间求最短路径）

Description You are trapped in a 3D dungeon and need to find the quickest way out! The dungeon is co ...

关于微信支付冲突的问题

琪琪使用第三方H5制作后,添加组件支付,并配置制服组件: 一.公众平台设置说明:(1)开通有支付权限的公众号:(2)支付授权目录设置:微信支付----开发者配置—设置支付授权目录(http://u.l ...

稀疏矩阵的压缩存储及转置

没有经过处理的稀疏矩阵其实就是一个特殊的二维数组,数组中的大部分元素是0或者其他类型的非法值,只有少数几个非零元素. 为了实现压缩存储,可以只存储稀疏矩阵的非0元素.在存储稀疏矩阵中的非0元素时,必须 ...

Matplotlib——第一章轻松画个图

首先安装matplotlib,使用pip install matplotlib.安装完成后在python的命令行敲入import matplotlib,如果没问题,说明安装成功可以开始画图了. 看好了 ...

专题

随机推荐

© 2025 憋错料 | info#biecuoliao.com | 10 q. 0.021 s.