httrack

修改浏览器id

Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)

暂停,恢复任务

Subject: Re: Pause, continue and stop via command line
Author: Xavier Roche
Date: 10/22/2005 10:35
 
> You do it by pressing the ctrl + c keys. This pauses
> HTTrack. You also get the options to quit,
> interrupt, run as background task, run as blind
> background task or to cancel the mirroring. 

And if you want to control a background job:

To pause, you can create a file named hts-stop.lock in the project directory
(at the same level of hts-log.txt). HTTrack will erase it, wait for pending
transfers to finish, create a new file named hts-paused.lock, and will wait
until this file is erased. The mirror will continue from this point.

https://forum.httrack.com/readmsg/12607/12604/index.html

网页版UI

命令 webhttrack

---恢复内容结束---

修改浏览器id

Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)

暂停,恢复任务

Subject: Re: Pause, continue and stop via command line
Author: Xavier Roche
Date: 10/22/2005 10:35
 
> You do it by pressing the ctrl + c keys. This pauses
> HTTrack. You also get the options to quit,
> interrupt, run as background task, run as blind
> background task or to cancel the mirroring. 

And if you want to control a background job:

To pause, you can create a file named hts-stop.lock in the project directory
(at the same level of hts-log.txt). HTTrack will erase it, wait for pending
transfers to finish, create a new file named hts-paused.lock, and will wait
until this file is erased. The mirror will continue from this point.

https://forum.httrack.com/readmsg/12607/12604/index.html

网页版UI

命令 webhttrack

最大链接数

Subject: Re: Too many URLs, giving up..(>100000)
Author: Christian
Date: 04/18/2015 09:44
 
Soimosan and Russ:

I have the same questions you two had. The DEv here are doing this AMAZING
project as a "charity" to average people like us, and have limited time to
answer questions. So I am going to tell you hot to fix this easily and finish
your project:

I know this is a kind of old thread and despite having been using HTTrack for
almost 7 years, I hit the frustrating "Panic: too many links" error that
prevents pages from being mirrored all the way this week (April 2015)!

But I KNOW there are average users like me that still have this problem and I
am here to help. :)

***I am going to post here the fix to this problem, easy step-by-step for
n00bs and experts alike. :) ***

I confess, I almost went crazy with frustration. HTTrack works so dang well, I
NEVER look at the error log! So for the first time I hit the "Too many URLs,
giving up" error. 

I tried everything: adding that line (example: -#L1000000) to "Rules" to no
avail. Tried the line in the "add url" and "web addresses" to no avail. I
tried deleting the dang .tmp files, not work either. I spent hours creating +*
Rules for each major link artery... nothing. 

Well... After banging my head against the wall over 3 days, I checked error
logs carefully. Ta-da!

So here is the easy way to fix this. The whole "use #L option for more links
(example: -#L1000000)" was not making sense. 

So here is what I did:

1) Do NOT delete hts-cache or .tmp files like I did! Otherwise, the download
will take a long time again from scratch. Go back to the project and open the
existing project. Select action: "Continue interrupted download";

2. Click on "Set Options" --> Limits (tab) --> "Maximum Number of links"
(bottom field) --> Just go for highest available of threads or write
999999999.

But be warned! You need to constantly check the In Progress/action download
screen to be sure you are not copying the entire Internet! lol  One way to do
that is to put "0" under the "Maximum external depth" field (Limits tab) and
checkmark "No external pages" under "Build" tab. I know this can cause you to
miss some files, but that can be fixed manually under "Rules".

Hope this helps! :) Be blessed mirroring!

THANK YOU to the crew at Httrack for this awesome project! :) I cannot write
code, but I am happy to donate yearly to keep the site/host going. :) Thanks
all. :)

Christian
 
Reply Create subthread

All articles

Subject Author Date
Too many URLs, giving up..(>100000)
Soimosan Roger

05/22/2001 10:33
Re: Too many URLs, giving up..(>100000)
Xavier Roche

05/22/2001 11:01
Re: Too many URLs, giving up..(>100000)
russ

01/29/2010 11:05
Re: Too many URLs, giving up..(>100000)
Christian

04/18/2015 09:44
Re: Too many URLs, giving up..(>100000)
muf

03/08/2019 13:09
Re: Too many URLs, giving up..(>100000)
duff

06/01/2019 22:21
Re: Too many URLs, giving up..(>100000)
Homer

07/14/2019 02:26

https://forum.httrack.com/readmsg/33944/310/index.html

原文地址:https://www.cnblogs.com/Searchor/p/11739136.html

时间: 2024-08-06 02:32:39

httrack的相关文章

【工具】今天有人问我可以直接离线一个完整的网站吗?有没有什么工具之类的?我推荐一款:Httrack (网站复制机)案例:离线你的博客园

搞安全的朋友这款工具可能用的比较多,毕竟信息收集阶段还是很必须的,这篇文章主要讲一下离线你的博客园案例 官网:http://www.httrack.com/ 里面有windows,linux,Android等等几个版本 先简单介绍一下,具体的你可以自己摸索一下 一.windows下 安装很简单就不说了,讲下简单使用(里面有简体中文) 新建一个工程名,最好是英文的,选择一下存放路径(分类可以不选) 设置要离线的网站地址(可以设置一些参数)随意演示一下,你们保持默认即可 准备阶段结束 开始离线了~等

网站下载器WebZip、Httrack及AWWWB.COM网站克隆器

 动机 闲扯节点,可略读. 下载并试用这些软件并非是为了一己之私,模仿他人网站以图利.鉴于国内网络环境之艰苦,我等屌丝级半罐水程序员,纵有百度如诸葛大神万般协力相助,也似后主般无能不能解决工作和娱乐中的诸多困难.就那如近段时间自学Android开发为例,不说下载Android SDK费了九牛二虎之力,想看看Android官网文档,每次都想尽一切办法,使用各种手段(fg742p.exe?fg752p.exe?goagent?  看官或许笑我,何须如此费力,VPS+VPN,带你装逼带你飞,so ea

小白学习安全测试——httrack的安装和使用(二)

httrack是一款免费的网站镜像程序,简单理解就是可以在网站结构(网页及一些主要信息文件),下载到本地,可离线浏览,我是按照搭建成功后的console直译过来的 下面说下安装: 我都是在Linux环境下安装的: centos7下安装: #wget http://download.httrack.com/httrack-3.46.1.tar.gz #tar -xzvf httrack-3.46.1.tar.gz #cd httrack-3.46.1 #./configure #make #mak

httrack,webdup,WinHTTrack,WebZip

怎么下载摄像头游戏jabbo,并使其能离线运行?修改 1.摄像头游戏jabbo:JABBO Ultimatum by LiveMurals Interactive电脑为:windows 7 32位.试过很多办法,比如chrome审查元素下载 http://www.livemurals.com/jabbo/ultimatum/Ultimatum_secure.swf 联网运行OK,但只要断开网络,马上不能播放.使用httrack或者webdup进行整站下载也不行,播放器使用的是Flash Play

HTTrack(网站复制机制一)

1.今天争取把这个工具的所有操作都罗列出来 首先打开kali 中的这个工具 Web信息收集工具HTTrack ,会直接出现出现使用语法. 对于传统的像存在Robots.txt的网站,如果程序运行的时候不做限制,在默认的环境下程序不会把网站镜像,简单来说HTTPrack跟随基本的JavaScript或者APPLet.flash中的链接,对于复杂的链接(使用函数和表达式创建的链接)或者服务器端的ImageMap则不能镜像. 2.将一般的参数选项做个解释 Enter project name  //输

Web侦察工具HTTrack (爬取整站)

Web侦察工具HTTrack (爬取整站) HTTrack介绍 爬取整站的网页,用于离线浏览,减少与目标系统交互,HTTrack是一个免费的(GPL,自由软件)和易于使用的离线浏览器工具.它允许您从Internet上下载万维网站点到本地目录,递归地构建所有目录,从服务器获取HTML,图像和其他文件到您的计算机.HTTrack安排原始网站的相关链接结构.只需在浏览器中打开"镜像"网站的页面,即可从链接到链接浏览网站,就像在线查看网站一样.HTTrack也可以更新现有的镜像站点,并恢复中断

HTTrack - 克隆任意网站

HTTrack可以克隆指定网站-把整个网站下载到本地. 可以用在离线浏览上,也可以用来收集信息(甚至有网站使用隐藏的密码文件). 一些仿真度极高的伪网站(为了骗取用户密码),也是使用类似工具做的.Linux默认安装了HTTrack. 如没有可以自行下载安装.Liunx 安装: yum provides httrack yum install httrack MacOS zsh: brew install httrack HTTrack帮助: httrack --help 使用示例: httrac

Google Hacking Database(2016-2017最新)

inurl:share.cgi?ssid=                                  ##云服务器文件和文件夹共享intitle:"Welcome to QNAP Turbo NAS"      ##  QNAP登录页面inurl:"img/main.cgi?next_file"                     ##在线摄像头ext:log inurl:"/pgadmin"                    

★Kali信息收集★8.Nmap :端口扫描

★Kali信息收集~ 0.Httrack 网站复制机 http://www.cnblogs.com/dunitian/p/5061954.html ★Kali信息收集~ 1.Google Hacking + Github Hacking http://www.cnblogs.com/dunitian/p/5074765.html ★Kali信息收集~2.Whois :域名信息 http://www.cnblogs.com/dunitian/p/5074768.html ★Kali信息收集~3.子