[wget-notify] [bug #20414] wget: wrong behaviour with -A and -R
during recursive retrieval
NoèlKöthe
INVALID.NOREPLY at gnu.org
Mon Jul 9 13:04:53 PDT 2007
URL:
<http://savannah.gnu.org/bugs/?20414>
Summary: wget: wrong behaviour with -A and -R during
recursive retrieval
Project: GNU Wget
Submitted by: nok
Submitted on: Montag 09.07.2007 um 22:04
Category: Feature Request
Severity: 3 - Normal
Status: None
Privacy: Public
Assigned to: None
Originator Name:
Originator Email:
Open/Closed: Open
Discussion Lock: Any
Release: trunk
Operating System: GNU/Linux
Reproducibility: None
Fixed Release: None
Planned Release: None
Regression: None
Work Required: None
Patch Included: Yes
_______________________________________________________
Details:
Hello,
a forwarded bugreport with a patch from http://bugs.debian.org/294336
"I don't know whether this behaviour is intended or not: during recursive
retrieval, when wget has to decide whether to enqueue or not a discovered
url, it scans accept/reject lists with u->file, instead of u->url. as a
result, it is often wrong about what is to be crawled.
A very short patch is included. However, I am not 100% sure it agrees with
the original author's intended behaviour. For sure, recursive retrievals
might change significantly, so I don't recommend applying this patch without
extensive testing.
regards
g.b.
--- wget-1.9.1/src/recur.c.old 2003-10-11 15:57:11.000000000 +0200
+++ wget-1.9.1/src/recur.c 2005-02-04 18:15:44.000000000 +0100
@@ -525,7 +525,7 @@
&& depth != INFINITE_RECURSION
&& depth < opt.reclevel - 1))
{
- if (!acceptable (u->file))
+ if (!acceptable (url))
{
DEBUGP (("%s (%s) does not match acc/rej rules.\n",
url, u->file));
"
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?20414>
_______________________________________________
Nachricht geschickt von/durch Savannah
http://savannah.gnu.org/
More information about the wget-notify
mailing list