[wget-notify] [bug #20414] wget: wrong behaviour with -A and -R during recursive retrieval

NoèlKöthe INVALID.NOREPLY at gnu.org
Mon Jul 9 13:04:53 PDT 2007


URL:
  <http://savannah.gnu.org/bugs/?20414>

                 Summary: wget: wrong behaviour with -A and -R during
recursive retrieval
                 Project: GNU Wget
            Submitted by: nok
            Submitted on: Montag 09.07.2007 um 22:04
                Category: Feature Request
                Severity: 3 - Normal
                  Status: None
                 Privacy: Public
             Assigned to: None
         Originator Name: 
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
                 Release: trunk
        Operating System: GNU/Linux
         Reproducibility: None
           Fixed Release: None
         Planned Release: None
              Regression: None
           Work Required: None
          Patch Included: Yes

    _______________________________________________________

Details:

Hello,

a forwarded bugreport with a patch from http://bugs.debian.org/294336

"I don't know whether this behaviour is intended or not:  during recursive
retrieval, when wget has to decide whether to enqueue or not a discovered
url, it scans accept/reject lists with u->file, instead of u->url. as a
result, it is often wrong about what is to be crawled.

A very short patch is included.  However, I am not 100% sure it agrees with
the original author's intended behaviour.  For sure, recursive retrievals
might change significantly, so I don't recommend applying this patch without
extensive testing.

regards
g.b.

--- wget-1.9.1/src/recur.c.old	2003-10-11 15:57:11.000000000 +0200
+++ wget-1.9.1/src/recur.c	2005-02-04 18:15:44.000000000 +0100
@@ -525,7 +525,7 @@
 	   && depth != INFINITE_RECURSION
 	   && depth < opt.reclevel - 1))
     {
-      if (!acceptable (u->file))
+      if (!acceptable (url))
 	{
 	  DEBUGP (("%s (%s) does not match acc/rej rules.\n",
 		   url, u->file));

"




    _______________________________________________________

Reply to this item at:

  <http://savannah.gnu.org/bugs/?20414>

_______________________________________________
  Nachricht geschickt von/durch Savannah
  http://savannah.gnu.org/



More information about the wget-notify mailing list