Wget not doing what I expect.

Joshua Lutes joshua at lutes.me
Tue Feb 17 10:16:27 MST 2009


Thanks Kimball,

That, combined with the pattern matching of the accept is perfect and
everything I need.

Joshua

On Mon, Feb 16, 2009 at 3:22 PM, Kimball Larsen
<kimball at kimballlarsen.com> wrote:
> Ok, I got it - you need to specify -H so that wget will span hosts:
>
> wget -r -l1 -A.mp3 -nd -H
> http://lds.org/conference/sessions/display/0,5239,23-1-947,00.html
>
> That works for me.
>
>
> - Kimball
> http://www.kimballlarsen.com
>
> On Feb 15, 2009, at 11:03 AM, Joshua Lutes wrote:
>
>> Thanks,
>>
>> The odd thing is that I tried it without the -A.mp3 and it downloaded
>> 106 files, some of them pdfs but none of them mp3.  Perplexing.
>>
>> Joshua
>>
>> On Sun, Feb 15, 2009 at 10:45 AM, Joseph Hall <joseph at thatworks.com>
>> wrote:
>>>
>>> On Sun, Feb 15, 2009 at 9:19 AM, Joshua Lutes <joshua at lutes.me> wrote:
>>>>
>>>> I am trying to download all of the mp3 files from the general
>>>> conference sessions that the church has on its website.  I think the
>>>> following command should do it, but it doesn't.
>>>>
>>>> wget -r -l1 -A.mp3 -nd
>>>> http://lds.org/conference/sessions/display/0,5239,23-1-947,00.html
>>>>
>>>> So, recurse down one level, only save files that are .mp3, don't copy
>>>> the directory structure and start at the most recent general
>>>> conference page.  What do I not know that is causing the following
>>>> output to come?
>>>
>>> My guess: You told wget only to save .mp3 files, and
>>> 0,5239,23-1-947,00.html isn't an .mp3 file.
>>>
>>> I couldn't get wget to handle it by itself either, but this will do
>>> what you want:
>>>
>>> elinks -dump
>>> http://lds.org/conference/sessions/display/0,5239,23-1-947,00.html
>>> | grep mp3 | sed 's/.*http/http/' | wget -i -
>>>
>>> Yes, that is a hyphen by itself right after the -i, don't forget it or
>>> you'll be sad.
>>>
>>> --
>>> Joseph
>>> http://blog.josephhall.com/
>>>
>>> /*
>>> PLUG: http://plug.org, #utah on irc.freenode.net
>>> Unsubscribe: http://plug.org/mailman/options/plug
>>> Don't fear the penguin.
>>> */
>>>
>>
>> /*
>> PLUG: http://plug.org, #utah on irc.freenode.net
>> Unsubscribe: http://plug.org/mailman/options/plug
>> Don't fear the penguin.
>> */
>
>
> /*
> PLUG: http://plug.org, #utah on irc.freenode.net
> Unsubscribe: http://plug.org/mailman/options/plug
> Don't fear the penguin.
> */
>



More information about the PLUG mailing list