Re: Create an index from a webpage [RANT, DNFTT]

Date 2011-09-09 09:40 +1000
From Simon Cropper <simoncropper@fossworkflowguides.com>
Subject Re: Create an index from a webpage [RANT, DNFTT]
References <mailman.874.1315484806.27778.python-list@python.org> <1537032.qVoOGUtdWV@PointedEars.de> <4e68db21$0$30002$c3e8da3$5496439d@news.astraweb.com>
Newsgroups comp.lang.python
Message-ID <mailman.886.1315525252.27778.python-list@python.org>


On 09/09/11 01:11, Steven D'Aprano wrote:
> [SNIP]
> It's no harder to put the search terms into a google URL, which still gets
> the point across without being a dick about it:
> [SNIP]

[RANT]

OK I was not going to say anything but...

1. Being told to google-it when I explicitly stated in my initial post 
that I had been doing this and had not been able to find anything is 
just plain rude. It is unconstructive and irritating.

2. I presume that python-list is a mailing list for Python users -
beginners, intermediate and advanced. If it is not, then tell me and I
will go somewhere else.

3. Some searches, particularly for common terms, throw up millions of
hits. 'Python' returns 147,000,000 results on Google, 'Sitemap' returns
1,410,000,000 results. Even 'Python AND Sitemap' still returns 5,020
results. Working through these links takes you round and round with no
clear solutions. Asking for help on the primary Python mailing list --
after conducting a preliminary investigation for tools, libraries and
code snippets -- seemed legitimate.

4. AND YES, I could write a program, but why recreate code when there is
a strong likelihood that the code already exists? One of the advantages
of Python is that a lot of code is redistributed under licences that
promote reuse. So why reinvent the wheel when there is a library full of
code? Sometimes you just need help finding the door.

5. If someone is willing to help me, rather than lecture me (or poke me
to see if they get a response), I would appreciate it.

[END RANT]

For people who are willing to help, my original request was...

I am after a way of pointing a python routine to my website and have it
create a tree, represented as a hierarchical HTML list in a webpage, of
all the pages in that website (recursive list of internal links to HTML
documents; ignore images, etc.).

In subsequent notes to Thomas 'PointedEars'...

I pointed to an example of the desired output here 
http://lxml.de/sitemap.html
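
The request above (follow internal links recursively, keep only HTML
pages, emit a nested HTML list) could be approached roughly as follows.
This is a minimal sketch using only the Python 3 standard library, not
an existing sitemap tool from the thread. The names LinkParser, fetch,
crawl and to_html are made up for illustration, and the sketch assumes
a small site: there is no depth limit, no robots.txt handling, and each
page is listed once, under the first page that links to it.

```python
from html import escape
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def fetch(url):
    """Return the page text, or "" if it is not HTML or cannot be read."""
    try:
        with urlopen(url, timeout=10) as resp:
            # Skip images, PDFs, etc. by checking the Content-Type header.
            if resp.headers.get_content_type() != "text/html":
                return ""
            return resp.read().decode("utf-8", errors="replace")
    except OSError:  # URLError and HTTPError are subclasses of OSError
        return ""


def crawl(url, fetch_page=fetch, seen=None):
    """Return a (url, children) tree of same-host pages reachable from url."""
    seen = {url} if seen is None else seen
    parser = LinkParser()
    parser.feed(fetch_page(url))
    host = urlparse(url).netloc
    children = []
    for href in parser.links:
        # Resolve relative links and drop any #fragment.
        target = urldefrag(urljoin(url, href)).url
        # Ignore external hosts, mailto: links, and pages already listed.
        if urlparse(target).netloc != host or target in seen:
            continue
        seen.add(target)
        children.append(crawl(target, fetch_page, seen))
    return (url, children)


def to_html(node):
    """Render a (url, children) tree as a nested <li> element."""
    url, children = node
    item = '<li><a href="{0}">{0}</a>'.format(escape(url))
    if children:
        item += "<ul>" + "".join(to_html(c) for c in children) + "</ul>"
    return item + "</li>"
```

Wrapping the result in a list, for example
"<ul>" + to_html(crawl("http://www.example.com/")) + "</ul>" with a
placeholder URL, gives a fragment that can be pasted into a page. The
fetch_page argument exists only so the crawler can be tried against
canned pages without touching the network.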

-- 
Cheers Simon

    Simon Cropper - Open Content Creator / Website Administrator

    Free and Open Source Software Workflow Guides
    ------------------------------------------------------------
    Introduction               http://www.fossworkflowguides.com
    GIS Packages               http://gis.fossworkflowguides.com
    bash / Python        http://scripting.fossworkflowguides.com
