Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #73666

Re: How to extract contents of inner text of html tag?

Newsgroups comp.lang.python
Date 2014-06-27 10:36 -0700
References <mailman.7534.1393704366.18130.python-list@python.org>
Message-ID <34303ae2-c719-407d-bf83-744ccf05c21c@googlegroups.com> (permalink)
Subject Re: How to extract contents of inner text of html tag?
From Jesse Adam <jaahush@gmail.com>

Show all headers | View raw


I don't have BeautifulSoup installed so I am unable to tell whether

a) for line in all_kbd:
processes one line at a time as given in the input, or do you get the clean
text in single lines in a list as shown in the example in the doc 
http://www.crummy.com/software/BeautifulSoup/bs4/doc/#searching-the-tree


b) for inside_line in line:
  Does this process one token at a time? 

In any case, it looks like the reason you got "None" in the output is 
because you assume that every single line contains <code> and </code> tags.
This may not be case all the time, so, prior to printing extract_code
perhaps you could check whether that is None.

Back to comp.lang.python | Previous | Next | Find similar | Unroll thread


Thread

Re: How to extract contents of inner text of html tag? Jesse Adam <jaahush@gmail.com> - 2014-06-27 10:36 -0700

csiph-web