Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #60471

Re: parsing nested unbounded XML fields with ElementTree

Path csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!news.mixmin.net!feeds.phibee-telecom.net!newsfeed.xs4all.nl!newsfeed1.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
Return-Path <larry.martell@gmail.com>
X-Original-To python-list@python.org
Delivered-To python-list@mail.python.org
X-Spam-Status OK 0.075
X-Spam-Evidence '*H*': 0.85; '*S*': 0.00; 'xml,': 0.05; 'string': 0.09; 'subject:fields': 0.09; 'subject:parsing': 0.09; 'cc:addr :python-list': 0.11; '&gt;&gt;': 0.16; 'archives,': 0.16; 'maliciously': 0.16; 'nesting': 0.16; 'preserve': 0.16; 'quoted': 0.16; 'spacing': 0.16; 'subject:XML': 0.16; 'tags.': 0.16; 'email addr:python.org,': 0.16; 'wrote:': 0.18; 'all,': 0.19; 'later': 0.20; 'not,': 0.20; 'email addr:gmail.com&gt;': 0.22; 'cc:addr:python.org': 0.22; 'parse': 0.24; 'initial': 0.24; 'mon,': 0.24; 'people,': 0.24; 'cc:2**0': 0.24; 'cc:no real name:2**0': 0.24; "i've": 0.25; '&gt;': 0.26; 'posts': 0.26; 'this:': 0.26; 'post': 0.26; 'defined': 0.27; 'header:In-Reply- To:1': 0.27; 'point': 0.28; 'specifically': 0.29; 'fixed': 0.29; 'chris': 0.29; 'am,': 0.29; 'generally': 0.29; 'thus': 0.29; 'xml': 0.29; "doesn't": 0.30; 'message-id:@mail.gmail.com': 0.30; "i'm": 0.30; 'went': 0.31; 'software,': 0.31; 'too.': 0.31; '(since': 0.31; '25,': 0.31; 'larry': 0.31; 'file': 0.32; 'skip:m 30': 0.32; 'quite': 0.32; 'text': 0.33; 'monday,': 0.33; 'noticed': 0.34; 'subject:with': 0.35; 'case,': 0.35; 'but': 0.35; 'received:google.com': 0.35; 'google': 0.35; 'there': 0.35; 'really': 0.36; 'replies': 0.36; "didn't": 0.36; 'thanks': 0.36; 'list': 0.37; 'being': 0.38; 'skip:& 10': 0.38; 'nov': 0.38; 'pm,': 0.38; 'skip:& 20': 0.39; 'sure': 0.39; 'mailing': 0.39; 'users': 0.40; 'even': 0.60; 'read': 0.60; 'easy': 0.60; 'problems.': 0.60; 'information,': 0.61; 'first': 0.61; "you'll": 0.62; 'show': 0.63; 'term': 0.63; 'more': 0.64; 'email name :python-list': 0.65; 'to:addr:gmail.com': 0.65; 'news': 0.67; '26,': 0.68; 'subject': 0.69; 'groups.': 0.74; 'saw': 0.77; '9:45': 0.84; 'deja': 0.84; 'off,': 0.84; 'replied.': 0.84; 'unaware': 0.84; 'carries': 0.91; 'good,': 0.91; 'on?': 0.91; 'fight': 0.97; '2013': 0.98
DKIM-Signature v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=9C93fKVmLncctqHV0U9Qlu+Jp5u4vpRA0+0H8Th/U/4=; b=gD6hqcPIDDiFZMiKqoLKqUu0rVG6k8HEFiGXyAw+iI2tQen/Zg+3Ge2YdN7TfxEGID 4eWaozzcLSVZVq8hvavgg+gKyxPmEnf6zTQ8ji7nJZxdHRVu2IAI3XxLloUaWmW2aIHt y/vTSIsPhGH8fWK8GH/QPzwz0YqXCRwE6Xcm8NbCJ1oSu4wESTkwFnzZxLOd0pWabxKN 3a/g7staO8o1Uu2dBlQjgoMN0D9dQKXoPgn3d/t5lz3Iqao4Z4bOXCNQ4J7ZZQ2zyxXp rPCr2jW0A5ReiUA3pahZi1M5mQ7wxja/exAFDp3e4diNjGmtVQ0m0zPEUhFEwjpfFl9k mjNw==
MIME-Version 1.0
X-Received by 10.180.185.130 with SMTP id fc2mr15450485wic.43.1385421955122; Mon, 25 Nov 2013 15:25:55 -0800 (PST)
In-Reply-To <CAPTjJmp4Tb9UjeK=t7YzX3Rc72KV0oRSk86ZdVmkkfpc07uybg@mail.gmail.com>
References <CACwCsY7iXJ_1OB2yzMTrP0i60skoBk9smwYuDd6LVpMO+ozbGg@mail.gmail.com> <CAPTjJmp4Tb9UjeK=t7YzX3Rc72KV0oRSk86ZdVmkkfpc07uybg@mail.gmail.com>
Date Mon, 25 Nov 2013 18:25:55 -0500
Subject Re: parsing nested unbounded XML fields with ElementTree
From Larry Martell <larry.martell@gmail.com>
To Chris Angelico <rosuav@gmail.com>
Content-Type multipart/alternative; boundary=001a11c350da7fcc2204ec08af2f
Cc python-list@python.org
X-BeenThere python-list@python.org
X-Mailman-Version 2.1.15
Precedence list
List-Id General discussion list for the Python programming language <python-list.python.org>
List-Unsubscribe <https://mail.python.org/mailman/options/python-list>, <mailto:python-list-request@python.org?subject=unsubscribe>
List-Archive <http://mail.python.org/pipermail/python-list/>
List-Post <mailto:python-list@python.org>
List-Help <mailto:python-list-request@python.org?subject=help>
List-Subscribe <https://mail.python.org/mailman/listinfo/python-list>, <mailto:python-list-request@python.org?subject=subscribe>
Newsgroups comp.lang.python
Message-ID <mailman.3208.1385423842.18130.python-list@python.org> (permalink)
Lines 128
NNTP-Posting-Host 2001:888:2000:d::a6
X-Trace 1385423842 news.xs4all.nl 15949 [2001:888:2000:d::a6]:35548
X-Complaints-To abuse@xs4all.nl
Xref csiph.com comp.lang.python:60471

Show key headers only | View raw


[Multipart message — attachments visible in raw view] - view raw

On Mon, Nov 25, 2013 at 6:19 PM, Chris Angelico <rosuav@gmail.com> wrote:

> On Tue, Nov 26, 2013 at 9:45 AM, Larry Martell <larry.martell@gmail.com>
> wrote:
> > On Monday, November 25, 2013 5:30:44 PM UTC-5, Chris Angelico wrote:
> >
> >> First off, please clarify: Are there five corresponding </Node> tags
> >> later on? If not, it's not XML, and nesting will have to be defined
> >> some other way.
> >
> > Yes, there are corresponding </Node> tags. I just didn't show them.
>
> Good good, I just saw the "unbounded" in your subject line and got
> worried :) I'm pretty sure there's a way to parse that will preserve
> the current nesting information, but others can describe that better
> than I can.
>

The term 'unbounded' is used in the XML xsd file like this:

<xs:sequence maxOccurs="unbounded">


> >> Secondly, please get off Google Groups. Your initial post is
> >> malformed, and unless you specifically fight the software, your
> >> replies will be even more malformed, to the point of being quite
> >> annoying. There are many other ways to read a newsgroup, or you can
> >> subscribe to the mailing list python-list@python.org, which carries
> >> the same content.
> >
> > Not sure what you mean by malformed. I don't really care for Google
> Groups,
> > but I've been using it to post to this any other groups for years (since
> rn
> > and deja news went away) and no one ever said my posts were malformed. In
> > any case, I did not know the group was available as a ML. I've subbed to
> > that and will post that way.
>
> The mailing list works well for me too. Google Groups is deceptively
> easy for a lot of people, but if you look through the list's archives,
> you'll see that the posts it makes are unwrapped (and thus string out
> to the right an arbitrary length), and all quoted text is
> double-spaced, among other problems. Its users are generally unaware
> of this, and like you are not maliciously inflicting that on us all,
> but that doesn't make it any less painful to read :) Thanks for
> switching.
>
>
I had noticed the double spacing and I always fixed that when I replied.

Back to comp.lang.python | Previous | NextNext in thread | Find similar | Unroll thread


Thread

Re: parsing nested unbounded XML fields with ElementTree Larry Martell <larry.martell@gmail.com> - 2013-11-25 18:25 -0500
  Curing google groups issues (was parsing nested unbounded XML…) rusi <rustompmody@gmail.com> - 2013-11-28 05:31 -0800

csiph-web