Re: Wikipedia XML Dump

References <9ec53bc0-f2da-46f4-ad58-2c9a75653dbf@googlegroups.com> <03d8894a-417c-4445-aeb5-f0b1003ca5eb@googlegroups.com>
Date 2014-01-28 12:15 -0600
Subject Re: Wikipedia XML Dump
From Skip Montanaro <skip@pobox.com>
Newsgroups comp.lang.python
Message-ID <mailman.6077.1390932918.18130.python-list@python.org>


> Another point:
> SAX is painful to use compared to full lxml (DOM).
> But then SAX is the only choice when files cross a certain size.
> That's why the above question.

Whatever XML parser you choose, I suspect you'll want to convert the
data to some other form for processing.
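
As a rough illustration of that conversion step, here is a minimal sketch
that streams a dump with the standard library's
xml.etree.ElementTree.iterparse and turns each page into a plain dict.
The element names (page, title, text), the sample data, and the helper
names local_name and iter_pages are assumptions made for the example,
not something taken from the actual dump schema, so check them against
the real file first.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip a '{namespace}' prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield one plain dict per <page> element, parsing incrementally."""
    for event, elem in ET.iterparse(source, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        record = {"title": None, "text": None}
        # Walk the finished <page> subtree and keep the first title/text seen.
        for child in elem.iter():
            name = local_name(child.tag)
            if name in record and record[name] is None:
                record[name] = child.text
        yield record
        # Empty the subtree we just converted so its contents can be freed.
        elem.clear()


# Tiny stand-in for a dump; the element names and namespace are assumptions.
SAMPLE = b"""<mediawiki xmlns="http://example.invalid/export">
  <page><title>A</title><revision><text>alpha</text></revision></page>
  <page><title>B</title><revision><text>beta</text></revision></page>
</mediawiki>"""

pages = list(iter_pages(io.BytesIO(SAMPLE)))
```

Once the records are ordinary dicts, they can be written out as JSON
lines, loaded into a database, or handed to whatever does the real
processing, and the XML parser no longer matters much.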

Skip
