Groups | Search | Server Info | Keyboard shortcuts | Login | Register
Groups > cz.comp.lang.python > #3366
| Path | csiph.com!newsfeed.xs4all.nl!newsfeed7.news.xs4all.nl!news.uzoreto.com!newsreader4.netcologne.de!news.netcologne.de!peer02.ams1!peer.ams1.xlned.com!news.xlned.com!peer02.am4!peer.am4.highwinds-media.com!news.highwinds-media.com!fx02.am4.POSTED!not-for-mail |
|---|---|
| Received-SPF | none (Address does not pass the Sender Policy Framework) SPF=HELO; sender=mail-wm1-f54.google.com; remoteip=::ffff:209.85.128.54; remotehost=; helo=mail-wm1-f54.google.com; receiver=hugo.zagamma.cz; |
| Received-SPF | pass (Address passes the Sender Policy Framework) SPF=MAILFROM; sender=rainbof@gmail.com; remoteip=::ffff:209.85.128.54; remotehost=; helo=mail-wm1-f54.google.com; receiver=hugo.zagamma.cz; |
| DKIM-Signature | v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-transfer-encoding; bh=v9TPCcWT6oAHJpuH0CKAlya3Ue6pzGxY94F0Ny+0oZ4=; b=Rrg5rxIN4SbZ8ji84iL5dds8/BbpKmzleRmK52oDK5rFZvdo3wuIENFXU6gbCPGkfy vSYU5YKDvIFF7I2cpWkvKv3XxUWJtLZyjUV1XJWXQqsrerNIr9XSFzm2JUjpI6opZRlG JHCl7/pq+ne6bHFSUPAatNuNiE+xM+knyZrbYzEUm8ruwGDt7dN7T9FdNwmMTY4eHrAh gPeIcxGaBBY/T76iMM6PAWL9cEKz8yXN2mFivoPlLyu5UzT2TWFeZLNHlmybFtyYNyrm xZhR9OD9XDqMNS2BSrb9g+hDun1SL7JcUzeV1pt2QdSRm+Vjl7ZglesUjkBiZEE4q73P UMQw== |
| X-Google-DKIM-Signature | v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:content-transfer-encoding; bh=v9TPCcWT6oAHJpuH0CKAlya3Ue6pzGxY94F0Ny+0oZ4=; b=mLyvhogJKKnCwincfw5ivPpF0RAEqBXzTWF0UmEYFDxyJcPP1TukquRYdaRUjgv4NV UWOfpDdH3X6OsWKk8LRUAybA/HjtcFLaNwZh8dkUKGePPDUQoiBnPK/tF1iJwKl1Br8d ovD03/eZ2XFv24qQsD1w6RiLBhZ0hxhNfYFxxXc1fHWlOhF1vJY8cpRtyistcYLNU87U aDelf/n88rYaLzLv0HefVaARsUH4KH46C3wbhYyvX4Lv2KUbTaA+Y+HYiZbkKdzcRsct hHwC+ZsmevxdAstX0uQjw22a/uKnlvd1nNrXvyBQN4MvSp9wdPReNlpExX0vqn1vkp9Q UE0Q== |
| X-Gm-Message-State | APjAAAW8BTz2qCfhBm+J8kntUlP4iTF6zr5fRBzeo4P/aNOiQV1XBh8e LPckHoMtgEK4fHINtL8YXhrnURTL9+nbaGUfIOMQmQCAqrY= |
| X-Google-Smtp-Source | APXvYqxNciuls1gLdxZCNI2A9EHhXg+4CXbwY5IU5MMElkkIpBuNMBHiYqU5Fw1mrGnglLP52+gGQZaTpD8SHPX41FE= |
| X-Received | by 2002:a05:600c:1087:: with SMTP id e7mr2958232wmd.19.1565699881711; Tue, 13 Aug 2019 05:38:01 -0700 (PDT) |
| MIME-Version | 1.0 |
| In-Reply-To | <CA+7MNVoPRuTDH9hqO4aUaM114K2tLBWPZtdnXaVvB_go9j+g-A@mail.gmail.com> |
| From | Ondrej Beranek <rainbof@gmail.com> |
| Content-Type | text/plain; charset="UTF-8" |
| Content-Transfer-Encoding | quoted-printable |
| Subject | Re: [python] xml parsování |
| X-BeenThere | python@py.cz |
| X-Mailman-Version | 2.1.29 |
| Precedence | list |
| List-Id | Konference PyCZ <python.py.cz> |
| List-Unsubscribe | <http://www.py.cz/mailman/options/python>, <mailto:python-request@py.cz?subject=unsubscribe> |
| List-Archive | <http://www.py.cz/pipermail/python/> |
| List-Post | <mailto:python@py.cz> |
| List-Help | <mailto:python-request@py.cz?subject=help> |
| List-Subscribe | <http://www.py.cz/mailman/listinfo/python>, <mailto:python-request@py.cz?subject=subscribe> |
| Newsgroups | cz.comp.lang.python |
| Message-ID | <mailman.15.1565699884.5720.python@py.cz> (permalink) |
| X-Mailman-Original-Message-ID | <CAAEb_Q0yh+c7qDRswO86r6++EWJiNRgNVgQ=2d3=x5bbvhrHZg@mail.gmail.com> |
| X-Mailman-Original-References | <6ot.DzVH.657}E95Dld6.1TKbRh@seznam.cz> <CAK9Q5BT2AzSfg2tQomMiWstaQUqved1hH5Yw4dE6YXP5kSyhig@mail.gmail.com> <CAAEb_Q0jT26UfPA9vV_Jgf8=TcNxpzbSmzrs857NcQWtE2euyg@mail.gmail.com> <6B8BDAAB-4F03-404F-9288-972742AE194D@gmail.com> <CA+7MNVoPRuTDH9hqO4aUaM114K2tLBWPZtdnXaVvB_go9j+g-A@mail.gmail.com> |
| References | <6ot.DzVH.657}E95Dld6.1TKbRh@seznam.cz> <CAK9Q5BT2AzSfg2tQomMiWstaQUqved1hH5Yw4dE6YXP5kSyhig@mail.gmail.com> <CAAEb_Q0jT26UfPA9vV_Jgf8=TcNxpzbSmzrs857NcQWtE2euyg@mail.gmail.com> <6B8BDAAB-4F03-404F-9288-972742AE194D@gmail.com> <CA+7MNVoPRuTDH9hqO4aUaM114K2tLBWPZtdnXaVvB_go9j+g-A@mail.gmail.com> <CAAEb_Q0yh+c7qDRswO86r6++EWJiNRgNVgQ=2d3=x5bbvhrHZg@mail.gmail.com> |
| Lines | 114 |
| X-Complaints-To | abuse@usenet-news.net |
| NNTP-Posting-Date | Tue, 13 Aug 2019 12:38:04 UTC |
| Organization | usenet-news.net |
| Date | Tue, 13 Aug 2019 14:38:00 +0200 |
| X-Received-Bytes | 7786 |
| X-Received-Body-CRC | 3305238284 |
| Xref | csiph.com cz.comp.lang.python:3366 |
Show key headers only | View raw
Ahoj,
Trosku to rozvedu, existuje zarizeni co generuje XML po multicast
vysilani. problem je ze ho zapnes a ono jede a jede a jede co mas mas,
co ne jsi ztratil. Kdyz se k nemu pripojis tak ti uz chybi hlavicka a
chytis to v pulce neceho. tak zacnes chytat co mas a postupne skladas
strom. Muze se ale stat (a stava se) ze se tagy kapku pokrizi a misto
koncu vetvi /B /A se obevi /A /B s tim ze mezitim jeste litaji nejaka
data. V takovemto pripade o situaci chci vedet a sam si ji podle uz
nactene urovne osetrit. Cast dat jsou data predana takze si nejsem ani
jist tim ze se struktura nemuze pozmenit...
Aby toho nebylo malo nektere tagy pro jistotu nahodne nekonci vubec
ale da se dopocitat jejich zanoreni. A podobne kratochvile.
Jo a zarizeni nemuzu restartovat.
13.08.19, starenka . <starenka0@gmail.com>:
> tak spatny xml se obcas prihodi, kdyz to nejakej jouda generuje nejak
> obskurne, nebo se ti usekne stream nebo nevimcoeste.... lxml ucrite umi
> podoktorovat html, mozna i xml....
> ---
> In Perl you shoot yourself in the foot, but nobody can understand how you
> did it. Six months later, neither can you. | print 'aknerats'[::-1]
>
>
> On Tue, Aug 13, 2019 at 1:06 PM Petr Messner <petr.messner@gmail.com>
> wrote:
>
>> Jak jako vadný tag? Potom to není XML.
>>
>> Na HTML použij HTML parser.
>>
>> Petr Messner
>>
>> 13. 8. 2019 v 11:30, Ondrej Beranek <rainbof@gmail.com>:
>>
>> > Ahoj,
>> >
>> > když jste to tu tak nakousli, když bych chtěl zpracovávat xml streamy
>> > předem neznámého obsahu, co na to použít ? Potřebuju aby se to
>> > nesložilo když přijde nějaký vadný tag abych měl šanci si to ošetřit.
>> > Tak než začnu psát kolo...
>> >
>> > rainbof
>> >
>> > 13.08.19, Petr Messner <petr.messner@gmail.com>:
>> >> V těchto případech je asi nejednodušší xpath. Minimálně lxml ho umí.
>> >>
>> >> PM
>> >>
>> >> út 13. 8. 2019 v 8:22 odesílatel Ctibor.Plasek
>> >> <Ctibor.Plasek@seznam.cz
>> >
>> >> napsal:
>> >>
>> >>> Ahoj.
>> >>> Mám takovýto xml soubor (viz.přiložený soubor) - export z ffprobe.
>> >>> Pro ty hodnoty, které jsou u <stream používám toto:
>> >>>
>> >>> from xml.dom import minidom
>> >>> xmldoc = minidom.parse('RainS02E06.mkv.xml')
>> >>> streamlist = xmldoc.getElementsByTagName('stream')
>> >>> for s in streamlist:
>> >>> print(s.attributes['index'].value)
>> >>> print(s.attributes['codec_name'].value)
>> >>> try:
>> >>> print(s.attributes['sample_rate'].value)
>> >>> except:
>> >>> print('none')
>> >>>
>> >>> ale nevím jakým způsobem se dostat k těm "hlubším" hodnotám - např.
>> >>> <tag key="DURATION-eng" - jaká je např.hodnota value u stream
>> >>> index="1"
>> >>>
>> >>> Mohli byste mi, prosím, poradit?
>> >>>
>> >>> Děkuji Ctibor
>> >>>
>> >>>
>> >>>
>> >>>
>> >>> _______________________________________________
>> >>> Python mailing list
>> >>> python@py.cz
>> >>> http://www.py.cz/mailman/listinfo/python
>> >>>
>> >>> Visit: http://www.py.cz
>> >>>
>> >>
>> > _______________________________________________
>> > Python mailing list
>> > python@py.cz
>> > http://www.py.cz/mailman/listinfo/python
>> >
>> > Visit: http://www.py.cz
>> _______________________________________________
>> Python mailing list
>> python@py.cz
>> http://www.py.cz/mailman/listinfo/python
>>
>> Visit: http://www.py.cz
>>
>
Back to cz.comp.lang.python | Previous | Next | Find similar
Re: [python] xml parsování Ondrej Beranek <rainbof@gmail.com> - 2019-08-13 14:38 +0200
csiph-web