


Re: URL mangling in https://pypi.debian.net/

From Dmitry Shachnev <mitya57@debian.org>
Newsgroups linux.debian.maint.python
Subject Re: URL mangling in https://pypi.debian.net/
Date Wed, 18 Dec 2024 10:20:02 +0100
Message-ID <JUYnM-sha-11@gated-at.bofh.it> (permalink)
References <JUOHM-jAq-7@gated-at.bofh.it>
X-Original-To Alexandre Detiste <alexandre.detiste@gmail.com>
X-Mailing-List <debian-python@lists.debian.org> archive/latest/22679
List-ID <debian-python.lists.debian.org>
List-URL <https://lists.debian.org/debian-python/>
List-Archive https://lists.debian.org/msgid-search/Z2KTtHnBVM9oxTQN@mitya57.me
X-Original-Cc Debian Python <debian-python@lists.debian.org>
X-Original-Date Wed, 18 Dec 2024 12:19:48 +0300
X-Original-Message-ID <Z2KTtHnBVM9oxTQN@mitya57.me>
X-Original-References <CADsTwj+SAGSUtggMXfPEybJsfpZ9MXabsPqNH+oQUc0ewtdP-Q@mail.gmail.com>
Xref csiph.com linux.debian.maint.python:16587




Hi Alexandre!

On Tue, Dec 17, 2024 at 11:57:18PM +0100, Alexandre Detiste wrote:
> Hi,
>
> I've noticed a recent pattern with archives published on PyPi :
> the "-" we expect in the regexp specified in d/watch is now an underscore.
>
> So the tracker got the false information that everything is up-to-date
>
> With some horribly wretched code I can find some projects with updates pending.
>   https://paste.debian.net/1340327/
>
> One field got duplicated in the output but I'm not running
> the code again immediately because it can be considered abuse
> by who run pypi.debian.net.
>
> Ideas ?

I think pypi.debian.net does not mangle the file names in any way; it just
takes them from upstream PyPI verbatim.

And the change from - to _ is caused by more build tools adopting this
specification [1], which says:

“In distribution names, any run of -_. characters (HYPHEN-MINUS, LOW LINE and
FULL STOP) should be replaced with _ (LOW LINE), and uppercase characters
should be replaced with corresponding lowercase ones.”

This link is for binary distributions, but there is a separate specification
for source distributions [2], which says that the rules are the same.

[1]: https://packaging.python.org/en/latest/specifications/binary-distribution-format/#escaping-and-unicode
[2]: https://packaging.python.org/en/latest/specifications/source-distribution-format/#source-distribution-file-name
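
To make the quoted rule concrete, here is a minimal Python sketch. The
normalization function follows the quoted text directly. The second helper,
tolerant_sdist_pattern, is hypothetical: it is not part of uscan or
pypi.debian.net, and it assumes a plain "name-version.tar.gz" sdist file name.
It only illustrates how a pattern could accept both the old and the new
spelling.

```python
import re


def normalize_dist_name(name: str) -> str:
    """Escape a distribution name as the quoted rule describes."""
    # Collapse any run of '-', '_' or '.' into a single '_', then lowercase.
    return re.sub(r"[-_.]+", "_", name).lower()


def tolerant_sdist_pattern(name: str) -> re.Pattern:
    """Build a regex matching an sdist file name in either spelling.

    Hypothetical helper for illustration only; assumes a .tar.gz suffix.
    """
    # Split the project name on separator runs and rejoin the parts with a
    # character class, so 'foo-bar', 'foo_bar' and 'foo.bar' all match.
    parts = re.split(r"[-_.]+", name)
    stem = "[-_.]+".join(re.escape(part) for part in parts)
    # The version is captured loosely as everything before the suffix.
    return re.compile(rf"^{stem}-(?P<version>.+)\.tar\.gz$", re.IGNORECASE)


if __name__ == "__main__":
    print(normalize_dist_name("Foo-Bar.baz"))
    pattern = tolerant_sdist_pattern("foo-bar")
    for filename in ("foo-bar-1.2.tar.gz", "foo_bar-1.3.tar.gz"):
        match = pattern.match(filename)
        print(filename, "->", match.group("version") if match else None)
```

A d/watch regexp that hard-codes "-" inside the project name would miss the
second file name above, which would be consistent with the tracker reporting
everything as up-to-date.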

--
Dmitry Shachnev



