Path: csiph.com!news.mixmin.net!weretis.net!feeder8.news.weretis.net!fu-berlin.de!bofh.it!news.nic.it!robomod From: Dmitry Shachnev Newsgroups: linux.debian.maint.python Subject: Re: URL mangling in https://pypi.debian.net/ Date: Wed, 18 Dec 2024 10:20:02 +0100 Message-ID: References: X-Original-To: Alexandre Detiste X-Mailbox-Line: From debian-python-request@lists.debian.org Wed Dec 18 09:19:20 2024 Old-Return-Path: X-Amavis-Spam-Status: No, score=-11.398 tagged_above=-10000 required=5.3 tests=[BAYES_00=-2, FOURLA=0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FORGED_FROMDOMAIN=0.001, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.249, LDO_WHITELIST=-5, PGPSIGNATURE=-5, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_VALIDITY_CERTIFIED_BLOCKED=0.001, RCVD_IN_VALIDITY_RPBL_BLOCKED=0.001] autolearn=ham autolearn_force=no X-Policyd-Weight: NOT_IN_SBL_XBL_SPAMHAUS=-1.5 CL_IP_EQ_HELO_IP=-2 (check from: .gmail. - helo: .mail-lf1-f42.google. - helo-domain: .google.) FROM/MX_MATCHES_HELO(DOMAIN)=-2; rate: -5.5 X-Gm-Message-State: AOJu0YzE3XkK/oS51u+hmkqCYmapL5/nr7XRxJGHTXrZCdzE0iEYZUbe Dhk+Lc7AxCgL7F5c0arIj+yT5lkxhMON5yFv1dgDsq20l40RjMLyrK4VMvrq X-Gm-Gg: ASbGncuOaZXQYFh2ScALV2BYqJsWSvk36VFIJuVEl/R96HW7RMWxxOulIrFYD/bmU6x xYy3u1E056g2VzCqa0f8PS1+hM/8qPwkDlLfrLtHG7nt6rrRcngcGsA1ukoH3Ip5CpTUDa76dLi O5ojgn1PptNHeOvHj57UnlQzdcopazYWsUBmc3dXQDNqZXQq3MN4gVa7ZBfPhvW32sThmND5Z11 rUDh2PAX010glG1dFWXzAVgBPvN+3ThAh51QCY5G1l9l7LOA8xkwsG+YXbWvip7Em9qc1cHjX98 WEgfHv0= X-Google-SMTP-Source: AGHT+IHfzlAnfUnbe6L0yeEUefyOHF+OI35NapWQAXOXPJ8ZGxYkHjL1c303Otb7FXhg0aX8kQFicw== X-Received: by 2002:a05:6512:2245:b0:53e:383c:dfa9 with SMTP id 2adb3069b0e04-541e674ccf9mr840757e87.30.1734513544660; Wed, 18 Dec 2024 01:19:04 -0800 (PST) MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha512; protocol="application/pgp-signature"; boundary="tUS/j4s4VKg2N5v6" Content-Disposition: inline X-Mailing-List: archive/latest/22679 List-ID: List-URL: List-Archive: https://lists.debian.org/msgid-search/Z2KTtHnBVM9oxTQN@mitya57.me Approved: robomod@news.nic.it Lines: 69 Organization: linux.* mail to news gateway Sender: robomod@news.nic.it X-Original-Cc: Debian Python X-Original-Date: Wed, 18 Dec 2024 12:19:48 +0300 X-Original-Message-ID: X-Original-References: Xref: csiph.com linux.debian.maint.python:16587 --tUS/j4s4VKg2N5v6 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Hi Alexandre! On Tue, Dec 17, 2024 at 11:57:18PM +0100, Alexandre Detiste wrote: > Hi, > > I've noticed a recent pattern with archives published on PyPi : > the "-" we expect in the regexp specified in d/watch is now an underscore. > > So the tracker got the false information that everything is up-to-date > > With some horribly wretched code I can find some projects with updates pe= nding. > https://paste.debian.net/1340327/ > > One field got duplicated in the output but I'm not running > the code again immediately because it can be considered abuse > by who run pypi.debian.net. > > Ideas ? I think pypi.debian.net does not mangle the file names in any way, it just takes them from upstream PyPI verbatim. And the change from - to _ is caused by more build tools adopting this specification [1], which says: =E2=80=9CIn distribution names, any run of -_. characters (HYPHEN-MINUS, LO= W LINE and FULL STOP) should be replaced with _ (LOW LINE), and uppercase characters should be replaced with corresponding lowercase ones.=E2=80=9D This link is for binary distributions, but there is a separate specification for source distributions [2] which says that rules are the same. [1]: https://packaging.python.org/en/latest/specifications/binary-distribut= ion-format/#escaping-and-unicode [2]: https://packaging.python.org/en/latest/specifications/source-distribut= ion-format/#source-distribution-file-name -- Dmitry Shachnev --tUS/j4s4VKg2N5v6 Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQIzBAABCgAdFiEEq2sdvrA0LydXHe1qsmYUtFL0RrYFAmdik7EACgkQsmYUtFL0 RrYs9BAAtf7Las8wm5glDcUS6w6OEhnP1Ei00M49hAFtuDcvcibLraW1yfMIa0io 9SDjnivZeR6ap53cfxTEnY/f74czteLZaS8xJcbfI+m6htF45OkkUic03h7zWdr5 ERHvWSAIgl00i6fSTATlxo//fbtutu3qOzxlGJ3Be0gxIDzgxmVL6vaa9RR9/zgR Bu16Am4nHqpQM7qKDBpHTs1D5Nr8bBoad7yW4LSnPb9JJP1GgnINLg6fzGES1GE7 beK5f5qqcxAu8MVcG16Z28X+AnM3GKpq4OslaIaUDG+6xvLZLcdGtmSLwZtHJhq1 A0U4/Sqoj3k7YZYfvmCcPeO5H86HIwggbFaWTVzl67qaY1Ih6CNDpk0ZgFp1khMH RXwP3BFiMFMn+1kiAF9XRkHQc7B1PdXhU8ENfe/nhW5mbx+0szVXTb8r7dEavxWp sZ1XsX6E72Anol+qE8KMDCOIm3nSUUY3dvjpI1XbSjK5g09FXYGHI/XDedsu4/Ng o9piMzDOuHZ02adpou7WricVYa9NMgDiRxEdkNdp/+ttCEuVMuWeDjkvCtM3sRk8 Rvvtq67tftYy4ZJ0BC6LEatO8yzebMkIQIzkDqSJm7V1RSq4DahwMi5Axw4GkbwF CpwogUpJZ+ixrZVRUCJv4rCdrstjUanfoYSnDFfacr3dEpJKcAA= =A57/ -----END PGP SIGNATURE----- --tUS/j4s4VKg2N5v6--