
Bug#1118359: ITP: python-ftfy -- Fixes mojibake and other Unicode text problems

Path csiph.com!eternal-september.org!feeder.eternal-september.org!2.eu.feeder.erje.net!3.eu.feeder.erje.net!feeder.erje.net!news.roellig-ltd.de!open-news-network.org!weretis.net!feeder8.news.weretis.net!fu-berlin.de!bofh.it!news.nic.it!robomod
From Edward Betts <edward@4angle.com>
Newsgroups linux.debian.bugs.dist, linux.debian.devel, linux.debian.maint.python
Subject Bug#1118359: ITP: python-ftfy -- Fixes mojibake and other Unicode text problems
Date Sat, 18 Oct 2025 16:50:01 +0200
Message-ID <LHgpP-68B1-1@gated-at.bofh.it> (permalink)
X-Original-To Debian Bug Tracking System <submit@bugs.debian.org>
X-Mailbox-Line From debian-bugs-dist-request@lists.debian.org Sat Oct 18 14:47:09 2025
Old-Return-Path <debbugs@buxtehude.debian.org>
X-Spam-Flag NO
X-Spam-Score -2.399
Reply-To Edward Betts <edward@4angle.com>, 1118359@bugs.debian.org
Resent-To debian-bugs-dist@lists.debian.org
Resent-Cc debian-devel@lists.debian.org, debian-python@lists.debian.org, wnpp@debian.org
X-Debian-Pr-Message report 1118359
X-Debian-Pr-Package wnpp
MIME-Version 1.0
Content-Type text/plain; charset=us-ascii
Content-Disposition inline
X-Debian-Message from BTS
X-Mailing-List <debian-bugs-dist@lists.debian.org> archive/latest/1931896
List-ID <debian-bugs-dist.lists.debian.org>
List-URL <https://lists.debian.org/debian-bugs-dist/>
Approved robomod@news.nic.it
Lines 28
Organization linux.* mail to news gateway
Sender robomod@news.nic.it
X-Original-Date Sat, 18 Oct 2025 15:45:51 +0100
X-Original-Message-ID <nwrs4awrw4ae6ji55zucqovto2splgosr5ay25a53k652hcsht@s4pgkacyf2nz>
Xref csiph.com linux.debian.bugs.dist:1266559 linux.debian.devel:119294 linux.debian.maint.python:17113

Cross-posted to 3 groups.



Package: wnpp
Severity: wishlist
Owner: Edward Betts <edward@4angle.com>
X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org

* Package name    : python-ftfy
  Version         : 6.3.1
  Upstream Author : Robyn Speer <rspeer@arborelia.net>
* URL             : https://github.com/rspeer/python-ftfy
* License         : Apache-2.0
  Programming Lang: Python
  Description     : Fixes mojibake and other Unicode text problems

  This library automatically repairs text that has been corrupted by misapplied
  character encodings, a problem commonly known as mojibake. It analyzes strings
  to identify cases where characters were decoded with the wrong encoding and
  reconstructs the intended Unicode text. This includes fixing multiple layers
  of encoding errors, handling curly quote characters, and decoding HTML
  entities that appear outside of proper HTML contexts, even with unusual
  capitalization. The library is designed to avoid making unnecessary or
  incorrect changes to text that is already correctly encoded. It helps restore
  readability in content that has been mangled by data handling and transfer
  processes, such as those involving databases, spreadsheets, or output from
  web sources. It does not attempt to detect encodings from scratch; it focuses
  on repairing commonly encountered forms of corrupted Unicode text.
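
For readers unfamiliar with the term, here is a rough illustration of what
mojibake is and how a single layer (or several layers) of it can be reversed.
This is only a sketch using the Python standard library, not ftfy's actual
implementation: the function names `make_mojibake` and `undo_mojibake` are
hypothetical, and the sketch assumes the wrong encoding was Latin-1, whereas
real-world corruption often involves Windows-1252 and other encodings. It also
has none of the heuristics the library uses to decide whether text should be
left alone.

```python
def make_mojibake(text: str, wrong_encoding: str = "latin-1") -> str:
    """Simulate the corruption: UTF-8 bytes decoded with the wrong encoding."""
    return text.encode("utf-8").decode(wrong_encoding)


def undo_mojibake(text: str, wrong_encoding: str = "latin-1",
                  max_layers: int = 3) -> str:
    """Naively reverse the mistaken decode, up to max_layers times.

    Hypothetical helper for illustration only. It stops as soon as the
    text cannot be re-encoded, the bytes are not valid UTF-8, or a round
    trip no longer changes anything.
    """
    for _ in range(max_layers):
        try:
            repaired = text.encode(wrong_encoding).decode("utf-8")
        except (UnicodeEncodeError, UnicodeDecodeError):
            break  # not (or no longer) mojibake of this particular kind
        if repaired == text:
            break  # pure ASCII or otherwise unchanged
        text = repaired
    return text


if __name__ == "__main__":
    once = make_mojibake("café")
    twice = make_mojibake(once)
    print(repr(once))                  # one layer of corruption
    print(repr(undo_mojibake(once)))   # repaired
    print(repr(undo_mojibake(twice)))  # two layers repaired
```

With the packaged library, the equivalent call would go through upstream's
`ftfy.fix_text()` function, which covers far more cases than this sketch.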

I plan to maintain this package as part of the Python team.
