Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #92406 > unrolled thread

Re: Python NBSP DWIM

Started byrandom832@fastmail.us
First post2015-06-10 11:57 -0400
Last post2015-06-10 11:57 -0400
Articles 1 — 1 participant

Back to article view | Back to comp.lang.python

This discussion starts older than the indexed window; earlier articles aren't shown. The article labeled Started by below is the oldest one visible, not the original post.


Contents

  Re: Python NBSP DWIM random832@fastmail.us - 2015-06-10 11:57 -0400

#92406 — Re: Python NBSP DWIM

Fromrandom832@fastmail.us
Date2015-06-10 11:57 -0400
SubjectRe: Python NBSP DWIM
Message-ID<mailman.348.1433951861.13271.python-list@python.org>
On Wed, Jun 10, 2015, at 11:03, Laura Creighton wrote:
> In these unicode days, this thinking may need to be revisited.  There
> are many languages where whitespace does not separate words -- either
> words aren't separated, or in Vietnamese, spaces separate syllables,
> so entire words have spaces in them.

Text wrapping for CJK scripts is another topic that might be worth
addressing in textwrap - words aren't space-separated, but there are
still rules about where you can place a line break. Generally these are
centered around preventing punctuation marks from being orphaned rather
than any attempt to algorithmically find word boundaries.

For the process called "Oikomi", while messing with kerning is not
strictly possible for monospaced text, it might be worthwhile in general
to have "preferred" and "maximum" line widths as parameters for
textwrap.

http://en.wikipedia.org/wiki/Line_breaking_rules_in_East_Asian_languages

[toc] | [standalone]


Back to top | Article view | comp.lang.python


csiph-web