Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.python > #92406 > unrolled thread
| Started by | random832@fastmail.us |
|---|---|
| First post | 2015-06-10 11:57 -0400 |
| Last post | 2015-06-10 11:57 -0400 |
| Articles | 1 — 1 participant |
Back to article view | Back to comp.lang.python
This discussion starts older than the indexed window; earlier articles aren't shown. The article labeled Started by
below is the oldest one visible, not the original post.
Re: Python NBSP DWIM random832@fastmail.us - 2015-06-10 11:57 -0400
| From | random832@fastmail.us |
|---|---|
| Date | 2015-06-10 11:57 -0400 |
| Subject | Re: Python NBSP DWIM |
| Message-ID | <mailman.348.1433951861.13271.python-list@python.org> |
On Wed, Jun 10, 2015, at 11:03, Laura Creighton wrote: > In these unicode days, this thinking may need to be revisited. There > are many languages where whitespace does not separate words -- either > words aren't separated, or in Vietnamese, spaces separate syllables, > so entire words have spaces in them. Text wrapping for CJK scripts is another topic that might be worth addressing in textwrap - words aren't space-separated, but there are still rules about where you can place a line break. Generally these are centered around preventing punctuation marks from being orphaned rather than any attempt to algorithmically find word boundaries. For the process called "Oikomi", while messing with kerning is not strictly possible for monospaced text, it might be worthwhile in general to have "preferred" and "maximum" line widths as parameters for textwrap. http://en.wikipedia.org/wiki/Line_breaking_rules_in_East_Asian_languages
Back to top | Article view | comp.lang.python
csiph-web