Groups > comp.lang.c > #77357 > unrolled thread

Working efficiently with 32-bit Unicode output streams, locale etc.

Started by	"Morten W. Petersen" <morphex@gmail.com>
First post	2015-11-29 01:06 +0100
Last post	2015-12-02 09:58 -0800
Articles	10 on this page of 210 — 25 participants

Back to article view | Back to comp.lang.c

  Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-29 01:06 +0100
    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Nobody <nobody@nowhere.invalid> - 2015-11-29 02:01 +0000
      Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-29 03:31 +0100
        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-29 00:09 -0600
        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Robert Wessel <robertwessel2@yahoo.com> - 2015-11-29 00:22 -0600
        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Damon <Richard@Damon-Family.org> - 2015-11-29 14:31 -0500
        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Nobody <nobody@nowhere.invalid> - 2015-11-29 23:51 +0000
          Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 01:21 +0100
            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-11-30 00:41 -0800
            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-30 03:16 -0600
      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Jorgen Grahn <grahn+nntp@snipabacken.se> - 2015-11-29 08:28 +0000
      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-29 02:54 -0600
    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ian Collins <ian-news@hotmail.com> - 2015-11-29 16:30 +1300
      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-11-28 23:53 -0800
        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-29 02:23 -0600
          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-11-29 00:30 -0800
            Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 01:33 +0100
              Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-11-30 13:54 +1300
                Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 02:03 +0100
                  Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-11-30 14:15 +1300
                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 02:34 +0100
                      Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-11-30 14:42 +1300
                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 04:16 +0100
                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-29 20:20 -0600
                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 04:34 +0100
                          Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-11-30 17:09 +1300
                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 06:17 +0100
                              Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-11-30 19:44 +1300
                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-29 23:36 -0600
                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 07:39 +0100
                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-30 13:56 -0600
                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-12-01 09:17 +0100
                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 13:40 -0600
                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-12-04 00:34 +0100
                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-03 16:03 -0800
                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-11-29 23:07 -0800
                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 08:20 +0100
                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-11-29 23:40 -0800
                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 08:48 +0100
                              Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-11-30 20:52 +1300
                                Re: Working efficiently with 32-bit Unicode output streams, locale     etc. Ian Collins <ian-news@hotmail.com> - 2015-11-30 21:04 +1300
                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-11-30 00:34 -0800
                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-30 03:50 -0600
                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-11-30 12:16 +0000
                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-11-30 06:11 -0800
                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-30 13:23 -0600
                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-30 13:18 -0600
                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-11-30 13:23 -0800
                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-11-30 22:32 +0000
                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-11-30 15:10 -0800
                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-30 21:05 -0600
                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-01 12:38 +0000
                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-01 14:43 +0000
                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-01 12:09 -0800
                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ian Collins <ian-news@hotmail.com> - 2015-12-02 09:14 +1300
                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-01 12:27 -0800
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ian Collins <ian-news@hotmail.com> - 2015-12-02 10:14 +1300
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-01 18:01 -0600
                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-01 20:41 +0000
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-12-01 12:53 -0800
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-01 21:32 +0000
                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-12-01 13:55 -0800
                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. raltbos@xs4all.nl (Richard Bos) - 2015-12-04 10:30 +0000
                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-01 18:46 -0600
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Say, what? <<nothing@nowhere.nohow>> - 2015-12-01 14:07 -0800
                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-01 23:54 +0000
                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Say, what? <<nothing@nowhere.nohow>> - 2015-12-01 17:13 -0800
                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Martin Shobe <martin.shobe@yahoo.com> - 2015-12-01 09:08 -0600
                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-01 20:02 +0000
                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Martin Shobe <martin.shobe@yahoo.com> - 2015-12-01 17:03 -0600
                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-02 00:17 +0000
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-01 16:53 -0800
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Martin Shobe <martin.shobe@yahoo.com> - 2015-12-01 21:17 -0600
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 09:37 -0600
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. James Kuyper <jameskuyper@verizon.net> - 2015-12-02 10:59 -0500
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-02 17:43 +0000
                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 13:22 -0600
                                                Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-12-03 09:32 +1300
                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-02 21:12 +0000
                                                    Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-12-03 10:36 +1300
                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-02 22:00 +0000
                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 17:55 -0600
                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-02 17:04 -0800
                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-03 01:11 +0000
                                                            Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-12-03 14:19 +1300
                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 23:16 -0600
                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-03 00:54 -0600
                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-03 04:07 -0800
                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. glen herrmannsfeldt <gah@ugcs.caltech.edu> - 2015-12-03 18:31 +0000
                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Eric Sosman <esosman@comcast-dot-net.invalid> - 2015-12-03 13:59 -0500
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. glen herrmannsfeldt <gah@ugcs.caltech.edu> - 2015-12-03 19:45 +0000
                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-03 14:38 -0800
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. glen herrmannsfeldt <gah@ugcs.caltech.edu> - 2015-12-03 22:43 +0000
                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-03 12:14 +0000
                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Heathfield <rjh@cpax.org.uk> - 2015-12-03 12:38 +0000
                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-03 13:19 +0000
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-03 05:54 -0800
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. raltbos@xs4all.nl (Richard Bos) - 2015-12-04 10:50 +0000
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Heathfield <rjh@cpax.org.uk> - 2015-12-03 14:26 +0000
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-03 09:19 -0600
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-03 16:25 +0100
                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Heathfield <rjh@cpax.org.uk> - 2015-12-03 15:33 +0000
                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-03 16:47 +0100
                                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Heathfield <rjh@cpax.org.uk> - 2015-12-03 16:54 +0000
                                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-03 09:32 -0800
                                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-03 18:53 +0100
                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Steve Thompson <stevet810@gmail.com> - 2015-12-03 19:00 +0000
                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-04 14:07 +0100
                                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Steve Thompson <stevet810@gmail.com> - 2015-12-04 18:41 +0000
                                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-05 16:09 +0100
                                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Steve Thompson <stevet810@gmail.com> - 2015-12-05 21:15 +0000
                                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-06 12:35 +0100
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-12-03 09:02 -0800
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-03 19:12 +0000
                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-03 16:58 -0600
                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-03 15:47 +0100
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Heathfield <rjh@cpax.org.uk> - 2015-12-03 14:51 +0000
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-03 16:50 +0100
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. raltbos@xs4all.nl (Richard Bos) - 2015-12-04 10:55 +0000
                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-03 08:56 -0600
                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-03 05:24 -0800
                                                                Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-12-04 08:49 +1300
                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-03 07:07 -0800
                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-03 10:27 -0600
                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-03 09:01 -0800
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. fir <profesor.fir@gmail.com> - 2015-12-03 10:16 -0800
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-12-04 01:21 +0100
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-03 16:42 -0800
                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-04 11:15 +0100
                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-12-08 01:57 +0100
                                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. David Brown <david.brown@hesbynett.no> - 2015-12-08 09:08 +0100
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-04 09:44 -0600
                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Heathfield <rjh@cpax.org.uk> - 2015-12-04 15:58 +0000
                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-04 11:43 -0600
                                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Geoff <geoff@invalid.invalid> - 2015-12-04 10:56 -0800
                                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-12-04 11:20 -0800
                                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-04 15:24 -0600
                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-04 09:30 -0600
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Heathfield <rjh@cpax.org.uk> - 2015-12-04 15:52 +0000
                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-04 09:07 -0800
                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-04 09:53 -0800
                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-04 10:56 -0800
                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-04 15:04 -0600
                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-04 21:32 +0000
                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-04 13:38 -0800
                                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-04 16:13 -0600
                                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-04 16:21 -0800
                                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-04 19:10 -0600
                                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Geoff <geoff@invalid.invalid> - 2015-12-04 19:16 -0800
                                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-04 21:19 -0800
                                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-05 12:44 -0600
                                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-06 09:01 -0800
                                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-06 12:34 -0600
                                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-06 18:32 -0800
                                                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-07 10:43 -0600
                                                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-07 10:02 -0800
                                                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-05 03:53 -0800
                                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-05 09:39 -0800
                                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. glen herrmannsfeldt <gah@ugcs.caltech.edu> - 2015-12-05 18:36 +0000
                                                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-05 12:26 -0600
                                                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-05 11:36 -0800
                                                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Udyant Wig <udyantw@gmail.com> - 2015-12-06 16:42 +0530
                                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-06 03:59 -0800
                                                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Robert Wessel <robertwessel2@yahoo.com> - 2015-12-07 02:17 -0600
                                                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. supercat@casperkitty.com - 2015-12-07 07:33 -0800
                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. fir <profesor.fir@gmail.com> - 2015-12-03 03:57 -0800
                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-12-04 00:58 +0100
                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-03 01:34 +0000
                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-03 11:38 +0000
                                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-03 14:09 +0000
                                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-03 10:10 -0600
                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-03 08:28 -0800
                                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-03 21:33 +0000
                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Heathfield <rjh@cpax.org.uk> - 2015-12-02 21:47 +0000
                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 16:05 -0600
                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-12-02 14:12 -0800
                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-02 22:47 +0000
                                                        Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-12-03 14:00 +1300
                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-03 01:38 -0600
                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-03 02:20 -0800
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. raltbos@xs4all.nl (Richard Bos) - 2015-12-04 10:40 +0000
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Nobody <nobody@nowhere.invalid> - 2015-12-03 02:42 +0000
                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Damon <Richard@Damon-Family.org> - 2015-12-01 20:48 -0500
                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-02 12:08 +0000
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-02 04:21 -0800
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-02 14:05 +0000
                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-12-04 01:31 +0100
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-02 14:23 +0000
                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-02 08:00 -0800
                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-02 16:49 +0000
                                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-02 11:50 -0800
                                                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-02 20:02 +0000
                                                        Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-02 12:31 -0800
                                                          Re: Working efficiently with 32-bit Unicode output streams, locale etc. Ben Bacarisse <ben.usenet@bsb.me.uk> - 2015-12-03 01:43 +0000
                                                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-12-02 09:21 -0800
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Richard Damon <Richard@Damon-Family.org> - 2015-12-02 07:29 -0500
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-12-02 05:47 -0800
                                                Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 11:03 -0600
                                              Re: Working efficiently with 32-bit Unicode output streams, locale etc. BartC <bc@freeuk.com> - 2015-12-02 14:16 +0000
                                                Re: Working efficiently with 32-bit Unicode output streams, locale   etc. Ian Collins <ian-news@hotmail.com> - 2015-12-03 09:56 +1300
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 13:49 -0600
                                            Re: Working efficiently with 32-bit Unicode output streams, locale etc. Philip Lantz <prl@canterey.us> - 2015-12-02 22:11 -0800
                                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-12-02 15:06 -0600
                      Re: Working efficiently with 32-bit Unicode output streams, locale etc. Jorgen Grahn <grahn+nntp@snipabacken.se> - 2015-11-30 22:14 +0000
              Re: Working efficiently with 32-bit Unicode output streams, locale etc. Stephen Sprunk <stephen@sprunk.org> - 2015-11-29 23:03 -0600
                Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-30 06:26 +0100
                  Re: Working efficiently with 32-bit Unicode output streams, locale etc. Keith Thompson <kst-u@mib.org> - 2015-11-30 00:39 -0800
                    Re: Working efficiently with 32-bit Unicode output streams, locale etc. Malcolm McLean <malcolm.mclean5@btinternet.com> - 2015-11-30 01:57 -0800
        Re: Working efficiently with 32-bit Unicode output streams, locale etc. "Morten W. Petersen" <morphex@gmail.com> - 2015-11-29 15:32 +0100
    Re: Working efficiently with 32-bit Unicode output streams, locale etc. fir <profesor.fir@gmail.com> - 2015-12-02 09:58 -0800

Page 11 of 11 — ← Prev page 1 … 9 10 [11]

#77654

From	Stephen Sprunk <stephen@sprunk.org>
Date	2015-12-02 13:49 -0600
Message-ID	<n3nhrt$oot$1@dont-email.me>
In reply to	#77606

On 02-Dec-15 06:08, BartC wrote:
> On 02/12/2015 01:48, Richard Damon wrote:
>> On 12/1/15 3:02 PM, BartC wrote:
>>> 
>>> This is the problem I have with people saying that UTF8 can be be
>>> used transparently.
>> 
>> There is one simple rule that a program needs to follow for UTF-8
>> to be transparent, the program must only break the string at the
>> boundaries of known characters (either from known characters (like
>> space or new line) or boundaries of known valid strings.
>> 
>> UTF-8 is specifically designed that no code point is a sub-string
>> of another code point.
>> 
>> Programs that assume that a string can be broken at arbitrary
>> points, can be broken by UTF-8.
> 
> I think 99% of the programs I've ever written have needed at some
> point to deal with strings character by character.

Then you are working in a very unusual problem domain.  I assume I've
worked on code that _did_ need to do that at some point, but I can't
recall any particular instances.

> Actually just a few days ago I had to take a bunch of separate
> strings, truncate each one to fit a certain pixel-width, and combine
> them into one string separated by tabs and 'reverse-tabs' which
> corresponded to a particular pixel-specified table of tab stops.

That sounds like part of a text rendering engine, which is one of the
areas that _obviously_ has to handle such things--but it also has to
deal with other sorts of Unicode oddities, e.g. combining characters and
non-characters, and encoding is the _least_ of such problems.

> If I can't use 8-bit characters for that then I'd need to switch to
> 16 or 32-bit ones. Then I could use the same code, as only data types
> will have changed.

Nope; see above.

> Trying to do it directly on UTF8 would mean rewriting pretty much 
> everything.

No, you'd just need to adjust it so that it didn't split a leading byte
from its trailing byte(s).  You'd need to do the same with UTF-16
surrogate pairs.  Only UTF-32 is exempt from that problem.

> BTW I notice nobody has given me an example of how to rewrite that 
> little example of mine to properly display the separate characters of
> a string (show each one on a separate line together with its hex and 
> decimal code).
> 
> I guess it's not so easy after all...

Or we weren't interested because the solution is so obvious.  I'll take
a stab at it later today since you brought it up again.

S

-- 
Stephen Sprunk         "God does not play dice."  --Albert Einstein
CCIE #3723         "God is an inveterate gambler, and He throws the
K5SSS        dice at every possible opportunity." --Stephen Hawking

[toc] | [prev] | [next] | [standalone]

#77707

From	Philip Lantz <prl@canterey.us>
Date	2015-12-02 22:11 -0800
Message-ID	<MPG.30c97cfa3620410abd@news.eternal-september.org>
In reply to	#77606

BartC wrote:
> Richard Damon wrote:
> > BartC wrote:
> >> This is the problem I have with people saying that UTF8 can be be used
> >> transparently.
> >
> > There is one simple rule that a program needs to follow for UTF-8 to be
> > transparent, the program must only break the string at the boundaries of
> > known characters (either from known characters (like space or new line)
> > or boundaries of known valid strings.
> >
> > UTF-8 is specifically designed that no code point is a sub-string of
> > another code point.
> >
> > Programs that assume that a string can be broken at arbitrary points,
> > can be broken by UTF-8.
> 
> I think 99% of the programs I've ever written have needed at some point 
> to deal with strings character by character.
> 
> Actually just a few days ago I had to take a bunch of separate strings, 
> truncate each one to fit a certain pixel-width, and combine them into 
> one string separated by tabs and 'reverse-tabs' which corresponded to a 
> particular pixel-specified table of tab stops.

In programs I've worked on that did string handling, I need to look
for specific characters. Say for example I was reading an ini file
with key=value lines and maybe sections demarcated by [section]. Then
I need to look for brackets, equals, newlines, possibly quotation marks,
possibly digits, if I need to convert values to numbers. But the section
names, key names, values (if they are uninterpreted strings) could all
be UTF-8 and my parsing code wouldn't care *at all*. The exact same code
that works for ASCII will just work. That is, I think, what people mean
when they say that UTF-8 can be used transparently.

Philip

[toc] | [prev] | [next] | [standalone]

#77664

From	Stephen Sprunk <stephen@sprunk.org>
Date	2015-12-02 15:06 -0600
Message-ID	<n3nmbk$cee$1@dont-email.me>
In reply to	#77502

On 01-Dec-15 06:38, BartC wrote:
> On 01/12/2015 03:05, Stephen Sprunk wrote:
>> On 30-Nov-15 16:32, BartC wrote:
>>> Meanwhile I still occasionally come across problems with the 
>>> representation of £ or €; maybe they should fix those first
>>> before we worry about ancient scripts or rare Chinese ideograms.
>> 
>> If you're getting mojibake or replacement characters, that is
>> usually due to folks using some ancient encoding rather than
>> something modern and sensible, e.g. UTF-8.
> 
> This is a typical problem I would get (source code was UTF8):

Untested, so there's probably stupid typos, but here's a rough idea:

size_t str8len(char * const s) {
  int len=0;
  while (*s) len+=(*s++&0xC0)==0x80;
  return len;
}

size_t chr8len(char * const s) {
  if ((*s&0xC0)==0x00) return 1;
  if ((*s&0xE0)==0xC0) return 2;
  if ((*s&0xF0)==0xE0) return 3;
  if ((*s&0xF0)==0xF0) return 4;
  return 0;
}

int main(void) {
  char s[] = "£100 = €140";
  char *p;
  int i,j;

  printf("%s\n",s);
  printf("code units = %zu, code points = %zu\n", strlen(s), str8len(s));

  for (i=0, p=s; *p; p+=chr8len(p), i++)
    switch (chr8len(p)) {
      case 1:
        printf("%2d: %03d %02X <%c>\n", i, p[0], p[0], p[0]); break;
      case 2:
        printf("%2d: %03d %03d %02X %02X <%c%c>\n", i, p[0], p[1], p[0],
p[1], p[0], p[1]); break;
      case 3:
        printf("%2d: %03d %03d %03d %02X %02X %02X <%c%c%c>\n", i, p[0],
p[1], p[2], p[0], p[1], p[2], p[0], p[1], p[2]); break;
      case 4:
        printf("%2d: %03d %03d %03d %03d %02X %02X %02X %02X
<%c%c%c%c>\n", i, p[0], p[1], p[2], p[3], p[0], p[1], p[2], p[3], p[0],
p[1], p[2], p[3]); break;
    }
}

The loop is rather ugly, but that's a result of your chosen format,
which was obviously designed to make UTF-8 look bad.

> This is basic stuff. And I'm doing serial access on the string, not
> random.

You're still _splitting_ the string, which is the root problem.

> So much for the majority of programs being able to work unchanged
> with UTF8!

If you treat them as opaque blobs or even concatenate them, there are no
changes required.  You can even search a string for code points, though
you have to use strstr() rather than strchr() for a single code point
above U+007F, same as multiple ASCII characters.

> I'd need to start going into the multi-byte and wide char stuff. On
> Windows, that means UCS2 or UTF16 or whatever it is now, which
> apparently isn't good enough either.

UTF-16 takes the worst of UTF-8 and UTF-32 and the best of neither.

UCS-2 isn't too bad, but it can't handle non-BMP characters.

S

-- 
Stephen Sprunk         "God does not play dice."  --Albert Einstein
CCIE #3723         "God is an inveterate gambler, and He throws the
K5SSS        dice at every possible opportunity." --Stephen Hawking

[toc] | [prev] | [next] | [standalone]

#77485

From	Jorgen Grahn <grahn+nntp@snipabacken.se>
Date	2015-11-30 22:14 +0000
Message-ID	<slrnn5piht.5q5.grahn+nntp@frailea.sa.invalid>
In reply to	#77424

On Mon, 2015-11-30, Morten W. Petersen wrote:
> On 30.11.2015 02:15, Ian Collins wrote:
...
>> Have you ever encountered a UTF-32 (or UTF-16) encoded XML document?  I
>> can't imagine why anyone would want to create one given the lack of
>> applications that can read, let alone parse, it.  UTF-8 is universally
>> popular (off Windows) because being a super-set of ASCII, just about
>> anything can display it.
>
> Well, let's say you have some organization that wants to create an
> archive of lots of non-latin history, in XML.
>
> For them, choosing XML is right, and UTF-8 uses 3 bytes on characters 
> U+0800 through U+FFFF, but only 2 bytes in UTF-16.
>
> However, UTF-16 is vulnerable to the entire string being corrupted
> after invalid data has been encountered.
>
> So this organization chooses to use UTF-32, because the unnecessary byte
> there also acts as a delimiter.
>
> This is plausible.

Having read http://www.thedailywtf.com/, anything is plausible.
But not everything that is plausible ought to be encouraged.

/Jorgen

-- 
  // Jorgen Grahn <grahn@  Oo  o.   .     .
\X/     snipabacken.se>   O  o   .

[toc] | [prev] | [next] | [standalone]

#77431

From	Stephen Sprunk <stephen@sprunk.org>
Date	2015-11-29 23:03 -0600
Message-ID	<n3gl7e$f5c$1@dont-email.me>
In reply to	#77419

On 29-Nov-15 18:33, Morten W. Petersen wrote:
> On 29.11.2015 09:30, Malcolm McLean wrote:
>> You haven't been following the ng over the a last few weeks. Morten
>> is developing an XML parser. This is the first we've heard about
>> the XML writer (as is typical, a writer is a lot easier to develop
>> than a reader).
> 
> I really don't want to do more than I have to, what I want to do is 
> create a library/program that will enable reading, writing and 
> manipulating XML files.  And do it securely, correctly, and fast.

If it can't read or write UTF-8 files like every other XML library, then
your users probably won't consider it "correct".

> Everyone is on about UTF-8 it seems, and that's the world as it is 
> today..

And tomorrow.

> UTF-16 is sort of the middle way which requires some tricks to
> represent all characters

UTF-16 is not the "middle way"; it is an ugly hack so that a handful of
(admittedly popular) systems designed around UCS-2 could continue to
work after Unicode extended past the BMP.  Except they don't work, due
to all the code that still _thinks_ it's dealing with UCS-2.

It shouldn't be used in _new_ systems, and it should be ripped out of
existing systems as quickly as feasible.

> while UTF-32 is what it is, but requires more storage.

Which is part of why _nobody_ uses it for wire/file formats.  It's a
decent representation in memory because you can't accidentally split a
code point, but it perpetuates the misconception that a code point is a
"character" in the senses that users care about.

> There has been a long discussion about UTF-x on this newsgroup 
> earlier, and that discussion shows the reasoning behind everything.

Apparently, it didn't sink in.

> There might be UTF-8 and UTF-16 reading/writing support tacked on 
> later in the development process,

Don't bother releasing an XML library until it supports UTF-8, because
that's what _everyone_ else uses.  There is little point in supporting
UTF-16, and absolutely none in supporting UTF-32.

> but for example bzipping an XML file I assume would produce files
> of roughly equal size if it is in UTF-8, 16 or 32, ...

Yes, but that's no reason to _start_ with files that're 2-4 times as
large as they need to be.

S

-- 
Stephen Sprunk         "God does not play dice."  --Albert Einstein
CCIE #3723         "God is an inveterate gambler, and He throws the
K5SSS        dice at every possible opportunity." --Stephen Hawking

[toc] | [prev] | [next] | [standalone]

#77433

From	"Morten W. Petersen" <morphex@gmail.com>
Date	2015-11-30 06:26 +0100
Message-ID	<n3gmli$l4l$1@speranza.aioe.org>
In reply to	#77431

On 30.11.2015 06:03, Stephen Sprunk wrote:
> On 29-Nov-15 18:33, Morten W. Petersen wrote:
[...]
>> UTF-16 is sort of the middle way which requires some tricks to
>> represent all characters
>
> UTF-16 is not the "middle way"; it is an ugly hack so that a handful of
> (admittedly popular) systems designed around UCS-2 could continue to
> work after Unicode extended past the BMP.  Except they don't work, due
> to all the code that still _thinks_ it's dealing with UCS-2.
>
> It shouldn't be used in _new_ systems, and it should be ripped out of
> existing systems as quickly as feasible.

OK.

>> while UTF-32 is what it is, but requires more storage.
>
> Which is part of why _nobody_ uses it for wire/file formats.  It's a
> decent representation in memory because you can't accidentally split a
> code point, but it perpetuates the misconception that a code point is a
> "character" in the senses that users care about.
>
>> There has been a long discussion about UTF-x on this newsgroup
>> earlier, and that discussion shows the reasoning behind everything.
>
> Apparently, it didn't sink in.

Maybe that thread didn't cover everything, but I've made up my mind;
if you have specific points that I based my decision on that are wrong
we could discuss that.

>> There might be UTF-8 and UTF-16 reading/writing support tacked on
>> later in the development process,
>
> Don't bother releasing an XML library until it supports UTF-8, because
> that's what _everyone_ else uses.  There is little point in supporting
> UTF-16, and absolutely none in supporting UTF-32.

OK, that's your opinion.  But I think it's strange that you're trying to
tell me what I should and should not do, that's up to me to decide,
based on what I think is important.

-Morten

[toc] | [prev] | [next] | [standalone]

#77448

From	Keith Thompson <kst-u@mib.org>
Date	2015-11-30 00:39 -0800
Message-ID	<lny4dfx4nn.fsf@kst-u.example.com>
In reply to	#77433

"Morten W. Petersen" <morphex@gmail.com> writes:
> On 30.11.2015 06:03, Stephen Sprunk wrote:
>> On 29-Nov-15 18:33, Morten W. Petersen wrote:
[...]
>>> There might be UTF-8 and UTF-16 reading/writing support tacked on
>>> later in the development process,
>>
>> Don't bother releasing an XML library until it supports UTF-8, because
>> that's what _everyone_ else uses.  There is little point in supporting
>> UTF-16, and absolutely none in supporting UTF-32.
>
> OK, that's your opinion.  But I think it's strange that you're trying to
> tell me what I should and should not do, that's up to me to decide,
> based on what I think is important.

What's strange about it?  You're discussing it here in a public
form; of course you're going to get advice.  And of course you're
free to ignore it.

If I understand you correctly, you plan to release an XML library
that, at least in its initial version, writes its output in UTF-32
(with what endianness?  With or without a BOM?)  and is not able
to write UTF-16 or UTF-8.  I seriously doubt that anyone else would
find such a thing useful; I certainly wouldn't.

-- 
Keith Thompson (The_Other_Keith) kst-u@mib.org  <http://www.ghoti.net/~kst>
Working, but not speaking, for JetHead Development, Inc.
"We must do something.  This is something.  Therefore, we must do this."
    -- Antony Jay and Jonathan Lynn, "Yes Minister"

[toc] | [prev] | [next] | [standalone]

#77453

From	Malcolm McLean <malcolm.mclean5@btinternet.com>
Date	2015-11-30 01:57 -0800
Message-ID	<d33b2fad-d337-4a0d-8431-4539a458592c@googlegroups.com>
In reply to	#77448

On Monday, November 30, 2015 at 8:39:15 AM UTC, Keith Thompson wrote:
> "Morten W. Petersen" <morphex@gmail.com> writes:
> 
> If I understand you correctly, you plan to release an XML library
> that, at least in its initial version, writes its output in UTF-32
> (with what endianness?  With or without a BOM?)  and is not able
> to write UTF-16 or UTF-8.  I seriously doubt that anyone else would
> find such a thing useful; I certainly wouldn't.
> 
The odd thing is that it's probably more useful that way.

There are a lot of XML parsers out there. I put one on the web myself. This one is
also a writer,and whilst generating XML is a lot easier than reading it, I don't
doubt there are also lots of writers.
But a parser / writer that is focused on UTF-32? I think that's pretty hard to
find. But there are probably some people out there who use UTF-32. So
there's a niche use for it.

[toc] | [prev] | [next] | [standalone]

#77394

From	"Morten W. Petersen" <morphex@gmail.com>
Date	2015-11-29 15:32 +0100
Message-ID	<n3f28f$cd7$1@speranza.aioe.org>
In reply to	#77371

On 29.11.2015 08:53, Malcolm McLean wrote:
> On Sunday, November 29, 2015 at 3:30:55 AM UTC, Ian Collins wrote:
>> Morten W. Petersen wrote:
>>> Hi there.
>>>
>>> By now I should be a little bit known for working on my XML library
>>> Smash XML :), here
>>>
>>> https://github.com/morphex/smash_xml
>>>
>>> Now I'm at the point where I'm writing some output (write XML files)
>>> functionality..  I want to output in an UTF32-LE encoding, which means
>>> each "character" takes up 32 bits or 4 regular chars.
>>
>> Why do that when just about everyone else uses UTF-8?  I don't think
>> I've seen an XML document that uses anything else.  Even MS Office XML
>> uses UTF-8.
>>
> Since UTF-32 is allowed, a general purpose reader must support it.
> Since it is never used, such a reader can only be tested if you first
> produce a UTF-32 writer.
>
> (I think the only easy way is to dump the UTF-32 as binary then open it
> in a sophisticated word-processor).

The UTF-8 vs 32 discussion has been had and a decision has been made.

As for writing as binary yes that was what I was thinking, and using an
advanced word-processor that is what I've been doing so far (jEdit
FYI).

-Morten

[toc] | [prev] | [next] | [standalone]

#77645

From	fir <profesor.fir@gmail.com>
Date	2015-12-02 09:58 -0800
Message-ID	<30dc4da4-37bc-4b96-8794-aac21e148362@googlegroups.com>
In reply to	#77357

ps i didnt noticed this thread, (nor readed it) writing near here about 'unicode fail', (only some simple coincidence here)

[toc] | [prev] | [standalone]

Page 11 of 11 — ← Prev page 1 … 9 10 [11]

csiph-web

Working efficiently with 32-bit Unicode output streams, locale etc.

Contents

#77654

#77707

#77664

#77485

#77431

#77433

#77448

#77453

#77394

#77645