Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.java.programmer > #7740

Re: contains

From supercalifragilisticexpialadiamaticonormalizeringelimatisticantations <supercalifragilisticexpialadiamaticonormalizeringelimatisticantations@averylongandannoyingdomainname.com>
Newsgroups comp.lang.java.programmer
Subject Re: contains
Date 2011-09-08 20:14 -0400
Organization supercalifragilisticexpialadiamaticonormalizeringelimatisticantations
Message-ID <j4blp3$ub9$1@speranza.aioe.org> (permalink)
References (1 earlier) <Zw1aq.8466$CQ4.1852@newsfe09.iad> <13987de0-042f-45e7-8279-25e9f7bcfb0e@glegroupsg2000goo.googlegroups.com> <3n4aq.6818$GV2.28@newsfe20.iad> <7bd53b6f-ed95-4f77-995e-a179f4f30ad0@glegroupsg2000goo.googlegroups.com> <0Q8aq.1219$tT1.1195@newsfe21.iad>

Show all headers | View raw


On 08/09/2011 3:37 PM, Arved Sandstrom wrote:
> I'm not exactly advising any programmer to deal with just ASCII; what I
> am saying here is that if you know that your text is ASCII text (*way*
> more common than you make out) that lowercasing and uppercasing in this
> particular situation is a potential approach. By ASCII text I still mean
> Unicode; simply the ASCII subset thereof.

Not only that -- if everything is passed through 
.toLowerCase().toUpperCase() then the input set of strings gets 
projected down onto a particular set of canonical representations. Some 
stuff will get conflated, but I think they amount only to alternative 
spellings of the same thing -- so finding matches among them does amount 
to there being substrings in common among the original inputs.

Back to comp.lang.java.programmer | Previous | NextPrevious in thread | Next in thread | Find similar


Thread

contains bob <bob@coolgroups.com> - 2011-09-08 00:44 -0700
  Re: contains Arved Sandstrom <asandstrom3minus1@eastlink.ca> - 2011-09-08 08:19 -0300
    Re: contains Lew <lewbloch@gmail.com> - 2011-09-08 06:37 -0700
      Re: contains Arved Sandstrom <asandstrom3minus1@eastlink.ca> - 2011-09-08 11:33 -0300
        Re: contains Lew <lewbloch@gmail.com> - 2011-09-08 07:47 -0700
          Re: contains Andreas Leitgeb <avl@gamma.logic.tuwien.ac.at> - 2011-09-08 18:59 +0000
            Re: contains Lew <lewbloch@gmail.com> - 2011-09-08 16:59 -0700
          Re: contains Arved Sandstrom <asandstrom3minus1@eastlink.ca> - 2011-09-08 16:37 -0300
            Re: contains supercalifragilisticexpialadiamaticonormalizeringelimatisticantations <supercalifragilisticexpialadiamaticonormalizeringelimatisticantations@averylongandannoyingdomainname.com> - 2011-09-08 20:14 -0400
  Re: contains Arne Vajhøj <arne@vajhoej.dk> - 2011-09-08 20:05 -0400

csiph-web