Groups | Search | Server Info | Login | Register


Groups > comp.lang.awk > #9801

Re: (Long post) Metaphone Algorithm In AWK

From Ben Bacarisse <ben@bsb.me.uk>
Newsgroups comp.lang.awk
Subject Re: (Long post) Metaphone Algorithm In AWK
Date 2024-08-21 00:58 +0100
Organization A noiseless patient Spider
Message-ID <87wmkapx0x.fsf@bsb.me.uk> (permalink)
References <v9qbgh$1u7qe$1@dont-email.me> <878qwts8bd.fsf@bsb.me.uk> <va1aht$3906i$1@dont-email.me>

Show all headers | View raw


porkchop@invalid.foo (Mike Sanders) writes:

> Ben Bacarisse <ben@bsb.me.uk> wrote:
>
>> Using a word list, I found some odd matches.  For example:
>> 
>> $ echo "drunkeness indigestion" | awk -f metaphone.awk -v find=texas
>> drunkeness
>> indigestion
>> 
>> Are these really metaphone matches for "texas"?  It's possible (I don't
>> know the algorithm at all well) but I found it surprising.
>
> Ben, give this try when you can. Finally starting to wrap my mind around
> its usage a little more...

I don't know what your are asking for as this (your latest AWK) is not
just an implementation of the metaphone algorithm.  With the extra
Levenshtein test it "texas" matches only a few words.

However, if I remove the extra condition (that levenshtein($x, find) <=
2) your AWK code matches a different set of words to the C
implementation.  Looking a bit deeper, your AWK code give the code TKSS
to the word "texas" but the C code assigns is "TKS".

-- 
Ben.

Back to comp.lang.awk | Previous | NextPrevious in thread | Next in thread | Find similar


Thread

(Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-17 14:18 +0000
  Re: (Long post) Metaphone Algorithm In AWK Ben Bacarisse <ben@bsb.me.uk> - 2024-08-19 00:46 +0100
    Re: (Long post) Metaphone Algorithm In AWK Ben Bacarisse <ben@bsb.me.uk> - 2024-08-19 02:15 +0100
    Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-19 03:22 +0000
      Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-19 04:34 +0000
    Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-20 05:45 +0000
      Re: (Long post) Metaphone Algorithm In AWK Ben Bacarisse <ben@bsb.me.uk> - 2024-08-21 00:58 +0100
        Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-21 01:07 +0000
          Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-21 02:50 +0000
          Re: (Long post) Metaphone Algorithm In AWK Ben Bacarisse <ben@bsb.me.uk> - 2024-08-21 09:15 +0100
            Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-21 19:13 +0000
  Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-20 11:33 +0000
  Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-21 02:42 +0000
    AWK language trivia (Was: (Long post) Metaphone Algorithm In AWK) gazelle@shell.xmission.com (Kenny McCormack) - 2024-08-21 03:13 +0000
      Re: AWK language trivia porkchop@invalid.foo (Mike Sanders) - 2024-08-21 05:32 +0000
  Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-21 19:03 +0000
  Re: (Long post) Metaphone Algorithm In AWK porkchop@invalid.foo (Mike Sanders) - 2024-08-23 06:13 +0000

csiph-web