| From | Ben Bacarisse <ben@bsb.me.uk> |
|---|---|
| Newsgroups | comp.lang.awk |
| Subject | Re: (Long post) Metaphone Algorithm In AWK |
| Date | 2024-08-21 00:58 +0100 |
| Organization | A noiseless patient Spider |
| Message-ID | <87wmkapx0x.fsf@bsb.me.uk> |
| References | <v9qbgh$1u7qe$1@dont-email.me> <878qwts8bd.fsf@bsb.me.uk> <va1aht$3906i$1@dont-email.me> |
porkchop@invalid.foo (Mike Sanders) writes:

> Ben Bacarisse <ben@bsb.me.uk> wrote:
>
>> Using a word list, I found some odd matches. For example:
>>
>> $ echo "drunkeness indigestion" | awk -f metaphone.awk -v find=texas
>> drunkeness
>> indigestion
>>
>> Are these really metaphone matches for "texas"? It's possible (I don't
>> know the algorithm at all well) but I found it surprising.
>
> Ben, give this try when you can. Finally starting to wrap my mind around
> its usage a little more...

I don't know what you are asking for, as this (your latest AWK) is not
just an implementation of the metaphone algorithm. With the extra
Levenshtein test, "texas" matches only a few words. However, if I remove
the extra condition (that levenshtein($x, find) <= 2), your AWK code
matches a different set of words from the C implementation.

Looking a bit deeper, your AWK code gives the code TKSS to the word
"texas", but the C code assigns it "TKS".

-- 
Ben.
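For readers who want to see what the extra condition (levenshtein($x, find) <= 2) does on its own, here is a minimal sketch of an edit-distance function in POSIX awk, wrapped in a shell function so it can be tried from the command line. It is not the code from the thread: the names `lev` and `min3` and the body of `levenshtein` are assumptions made for illustration, and the metaphone step is left out entirely.

```sh
# Hypothetical helper (not from the thread): print the Levenshtein
# distance between two words, computed in POSIX awk.
lev() {
    awk -v a="$1" -v b="$2" '
    function min3(x, y, z) { return x < y ? (x < z ? x : z) : (y < z ? y : z) }

    # Standard dynamic-programming edit distance, keeping two rows.
    # Names after the extra spaces in the parameter list are locals.
    function levenshtein(s, t,    m, n, i, j, cost, prev, cur) {
        m = length(s); n = length(t)
        for (j = 0; j <= n; j++) prev[j] = j          # distance from empty prefix
        for (i = 1; i <= m; i++) {
            cur[0] = i
            for (j = 1; j <= n; j++) {
                cost = (substr(s, i, 1) == substr(t, j, 1)) ? 0 : 1
                cur[j] = min3(prev[j] + 1,            # deletion
                              cur[j - 1] + 1,         # insertion
                              prev[j - 1] + cost)     # substitution or match
            }
            for (j = 0; j <= n; j++) prev[j] = cur[j] # roll the rows
        }
        return prev[n]
    }

    BEGIN { print levenshtein(a, b) }'
}

lev texas taxes        # prints 2, so this word passes a "<= 2" test
lev texas drunkeness   # well above 2, so this word is filtered out
```

A filter like this explains why few words survive once the distance test is added: a word such as "drunkeness" is too far from "texas" in spelling to pass, whatever phonetic code it receives.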