Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.java.programmer > #10828
| Path | csiph.com!x330-a1.tempe.blueboxinc.net!usenet.pasdenom.info!aioe.org!eternal-september.org!feeder.eternal-september.org!mx04.eternal-september.org!.POSTED!not-for-mail |
|---|---|
| From | markspace <-@.> |
| Newsgroups | comp.lang.java.programmer |
| Subject | Re: Fast String search algorithm |
| Date | Sat, 17 Dec 2011 09:26:44 -0800 |
| Organization | A noiseless patient Spider |
| Lines | 25 |
| Message-ID | <jcijcm$lpg$1@dont-email.me> (permalink) |
| References | <i02ne7ptvvl8ofv9uracdu2jm3oq61v2h3@4ax.com> <alpine.DEB.2.00.1112161852030.13913@urchin.earth.li> |
| Mime-Version | 1.0 |
| Content-Type | text/plain; charset=ISO-8859-1; format=flowed |
| Content-Transfer-Encoding | 7bit |
| Injection-Date | Sat, 17 Dec 2011 17:26:46 +0000 (UTC) |
| Injection-Info | mx04.eternal-september.org; posting-host="XjIWM99mD7Ijfdu600oVPA"; logging-data="22320"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19qnEeW5ECzOgeuLzhjZB7UH/cseEHJZiA=" |
| User-Agent | Mozilla/5.0 (Windows NT 6.1; WOW64; rv:8.0) Gecko/20111105 Thunderbird/8.0 |
| In-Reply-To | <alpine.DEB.2.00.1112161852030.13913@urchin.earth.li> |
| Cancel-Lock | sha1:ppYBcKSEIqSU83LuRiygEHxRSHQ= |
| Xref | x330-a1.tempe.blueboxinc.net comp.lang.java.programmer:10828 |
Show key headers only | View raw
On 12/16/2011 10:58 AM, Tom Anderson wrote: > On Fri, 16 Dec 2011, Roedy Green wrote: > >> Let's say I had a million records each with a text field. I wanted to >> find all records that contained a given substring. Are there fast >> algorithms to do that or do you have to scan the whole thing linearly? > > http://en.wikipedia.org/wiki/Inverted_index > An implementation of an inverted index is often called a search engine. <http://en.wikipedia.org/wiki/Search_engine_%28computing%29> One of the best known implementations is a web search engine. Alt Vista, Yahoo, and Google are well known examples. There's a Java based search engine I happen to know about, Lucene. <http://lucene.apache.org/java/docs/index.html> This is a roll-your-own library, but it has a lot of features. Beyond a cursory reading of the documentation I can't really say much about it unfortunately.
Back to comp.lang.java.programmer | Previous | Next — Previous in thread | Next in thread | Find similar | Unroll thread
Fast String search algorithm Roedy Green <see_website@mindprod.com.invalid> - 2011-12-16 10:08 -0800
Re: Fast String search algorithm Tom Anderson <twic@urchin.earth.li> - 2011-12-16 18:58 +0000
Re: Fast String search algorithm markspace <-@.> - 2011-12-17 09:26 -0800
Re: Fast String search algorithm Daniel Pitts <newsgroup.nospam@virtualinfinity.net> - 2011-12-17 21:07 -0800
Re: Fast String search algorithm Tassilo Horn <tassilo@member.fsf.org> - 2011-12-16 20:07 +0100
Re: Fast String search algorithm Roedy Green <see_website@mindprod.com.invalid> - 2011-12-16 13:32 -0800
Re: Fast String search algorithm Gene Wirchenko <genew@ocis.net> - 2011-12-16 12:12 -0800
Re: Fast String search algorithm glen herrmannsfeldt <gah@ugcs.caltech.edu> - 2011-12-16 21:22 +0000
Re: Fast String search algorithm Arne Vajhøj <arne@vajhoej.dk> - 2011-12-16 22:20 -0500
Re: Fast String search algorithm Lew <lewbloch@gmail.com> - 2011-12-16 21:39 -0800
Re: Fast String search algorithm Wanja Gayk <brixomatic@yahoo.com> - 2011-12-17 16:15 +0100
Re: Fast String search algorithm Travers Naran <tnaran@gmail.com> - 2011-12-18 09:17 -0800
csiph-web