From: kishjeff
Newsgroups: comp.apps.spreadsheets
Subject: Semantic data analysis
Date: Fri, 25 Nov 2011 08:26:45 -0800 (PST)
Message-ID: <6ce323e2-7ec3-4b3f-aaca-ba3566626f30@n6g2000vbg.googlegroups.com>

Hi,

Can someone suggest an approach, a contact, or a place to look? I had a work problem that I got personally interested in.

Problem: given hundreds of Excel sheets in minor variations of ten layouts, is it possible to use semantic analysis or some other method (machine learning?) to extract data? I think visually recognizing areas with certain types of groupings might be possible.

Thanks,
Jeff
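
P.S. To make the "recognizing areas with certain types of groupings" idea concrete, here is one way it might be sketched. This is only an illustration under assumptions, not a tested solution: each sheet is assumed to be already loaded into a plain list of rows (by whatever Excel reader is at hand), and the names cell_type, signature, find_blocks and closest_layout are made up for this example. It reduces every cell to a type code (text, number, empty), groups touching non-empty cells into blocks, and picks the known layout whose type pattern is most similar.

```python
from difflib import SequenceMatcher


def cell_type(value):
    """Map a cell value to a one-character type code (hypothetical scheme)."""
    if value is None or value == "":
        return "."          # empty cell
    if isinstance(value, (int, float)):
        return "N"          # numeric cell
    return "T"              # anything else is treated as text


def signature(grid):
    """Encode a sheet as one string of type codes, rows separated by '/'."""
    return "/".join("".join(cell_type(v) for v in row) for row in grid)


def find_blocks(grid):
    """Group non-empty cells that touch horizontally or vertically.

    Returns bounding boxes as (top, left, bottom, right), 0-based, inclusive,
    in the order the blocks are first met when scanning row by row.
    """
    seen = set()
    blocks = []
    for r, row in enumerate(grid):
        for c, value in enumerate(row):
            if cell_type(value) == "." or (r, c) in seen:
                continue
            # Flood fill outward from this cell.
            stack = [(r, c)]
            seen.add((r, c))
            cells = []
            while stack:
                y, x = stack.pop()
                cells.append((y, x))
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < len(grid) and 0 <= nx < len(grid[ny])
                            and (ny, nx) not in seen
                            and cell_type(grid[ny][nx]) != "."):
                        seen.add((ny, nx))
                        stack.append((ny, nx))
            ys = [y for y, _ in cells]
            xs = [x for _, x in cells]
            blocks.append((min(ys), min(xs), max(ys), max(xs)))
    return blocks


def closest_layout(grid, templates):
    """Return the name of the template signature most similar to this sheet."""
    sig = signature(grid)
    return max(
        templates,
        key=lambda name: SequenceMatcher(None, sig, templates[name]).ratio(),
    )


if __name__ == "__main__":
    # Invented sample data: a title cell, a blank row, then a small table.
    sheet = [
        ["Invoice", "", ""],
        ["", "", ""],
        ["Item", "Qty", "Price"],
        ["Widget", 2, 9.5],
        ["Gadget", 1, 3.0],
    ]
    templates = {"invoice": "T../.../TTT/TNN", "ledger": "TT/NN/NN"}
    print(find_blocks(sheet))
    print(closest_layout(sheet, templates))
```

With ten known layouts, one hand-made signature per layout would play the role of the templates dictionary; the block boxes then say where in a given variant the title and the table actually sit. Real sheets with merged cells, dates, or formulas would need a richer type scheme than the three codes used here.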