From: Ted Flethuseo Newsgroups: comp.lang.ruby Subject: Reading XML to relational tables Date: Fri, 1 Apr 2011 17:47:22 -0500 Organization: Service de news de lacave.net Lines: 46 Message-ID: <236f02bc699a9c809bdec25e00a2d0f2@ruby-forum.com> NNTP-Posting-Host: bristol.highgroove.com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Trace: talisker.lacave.net 1301698078 79027 65.111.164.187 (1 Apr 2011 22:47:58 GMT) X-Complaints-To: abuse@lacave.net NNTP-Posting-Date: Fri, 1 Apr 2011 22:47:58 +0000 (UTC) X-Received-From: This message has been automatically forwarded from the ruby-talk mailing list by a gateway at comp.lang.ruby. If it is SPAM, it did not originate at comp.lang.ruby. Please report the original sender, and not us. Thanks! For more details about this gateway, please visit: http://blog.grayproductions.net/categories/the_gateway X-Mail-Count: 380782 X-Ml-Name: ruby-talk X-Rubymirror: Yes X-Ruby-Talk: <236f02bc699a9c809bdec25e00a2d0f2@ruby-forum.com> Path: csiph.com!x330-a1.tempe.blueboxinc.net!usenet.pasdenom.info!news.stben.net!talisker.lacave.net!lacave.net!not-for-mail Xref: x330-a1.tempe.blueboxinc.net comp.lang.ruby:2137 Hi everyone, I need to build 3 relational tables from an xml text. In this tables, I need to keep track of words that have the and tags in them along with the word mentioned and its count in the

tag. This is easier to illustrate with an example: I need to take this text:

My name is Ted, and I like coffee. Ted does not like tea.

I have a brother who likes tea but does not like coffee

To 3 normalized tables like this: ..p_table... p_id desc 1 My name is.... 2 I have a .... ..p_to_emph_table... p_id e_id count 1 2 1 2 1 1 2 2 1 ..emph_table... e_id emph_word 1 Tea 2 Coffee I am not sure what would be the best approach to parse this xml with ruby or what tool could help me do this efficiently? Any ideas appreciated, Ted. -- Posted via http://www.ruby-forum.com/.