Newsgroups: comp.lang.python
Date: Tue, 18 Sep 2012 12:17:20 -0700 (PDT)
Subject: Re: HMM based Chunker NLTK
From: subhabangalore@gmail.com
To: comp.lang.python@googlegroups.com
Cc: python-list@python.org

On Wednesday, September 19, 2012 12:40:00 AM UTC+5:30, Mark Lawrence wrote:
> On 18/09/2012 19:35, subhabangalore@gmail.com wrote:
> > Dear Group,
> > If anyone of the learned members can kindly help with a HMM/CRF based chunker on NLTK.
> >
> > Regards,
> > Subhabrata.
>
> Certainly but how do you intend paying us? :)
>
> An alternative approach is to provide us with an idea of what you've
> researched, what code you've written and what problems you've got with
> it. If you get a code exception please provide the complete traceback
> and a minimal code snippet that reproduces the issue. This way you're
> much more likely to get help.
>
> --
> Cheers.
>
> Mark Lawrence.

Two possibilities, Mark: either nltk.chunk.(something) or nltk.tag.hmm, etc.
I am thinking of experimenting with these. The regular-expression and
unigram/bigram based chunkers are running well, even with Indian languages.
The HMM tagger is also going great. I am still thinking about the issue.

Regards,
Subhabrata.
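
For reference, the HMM option mentioned above amounts to treating chunking as sequence tagging: the observations are POS tags and the hidden states are IOB chunk labels (B-NP, I-NP, O). The following is only a minimal pure-Python sketch of that idea, not NLTK code. The toy training data, the add-one smoothing, and the names train_hmm and viterbi are all assumptions made for illustration. With NLTK itself, the same idea would presumably go through nltk.tag.hmm trained on IOB-labelled sequences; that wiring is not shown here.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy data: each sentence is a list of (POS tag, IOB chunk label).
TRAIN = [
    [("DT", "B-NP"), ("NN", "I-NP"), ("VBD", "O"),
     ("DT", "B-NP"), ("JJ", "I-NP"), ("NN", "I-NP")],
    [("NNP", "B-NP"), ("VBZ", "O"), ("DT", "B-NP"), ("NN", "I-NP")],
    [("DT", "B-NP"), ("JJ", "I-NP"), ("NN", "I-NP"), ("VBD", "O"),
     ("IN", "O"), ("DT", "B-NP"), ("NN", "I-NP")],
]

START = "<s>"  # pseudo-state marking the beginning of a sentence


def train_hmm(sentences):
    """Count transitions and emissions; return add-one-smoothed log-probabilities."""
    trans = defaultdict(Counter)  # previous state -> counts of next state
    emit = defaultdict(Counter)   # state -> counts of observed symbol
    states, symbols = set(), set()
    for sent in sentences:
        prev = START
        for symbol, state in sent:
            trans[prev][state] += 1
            emit[state][symbol] += 1
            states.add(state)
            symbols.add(symbol)
            prev = state
    states = sorted(states)
    n_states = len(states)
    n_symbols = len(symbols) + 1  # one extra slot for unseen symbols

    def log_trans(prev, state):
        total = sum(trans[prev].values())
        return math.log((trans[prev][state] + 1) / (total + n_states))

    def log_emit(state, symbol):
        total = sum(emit[state].values())
        return math.log((emit[state][symbol] + 1) / (total + n_symbols))

    return states, log_trans, log_emit


def viterbi(symbols, states, log_trans, log_emit):
    """Return the most probable state sequence for the observed symbols."""
    if not symbols:
        return []
    # Best log-probability of any path ending in each state at position 0.
    best = {s: log_trans(START, s) + log_emit(s, symbols[0]) for s in states}
    back = []  # one dict of back-pointers per later position
    for symbol in symbols[1:]:
        new_best, pointers = {}, {}
        for s in states:
            prev = max(states, key=lambda p: best[p] + log_trans(p, s))
            new_best[s] = best[prev] + log_trans(prev, s) + log_emit(s, symbol)
            pointers[s] = prev
        best = new_best
        back.append(pointers)
    # Follow the back-pointers from the best final state.
    state = max(states, key=lambda s: best[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]


states, log_trans, log_emit = train_hmm(TRAIN)
print(viterbi(["DT", "NN", "VBD", "DT", "NN"], states, log_trans, log_emit))
```

The resulting IOB labels can then be grouped into chunks: a B- label opens a chunk, following I- labels extend it, and O closes it. On real data the model would need a proper IOB-annotated corpus and a better estimator than add-one smoothing.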