Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.python > #100009
| Path | csiph.com!news.swapon.de!fu-berlin.de!uni-berlin.de!not-for-mail |
|---|---|
| From | Jason Friedman <jsf80238@gmail.com> |
| Newsgroups | comp.lang.python |
| Subject | Re: Exclude text within quotation marks and words beginning with a capital letter |
| Date | Fri, 4 Dec 2015 10:38:43 -0700 |
| Lines | 12 |
| Message-ID | <mailman.202.1449250726.14615.python-list@python.org> (permalink) |
| References | <a1da3bd6-5643-46b4-bcc5-69659367d491@googlegroups.com> |
| Mime-Version | 1.0 |
| Content-Type | text/plain; charset=UTF-8 |
| X-Trace | news.uni-berlin.de r/bZbV/9otXMyZiKLiWTNgGmrfJSpgnO6Bo4A5qlkvCw== |
| Return-Path | <jsf80238@gmail.com> |
| X-Original-To | python-list@python.org |
| Delivered-To | python-list@mail.python.org |
| X-Spam-Status | OK 0.028 |
| X-Spam-Evidence | '*H*': 0.94; '*S*': 0.00; 'subject:text': 0.04; 'cc:addr:python-list': 0.09; 'python': 0.10; '2.7': 0.13; 'subject: \n ': 0.15; 'received:io': 0.16; 'received:psf.io': 0.16; 'thanks.': 0.18; 'language': 0.19; 'cc:2**0': 0.20; 'cc:addr:python.org': 0.20; 'pos': 0.22; 'import': 0.24; 'words': 0.24; 'written': 0.24; 'header:In-Reply-To:1': 0.24; 'compatible': 0.27; 'coding': 0.27; 'message-id:@mail.gmail.com': 0.27; 'exclude': 0.29; 'identifies': 0.29; '(including': 0.30; 'advice': 0.35; 'received:google.com': 0.35; 'text': 0.35; 'text.': 0.35; 'received:74.125.82': 0.35; 'but': 0.36; 'url:org': 0.36; 'beginning': 0.36; 'subject:: ': 0.37; 'wanted': 0.37; 'sure': 0.39; 'subject:with': 0.40; 'within': 0.64; 'natural': 0.67; 'upper': 0.76; 'subject:letter': 0.84; 'to:none': 0.91; 'quotation': 0.93 |
| DKIM-Signature | v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:cc :content-type; bh=Ng1jP8YtR+gixRM3Bc0iMxWr91880/59ytznmdnrWRE=; b=ZyUpgw8Ieb/7pekVTIwA7qN5EvlXldq2HtLyY/EUZHLPPmTQVz7bRJ/5Kan25P6Db6 secgvu2sw95HGkF5rmhYESmBrq8tVEn5k75QHys4gQ8PrjpJo7OFR6xLXRZLoim1x8hU lRh47BB+SEBMfrVYuDTf5eroaT52/XxPPajcGnWbc4kW3naQv2Wgd8LDWOWN00Q4Ob2B 9JfQd3xGkmhueZ4rGWKAG6VjkNpsmEEVOfdly2QXhl44r8d9qJsA68/fdhn17hVn2Uw/ JJJ5AQy6w1VuDlvxLEpqIgTrj9LAc49790uMxFUyOsI0osED5/RkwLb3lmBuqHyJdIfw 2GBQ== |
| X-Received | by 10.194.250.39 with SMTP id yz7mr20501208wjc.92.1449250723865; Fri, 04 Dec 2015 09:38:43 -0800 (PST) |
| In-Reply-To | <a1da3bd6-5643-46b4-bcc5-69659367d491@googlegroups.com> |
| X-Content-Filtered-By | Mailman/MimeDel 2.1.20+ |
| X-BeenThere | python-list@python.org |
| X-Mailman-Version | 2.1.20+ |
| Precedence | list |
| List-Id | General discussion list for the Python programming language <python-list.python.org> |
| List-Unsubscribe | <https://mail.python.org/mailman/options/python-list>, <mailto:python-list-request@python.org?subject=unsubscribe> |
| List-Archive | <http://mail.python.org/pipermail/python-list/> |
| List-Post | <mailto:python-list@python.org> |
| List-Help | <mailto:python-list-request@python.org?subject=help> |
| List-Subscribe | <https://mail.python.org/mailman/listinfo/python-list>, <mailto:python-list-request@python.org?subject=subscribe> |
| Xref | csiph.com comp.lang.python:100009 |
Show key headers only | View raw
> > I am working on a program that is written in Python 2.7 to be compatible > with the POS tagger that I import from Pattern. The tagger identifies all > the nouns in a text. I need to exclude from the tagger any text that is > within quotation marks, and also any word that begins with an upper case > letter (including words at the beginning of sentences). > > Any advice on coding that would be gratefully received. Thanks. > Perhaps overkill, but wanted to make sure you knew about the Natural Language Toolkit: http://www.nltk.org/.
Back to comp.lang.python | Previous | Next — Previous in thread | Find similar | Unroll thread
Exclude text within quotation marks and words beginning with a capital letter Kevin Glover <kevingloveruk@gmail.com> - 2015-12-01 03:17 -0800 Re: Exclude text within quotation marks and words beginning with a capital letter Peter Otten <__peter__@web.de> - 2015-12-01 13:43 +0100 Re: Exclude text within quotation marks and words beginning with a capital letter Jason Friedman <jsf80238@gmail.com> - 2015-12-04 10:38 -0700
csiph-web