Path: csiph.com!usenet.pasdenom.info!news.etla.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed3a.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.066 X-Spam-Evidence: '*H*': 0.87; '*S*': 0.00; 'that?': 0.05; 'cc:addr :python-list': 0.11; 'trying': 0.19; 'skip:f 30': 0.19; 'cc:addr:python.org': 0.22; 'header': 0.24; 'cc:2**0': 0.24; 'header:In-Reply-To:1': 0.27; 'message-id:@mail.gmail.com': 0.30; 'lines': 0.31; '(perhaps': 0.31; 'file': 0.32; 'subject:from': 0.34; 'but': 0.35; 'received:google.com': 0.35; 'should': 0.36; 'skip:o 20': 0.38; 'skip:[ 10': 0.38; 'how': 0.40; 'first': 0.61; 'information': 0.63; 'to:addr:gmail.com': 0.65 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=On9rRCikliMNy528StRiwR4dnxteTtCD+sJ6eDeUoRw=; b=vofeiVs95zteR/LhJfUuNkIswl0am5mtlRCqfiwSNsY4CT3GzW7S1HfJOU6nu3fEDX rJamo2jam0YXY7V7oOl72W+Bkw2ih7Nb3GZJs2MglCdxtpPmIfkQqsnwp/xbEO2qNG/c hDj+9FNiA7gY+wH722r4pVkF5E3hHb/cQN5kM8/dPzsFz7Ba6LwrVOKxyHLMxEow/f5x APOudpVUDTnuayo42qWMbZ0gyV6N9zjbcl5LChX0rnIYiJgrGbW8scy+yvdZty6TU1/u U/Z1Lrb6gxtMfezliILo0jXF7/rE4+sMzNwD1sqBLzANBmN2Lc8Wpms9Pz8WZoV43Xyy cCpg== MIME-Version: 1.0 X-Received: by 10.194.24.195 with SMTP id w3mr64060212wjf.135.1421721618378; Mon, 19 Jan 2015 18:40:18 -0800 (PST) In-Reply-To: References: Date: Mon, 19 Jan 2015 19:40:18 -0700 Subject: Re: Storing dataset from Condtional FreqDist From: Jason Friedman To: Jose Content-Type: text/plain; charset=UTF-8 Cc: python-list X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 23 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1421721619 news.xs4all.nl 2874 [2001:888:2000:d::a6]:42910 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:84051 > Hello i have trying to store information in arff file but i has been really. Any ideas of how can i do that? > > > with open('fileids3.txt', 'r') as f: > > genres=[word.strip() for word in f.next().split(',')] > > with open('adjectifs2.txt', 'r') as g: > adj = [word.strip() for word in g.next().split(',')] > > freq = nltk.ConditionalFreqDist( > (genre, m) > for genre in brown.fileids() > for m in brown.words(fileids=genre)) > > > freq.tabulate(conditions=genres, samples=adj) What do fileids3.txt and adjectifs2.txt contain (perhaps give us the first few lines of each)? What do you want your AARF file to look like (show us what the header should look like and then the first few lines of data)?