Path: csiph.com!usenet.pasdenom.info!news.etla.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed3a.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
MIME-Version: 1.0
In-Reply-To: <bd1c0f0d-febf-482c-a53d-a980d2f091cd@googlegroups.com>
References: <bd1c0f0d-febf-482c-a53d-a980d2f091cd@googlegroups.com>
Date: Mon, 19 Jan 2015 19:40:18 -0700
Subject: Re: Storing dataset from Condtional FreqDist
From: Jason Friedman <jsf80238@gmail.com>
To: Jose <josemlv83@gmail.com>
Content-Type: text/plain; charset=UTF-8
Cc: python-list <python-list@python.org>
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.17878.1421721619.18130.python-list@python.org>
Lines: 23
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:84051

> Hello i have trying to store information in arff file but i has been really. Any ideas of how can i do that?
>
>
> with open('fileids3.txt', 'r') as f:
>
>         genres=[word.strip() for word in f.next().split(',')]
>
> with open('adjectifs2.txt', 'r') as g:
>         adj = [word.strip() for word in g.next().split(',')]
>
> freq = nltk.ConditionalFreqDist(
>      (genre, m)
>       for genre in brown.fileids()
>       for m in brown.words(fileids=genre))
>
>
> freq.tabulate(conditions=genres, samples=adj)

What do fileids3.txt and adjectifs2.txt contain (perhaps give us the
first few lines of each)?

What do you want your AARF file to look like (show us what the header
should look like and then the first few lines of data)?