Path: csiph.com!usenet.pasdenom.info!aioe.org!news.stack.nl!newsfeed.xs4all.nl!newsfeed3a.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail
Date: Wed, 5 Feb 2014 18:46:04 -0600
From: Tim Chase <python.list@tim.thechases.com>
To: python-list@python.org
Subject: Re: parse a csv file into a text file
In-Reply-To: <5c268845-003f-4e24-b27a-c89e9fbfcc6c@googlegroups.com>
References: <5c268845-003f-4e24-b27a-c89e9fbfcc6c@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.6429.1391647532.18130.python-list@python.org>
Lines: 59
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:65495

On 2014-02-05 16:10, Zhen Zhang wrote:
> import csv
> file = open('raw.csv')

Asaf recommended using string methods to split the file.  Keep doing
what you're doing (using the csv module), as it attends to a lot of
edge-cases that will trip you up otherwise.  I learned this the hard
way several years into my Python career. :-)

> reader = csv.reader(file)
> 
> f = open('NicelyDone.text','w')
> 
> for line in reader:
>       f.write("%s %s"%line[1],%line[5])

Here, I'd start by naming the pieces that you get, so do

  for line in reader:
    location = line[1]
    value = line[5]

> Also, I have to process the first column eg, "Toronto (Ont.)" into
> "Toronto". I am familiar with the function find(), I assume that i
> could extract Toronto out of Toronto(Ont.) using "(" as the
> stopping character, but based on my research , I have no idea how
> to use it and ask it to return me the string(Toronto).

You can use the .split() method to split a string, so you could do
something like

  if '(' in location:
    bits = location.split('(')
    # at this point, bits = ['Toronto ', 'Ont.)']
    location = bits[0].strip() # also strip it to remove whitespace

> 1:What is the data format for line[1], if it is string how come
> f.write()does not work. if it is not string, how do i convert it to
> a string?

The problem is not that "it is not a string" but that you passing
multiple parameters, the second of which is invalid Python because it
has an extra percent-sign.  First create the one string that you
want to output:

  output = "%s %s\n" % (location, bits)

and then write it out to the file:

  f.write(output)

rather than trying to do it all in one pass.

-tkc