Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #27896 > unrolled thread

Re: Computing win/loss records in Python

Started by"Steven W. Orr" <steveo@syslang.net>
First post2012-08-25 22:42 -0400
Last post2012-08-26 12:00 +0200
Articles 3 — 3 participants

Back to article view | Back to comp.lang.python

This discussion starts older than the indexed window; earlier articles aren't shown. The article labeled Started by below is the oldest one visible, not the original post.


Contents

  Re: Computing win/loss records in Python "Steven W. Orr" <steveo@syslang.net> - 2012-08-25 22:42 -0400
    Re: Computing win/loss records in Python Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2012-08-26 03:19 +0000
    Re: Computing win/loss records in Python Hans Mulder <hansmu@xs4all.nl> - 2012-08-26 12:00 +0200

#27896 — Re: Computing win/loss records in Python

From"Steven W. Orr" <steveo@syslang.net>
Date2012-08-25 22:42 -0400
SubjectRe: Computing win/loss records in Python
Message-ID<mailman.3823.1345948986.4697.python-list@python.org>
On 8/25/2012 10:20 PM, Christopher McComas wrote:
> Greetings,
>
> I have code that I run via Django that grabs the results from various sports from formatted text files. The script iterates over every line in the formatted text files, finds the team in the Postgres database updates their w/l record depending on the outcome on that line, saves the team's row in the db, and then moves on to the next line in the file.
>
> I'm trying to get away from Django for this project, I want to run the files, get the W/L results and output a formatted text file with the teams and their W/L records. What's confusing me I guess how to store the data/results as the wins and losses tally up. We're talking hundreds of teams, thousands of games, but a quick example would be:
>
> Marshall
> Ohio State
> Kentucky
> Indiana
>
> Marshall,24,Ohio State,48,
> Kentucky,14,Indiana,10,
> Marshall,10,Indiana,7,
> Ohio State,28,Kentucky,10
>
> That's just a quick example, I can handle seperating the data in the lines, figuring it all out, I just am unsure of how to keep a running total of a team's record. I would do "for line in file:" then on the first line I see that Marshall lost so they would have 1, Ohio State won so they'd have 1 win. It'd go to the next line Kentucky 1 win, Indiana 1 loss, then on the 3rd line, Marshall got a win so they'd have 1 win, but it would have to remember that loss from line 1...
>
> Does this make sense?
>
> Thanks,

win_count = defaultdict(int)
loss_count = defaultdict(int)

items = line.split(',')
if items[1] > items[3]:
     windex = 0
     lossdex = 2
else:
     windex = 2
     lossdex = 0
win_count[windex] += 1
loss_count[lossdex] += 1

Zat help?


-- 
Time flies like the wind. Fruit flies like a banana. Stranger things have  .0.
happened but none stranger than this. Does your driver's license say Organ ..0
Donor?Black holes are where God divided by zero. Listen to me! We are all- 000
individuals! What if this weren't a hypothetical question?
steveo at syslang.net

[toc] | [next] | [standalone]


#27898

FromSteven D'Aprano <steve+comp.lang.python@pearwood.info>
Date2012-08-26 03:19 +0000
Message-ID<503995ba$0$6574$c3e8da3$5496439d@news.astraweb.com>
In reply to#27896
On Sat, 25 Aug 2012 22:42:59 -0400, Steven W. Orr wrote:

> win_count = defaultdict(int)
> loss_count = defaultdict(int)

When I try that, I get "NameError: name 'defaultdict' is not defined."

I think it is rather unfair on beginners to show them code that almost, 
but not quite, works, and expect them to somehow work out what this 
mysterious "defaultdict" is.

The answer is probably to do this first:

from collections import defaultdict


> items = line.split(',')
> if items[1] > items[3]:
>      windex = 0
>      lossdex = 2

That's not going to work, because you are doing string comparisons 
instead of numeric comparisons. Consider:

Kentucky,6,Indiana,59

'6' > '59' and you will wrongly count that as a win to Kentucky.


> else:
>      windex = 2
>      lossdex = 0
> win_count[windex] += 1
> loss_count[lossdex] += 1

And that certainly won't work, because all you are doing is counting how 
many times the first team beats the second, instead of counting how many 
times each team wins.


-- 
Steven

[toc] | [prev] | [next] | [standalone]


#27912

FromHans Mulder <hansmu@xs4all.nl>
Date2012-08-26 12:00 +0200
Message-ID<5039f3c6$0$6900$e4fe514c@news2.news.xs4all.nl>
In reply to#27896
On 26/08/12 04:42:59, Steven W. Orr wrote:
> On 8/25/2012 10:20 PM, Christopher McComas wrote:
>> Greetings,
>>
>> I have code that I run via Django that grabs the results from various
>> sports from formatted text files. The script iterates over every line
>> in the formatted text files, finds the team in the Postgres database
>> updates their w/l record depending on the outcome on that line, saves
>> the team's row in the db, and then moves on to the next line in the file.
>>
>> I'm trying to get away from Django for this project, I want to run the
>> files, get the W/L results and output a formatted text file with the
>> teams and their W/L records. What's confusing me I guess how to store
>> the data/results as the wins and losses tally up. We're talking
>> hundreds of teams, thousands of games, but a quick example would be:
>>
>> Marshall
>> Ohio State
>> Kentucky
>> Indiana
>>
>> Marshall,24,Ohio State,48,
>> Kentucky,14,Indiana,10,
>> Marshall,10,Indiana,7,
>> Ohio State,28,Kentucky,10
>>
>> That's just a quick example, I can handle seperating the data in the
>> lines, figuring it all out, I just am unsure of how to keep a running
>> total of a team's record. I would do "for line in file:" then on the
>> first line I see that Marshall lost so they would have 1, Ohio State
>> won so they'd have 1 win. It'd go to the next line Kentucky 1 win,
>> Indiana 1 loss, then on the 3rd line, Marshall got a win so they'd
>> have 1 win, but it would have to remember that loss from line 1...
>>
>> Does this make sense?
>>
>> Thanks,
> 
> win_count = defaultdict(int)
> loss_count = defaultdict(int)
> 
> items = line.split(',')
> if items[1] > items[3]:
>     windex = 0
>     lossdex = 2
> else:
>     windex = 2
>     lossdex = 0
> win_count[windex] += 1
> loss_count[lossdex] += 1

I think you meant:

win_count[items[windex]] += 1
loss_count[items[windex]] += 1


I think it would be more readable to do:

from collections import defaultdict

win_count = defaultdict(int)
loss_count = defaultdict(int)

items = line.split(',')
if int(items[1]) > int(items[3]):
    winner = items[0]
    loser = items[2]
else:
    winner = items[2]
    loser = items[0]
win_count[winner] += 1
loss_count[loser] += 1

It looks like you're afraid of wasting RAM by needlessly
copying strings.  However, this fear is unfounded: Python
doesn't copy strings, unless you tell it to do so explictly.
An assignment like "winner = items[0]" doesn't copy the string;
it merely creates a new reference to the existing string.


Hope this helps,

-- HansM

[toc] | [prev] | [standalone]


Back to top | Article view | comp.lang.python


csiph-web