Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.python > #27896 > unrolled thread
| Started by | "Steven W. Orr" <steveo@syslang.net> |
|---|---|
| First post | 2012-08-25 22:42 -0400 |
| Last post | 2012-08-26 12:00 +0200 |
| Articles | 3 — 3 participants |
Back to article view | Back to comp.lang.python
This discussion starts older than the indexed window; earlier articles aren't shown. The article labeled Started by
below is the oldest one visible, not the original post.
Re: Computing win/loss records in Python "Steven W. Orr" <steveo@syslang.net> - 2012-08-25 22:42 -0400
Re: Computing win/loss records in Python Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2012-08-26 03:19 +0000
Re: Computing win/loss records in Python Hans Mulder <hansmu@xs4all.nl> - 2012-08-26 12:00 +0200
| From | "Steven W. Orr" <steveo@syslang.net> |
|---|---|
| Date | 2012-08-25 22:42 -0400 |
| Subject | Re: Computing win/loss records in Python |
| Message-ID | <mailman.3823.1345948986.4697.python-list@python.org> |
On 8/25/2012 10:20 PM, Christopher McComas wrote:
> Greetings,
>
> I have code that I run via Django that grabs the results from various sports from formatted text files. The script iterates over every line in the formatted text files, finds the team in the Postgres database updates their w/l record depending on the outcome on that line, saves the team's row in the db, and then moves on to the next line in the file.
>
> I'm trying to get away from Django for this project, I want to run the files, get the W/L results and output a formatted text file with the teams and their W/L records. What's confusing me I guess how to store the data/results as the wins and losses tally up. We're talking hundreds of teams, thousands of games, but a quick example would be:
>
> Marshall
> Ohio State
> Kentucky
> Indiana
>
> Marshall,24,Ohio State,48,
> Kentucky,14,Indiana,10,
> Marshall,10,Indiana,7,
> Ohio State,28,Kentucky,10
>
> That's just a quick example, I can handle seperating the data in the lines, figuring it all out, I just am unsure of how to keep a running total of a team's record. I would do "for line in file:" then on the first line I see that Marshall lost so they would have 1, Ohio State won so they'd have 1 win. It'd go to the next line Kentucky 1 win, Indiana 1 loss, then on the 3rd line, Marshall got a win so they'd have 1 win, but it would have to remember that loss from line 1...
>
> Does this make sense?
>
> Thanks,
win_count = defaultdict(int)
loss_count = defaultdict(int)
items = line.split(',')
if items[1] > items[3]:
windex = 0
lossdex = 2
else:
windex = 2
lossdex = 0
win_count[windex] += 1
loss_count[lossdex] += 1
Zat help?
--
Time flies like the wind. Fruit flies like a banana. Stranger things have .0.
happened but none stranger than this. Does your driver's license say Organ ..0
Donor?Black holes are where God divided by zero. Listen to me! We are all- 000
individuals! What if this weren't a hypothetical question?
steveo at syslang.net
[toc] | [next] | [standalone]
| From | Steven D'Aprano <steve+comp.lang.python@pearwood.info> |
|---|---|
| Date | 2012-08-26 03:19 +0000 |
| Message-ID | <503995ba$0$6574$c3e8da3$5496439d@news.astraweb.com> |
| In reply to | #27896 |
On Sat, 25 Aug 2012 22:42:59 -0400, Steven W. Orr wrote:
> win_count = defaultdict(int)
> loss_count = defaultdict(int)
When I try that, I get "NameError: name 'defaultdict' is not defined."
I think it is rather unfair on beginners to show them code that almost,
but not quite, works, and expect them to somehow work out what this
mysterious "defaultdict" is.
The answer is probably to do this first:
from collections import defaultdict
> items = line.split(',')
> if items[1] > items[3]:
> windex = 0
> lossdex = 2
That's not going to work, because you are doing string comparisons
instead of numeric comparisons. Consider:
Kentucky,6,Indiana,59
'6' > '59' and you will wrongly count that as a win to Kentucky.
> else:
> windex = 2
> lossdex = 0
> win_count[windex] += 1
> loss_count[lossdex] += 1
And that certainly won't work, because all you are doing is counting how
many times the first team beats the second, instead of counting how many
times each team wins.
--
Steven
[toc] | [prev] | [next] | [standalone]
| From | Hans Mulder <hansmu@xs4all.nl> |
|---|---|
| Date | 2012-08-26 12:00 +0200 |
| Message-ID | <5039f3c6$0$6900$e4fe514c@news2.news.xs4all.nl> |
| In reply to | #27896 |
On 26/08/12 04:42:59, Steven W. Orr wrote:
> On 8/25/2012 10:20 PM, Christopher McComas wrote:
>> Greetings,
>>
>> I have code that I run via Django that grabs the results from various
>> sports from formatted text files. The script iterates over every line
>> in the formatted text files, finds the team in the Postgres database
>> updates their w/l record depending on the outcome on that line, saves
>> the team's row in the db, and then moves on to the next line in the file.
>>
>> I'm trying to get away from Django for this project, I want to run the
>> files, get the W/L results and output a formatted text file with the
>> teams and their W/L records. What's confusing me I guess how to store
>> the data/results as the wins and losses tally up. We're talking
>> hundreds of teams, thousands of games, but a quick example would be:
>>
>> Marshall
>> Ohio State
>> Kentucky
>> Indiana
>>
>> Marshall,24,Ohio State,48,
>> Kentucky,14,Indiana,10,
>> Marshall,10,Indiana,7,
>> Ohio State,28,Kentucky,10
>>
>> That's just a quick example, I can handle seperating the data in the
>> lines, figuring it all out, I just am unsure of how to keep a running
>> total of a team's record. I would do "for line in file:" then on the
>> first line I see that Marshall lost so they would have 1, Ohio State
>> won so they'd have 1 win. It'd go to the next line Kentucky 1 win,
>> Indiana 1 loss, then on the 3rd line, Marshall got a win so they'd
>> have 1 win, but it would have to remember that loss from line 1...
>>
>> Does this make sense?
>>
>> Thanks,
>
> win_count = defaultdict(int)
> loss_count = defaultdict(int)
>
> items = line.split(',')
> if items[1] > items[3]:
> windex = 0
> lossdex = 2
> else:
> windex = 2
> lossdex = 0
> win_count[windex] += 1
> loss_count[lossdex] += 1
I think you meant:
win_count[items[windex]] += 1
loss_count[items[windex]] += 1
I think it would be more readable to do:
from collections import defaultdict
win_count = defaultdict(int)
loss_count = defaultdict(int)
items = line.split(',')
if int(items[1]) > int(items[3]):
winner = items[0]
loser = items[2]
else:
winner = items[2]
loser = items[0]
win_count[winner] += 1
loss_count[loser] += 1
It looks like you're afraid of wasting RAM by needlessly
copying strings. However, this fear is unfounded: Python
doesn't copy strings, unless you tell it to do so explictly.
An assignment like "winner = items[0]" doesn't copy the string;
it merely creates a new reference to the existing string.
Hope this helps,
-- HansM
[toc] | [prev] | [standalone]
Back to top | Article view | comp.lang.python
csiph-web