Path: csiph.com!usenet.pasdenom.info!weretis.net!feeder4.news.weretis.net!newsfeed.straub-nv.de!feed.xsnews.nl!border-1.ams.xsnews.nl!newsfeed.xs4all.nl!newsfeed5.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.000 X-Spam-Evidence: '*H*': 1.00; '*S*': 0.00; 'else:': 0.03; 'algorithm': 0.03; 'output': 0.04; 'linear': 0.07; 'subject:file': 0.07; 'url:pycon': 0.07; '"w")': 0.09; 'brackets': 0.09; 'dict': 0.09; 'iterate': 0.09; 'observation': 0.09; 'tuple': 0.09; "wouldn't": 0.11; '(2,': 0.16; "(i'm": 0.16; '-tkc': 0.16; '3):': 0.16; 'cleanly': 0.16; 'combination,': 0.16; 'entries,': 0.16; 'generator.': 0.16; 'iterating': 0.16; 'process?': 0.16; 'readable': 0.16; 'row': 0.16; 'semicolon': 0.16; 'think?': 0.16; 'wrote:': 0.17; 'version.': 0.17; 'tim': 0.18; '>>>': 0.18; 'appropriate': 0.20; 'received:209.85.216.46': 0.21; 'combination': 0.22; "python's": 0.23; 'seems': 0.23; 'header:In- Reply-To:1': 0.25; 'entries': 0.27; 'message-id:@mail.gmail.com': 0.27; "doesn't": 0.28; 'this?': 0.28; '>>>>': 0.29; 'chase': 0.29; 'overhead': 0.29; 'included': 0.29; 'point': 0.31; 'url:python': 0.32; 'file': 0.32; 'print': 0.32; 'skip:\xd0 10': 0.33; 'problem': 0.33; 'to:addr:python-list': 0.33; "can't": 0.34; 'received:google.com': 0.34; 'thanks': 0.34; 'list': 0.35; 'faster': 0.35; 'received:209.85': 0.35; 'something': 0.35; 'there': 0.35; 'uses': 0.37; 'why': 0.37; 'item': 0.37; 'received:209': 0.37; 'received:209.85.216': 0.37; 'subject:: ': 0.38; 'skip:o 20': 0.38; 'to:addr:python.org': 0.39; 'header:Received:5': 0.40; 'think': 0.40; 'skip:u 10': 0.60; 'link': 0.60; '8bit%:84': 0.62; 'different': 0.63; 'skip:n 10': 0.63; 'more': 0.63; 'making': 0.64; 'august': 0.66; 'touch': 0.69; 'actually,': 0.84; 'difference.': 0.84; 'different.': 0.84; 'raster': 0.84; 'url:2007': 0.84; 'hundred': 0.95 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; bh=/AytS/BM4qv1+dP/1YQ0FPIIT32qukkXHpNxTo0NM3E=; b=UczzoTjmrnVs28iL3ehLcjV2iE4BHW4Jsbw1o255jLPzbrtWn03I+NaFbO1O4yGW/u 0WfXsa/Bk1NUNX8NMRk9RGAjqbsG6fKlXieJc5fyIttkXzmgruX8MCDKQNBF4XiBiu8b J3jAHQcFkSNlo5zW4orkfKv5SFJVtQW2x9ulss6NOC+vZpz4qxt4JJ8191tGvPJlJZbl yXlOCoqFbpkJxhzBEK36AzO/nqju3TgpOqw26zzYWdGjbH5rKepip8HNUd+ZYhG9LruA Bp76Uh8mmUB7aXwMVHvqxru6li+Btx7oMCKcOm4lNpJdAezfJHgWlQ29+YkpdBQOINCJ v19g== MIME-Version: 1.0 In-Reply-To: <583C055C-9E99-4EAC-8F3A-D578C399826E@gmail.com> References: <930ab3d8-4ab9-446d-9970-ee811eb70a44@googlegroups.com> <50241F14.2060209@tim.thechases.com> <36EA3847-6713-4C12-B47B-9B5E10325F00@gmail.com> <502429C3.5000600@tim.thechases.com> <583C055C-9E99-4EAC-8F3A-D578C399826E@gmail.com> Date: Thu, 9 Aug 2012 16:53:49 -0500 Subject: Re: save dictionary to a file without brackets. From: Giuseppe Amatulli To: python-list@python.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.12 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 123 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1344549233 news.xs4all.nl 6893 [2001:888:2000:d::a6]:51776 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:26822 Thanks a lot for the clarification. Actually my problem is giving to raster dataset in geo-tif format find out unique pair combination, count the number of observation unique combination in rast1, count the number of observation unique combination in rast2, count the number of observation I try different solution and this seems to me the faster Rast00=3DdsRast00.GetRasterBand(1).ReadAsArray() Rast10=3DdsRast10.GetRasterBand(1).ReadAsArray() mask=3D( Rast00 !=3D 0 ) & ( Rast10 !=3D 0 ) # may be this masking operation can be included in the for loop Rast00_mask=3D Rast00[mask] # may be this masking operation can be included in the for loop Rast10_mask=3D Rast10[mask] # may be this masking operation can be included in the for loop array2D =3D np.array(zip( Rast00_mask,Rast10_mask)) unique_u=3Ddict() unique_k1=3Ddict() unique_k2=3Ddict() for key1,key2 in array2D : row =3D tuple((key1,key2)) if row in unique_u: unique_u[row] +=3D 1 else: unique_u[row] =3D 1 if key1 in unique_k1: unique_k1[key1] +=3D 1 else: unique_k1[key1] =3D 1 if key2 in unique_k2: unique_k2[key2] +=3D 1 else: unique_k2[key2] =3D 1 output =3D open(dst_file_rast0010, "w") for (a, b), c in unique_u.items(): print(a, b, c, file=3Doutput) output.close() output =3D open(dst_file_rast00, "w") for (a), b in unique_k1.items(): print(a, b, file=3Doutput) output.close() output =3D open(dst_file_rast10, "w") for (a), b in unique_k2.items(): print(a, b, file=3Doutput) output.close() What do you think? is there a way to speed up the process? Thanks Giuseppe On 9 August 2012 16:34, Roman Vashkevich wrote: > Actually, they are different. > Put a dict.{iter}items() in an O(k^N) algorithm and make it a hundred tho= usand entries, and you will feel the difference. > Dict uses hashing to get a value from the dict and this is why it's O(1). > > 10.08.2012, =D0=B2 1:21, Tim Chase =D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0= =D0=BB(=D0=B0): > >> On 08/09/12 15:41, Roman Vashkevich wrote: >>> 10.08.2012, =D0=B2 0:35, Tim Chase =D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0= =D0=BB(=D0=B0): >>>> On 08/09/12 15:22, Roman Vashkevich wrote: >>>>>> {(4, 5): 1, (5, 4): 1, (4, 4): 2, (2, 3): 1, (4, 3): 2} >>>>>> and i want to print to a file without the brackets comas and semicol= on in order to obtain something like this? >>>>>> 4 5 1 >>>>>> 5 4 1 >>>>>> 4 4 2 >>>>>> 2 3 1 >>>>>> 4 3 2 >>>>> >>>>> for key in dict: >>>>> print key[0], key[1], dict[key] >>>> >>>> This might read more cleanly with tuple unpacking: >>>> >>>> for (edge1, edge2), cost in d.iteritems(): # or .items() >>>> print edge1, edge2, cost >>>> >>>> (I'm making the assumption that this is a edge/cost graph...use >>>> appropriate names according to what they actually mean) >>> >>> dict.items() is a list - linear access time whereas with 'for >>> key in dict:' access time is constant: >>> http://python.net/~goodger/projects/pycon/2007/idiomatic/handout.html#u= se-in-where-possible-1 >> >> That link doesn't actually discuss dict.{iter}items() >> >> Both are O(N) because you have to touch each item in the dict--you >> can't iterate over N entries in less than O(N) time. For small >> data-sets, building the list and then iterating over it may be >> faster faster; for larger data-sets, the cost of building the list >> overshadows the (minor) overhead of a generator. Either way, the >> iterate-and-fetch-the-associated-value of .items() & .iteritems() >> can (should?) be optimized in Python's internals to the point I >> wouldn't think twice about using the more readable version. >> >> -tkc >> >> > --=20 Giuseppe Amatulli Web: www.spatial-ecology.net