Path: csiph.com!fu-berlin.de!uni-berlin.de!not-for-mail From: Tim Chase Newsgroups: comp.lang.python Subject: Re: new to python, help please !! Date: Thu, 12 Nov 2015 05:48:12 -0600 Lines: 34 Message-ID: References: <93aef8e5-3d6f-41f4-a625-cd3c2007686e@googlegroups.com> <5644005e$0$2932$c3e8da3$76491128@news.astraweb.com> <8737wbu49x.fsf@elektro.pacujo.net> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Trace: news.uni-berlin.de B0srblkC8C3KveA6C2mpMQKF1024N4SLG7cngGpFns9Q== Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.001 X-Spam-Evidence: '*H*': 1.00; '*S*': 0.00; 'else:': 0.03; 'subject:help': 0.07; 'chunk_size': 0.09; 'chunks': 0.09; 'eof': 0.09; 'subject:python': 0.14; '-tkc': 0.16; '1024': 0.16; 'from:addr:python.list': 0.16; 'from:addr:tim.thechases.com': 0.16; 'from:name:tim chase': 0.16; 'md5': 0.16; 'received:10.122': 0.16; 'received:io': 0.16; 'received:psf.io': 0.16; 'true:': 0.16; 'wrote:': 0.16; 'header:In-Reply-To:1': 0.24; 'compare': 0.27; 'large.': 0.29; 'subject:please': 0.35; 'should': 0.36; 'to:addr :python-list': 0.36; 'subject:: ': 0.37; 'received:10': 0.37; 'really': 0.37; 'two': 0.37; 'charset:us-ascii': 0.37; 'wanted': 0.37; 'files': 0.38; 'to:addr:python.org': 0.40; 'skip:n 10': 0.62; 'received:23': 0.84 X-Sender-Id: wwwh|x-authuser|tim@thechases.com X-Sender-Id: wwwh|x-authuser|tim@thechases.com X-MC-Relay: Neutral X-MailChannels-SenderId: wwwh|x-authuser|tim@thechases.com X-MailChannels-Auth-Id: wwwh X-MC-Loop-Signature: 1447328980412:39166612 X-MC-Ingress-Time: 1447328980412 In-Reply-To: <8737wbu49x.fsf@elektro.pacujo.net> X-Mailer: Claws Mail 3.11.1 (GTK+ 2.24.25; x86_64-pc-linux-gnu) X-AuthUser: tim@thechases.com X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.20+ Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Xref: csiph.com comp.lang.python:98696 On 2015-11-12 08:21, Marko Rauhamaa wrote: > And if you really wanted to compare two files that are known to > contain MD5 checksums, the simplest way is: > > with open('f1.md5') as f1, open('f2.md5') as f2: > if f1.read() == f2.read(): > ... > else: > ... Though that suffers if the files are large. Might try CHUNK_SIZE = 4 * 1024 # read 4k chunks # chunk_offset = 0 with open('f1.md5') as f1, open('f2.md5') as f2: while True: c1 = f1.read(CHUNK_SIZE) c2 = f2.read(CHUNK_SIZE) if c1 or c2: # chunk_offset += 1 if c1 != c2: not_the_same(c1, c2) # not_the_same(chunk_offset * CHUNK_SIZE, c1, c2) break else: # EOF the_same() break which should perform better if the files are huge -tkc