Path: csiph.com!usenet.pasdenom.info!news.albasani.net!newsfeed.freenet.ag!news2.euro.net!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.000 X-Spam-Evidence: '*H*': 1.00; '*S*': 0.00; '(at': 0.04; 'output': 0.05; 'sized': 0.07; 'subject:skip:s 10': 0.07; 'bits': 0.09; 'bytes.': 0.09; 'friday,': 0.09; 'os.path': 0.09; 'shutil': 0.09; 'skip:o 50': 0.09; 'to:addr:comp.lang.python': 0.09; 'cc:addr :python-list': 0.11; 'python': 0.11; 'def': 0.12; 'wrote': 0.14; '"w")': 0.16; 'bytes)': 0.16; 'closed.': 0.16; 'flush': 0.16; 'incomplete': 0.16; 'involved?': 0.16; 'multiples': 0.16; 'number?': 0.16; 'one)': 0.16; 'run:': 0.16; 'simpson': 0.16; 'skip:/ 60': 0.16; 'temp': 0.16; 'wrote:': 0.18; "hasn't": 0.19; "python's": 0.19; "skip:' 30": 0.19; 'written': 0.21; 'command': 0.22; 'import': 0.22; 'cc:addr:gmail.com': 0.22; 'manual': 0.22; 'cc:addr:python.org': 0.22; 'print': 0.22; 'cc:2**1': 0.23; 'creating': 0.23; 'header:User-Agent:1': 0.23; 'driver': 0.24; 'earlier': 0.24; 'cheers,': 0.24; 'file.': 0.24; '(or': 0.24; 'cc:no real name:2**0': 0.24; 'sort': 0.25; "i've": 0.25; 'source': 0.25; 'skip:" 30': 0.26; 'gets': 0.27; 'header:In-Reply- To:1': 0.27; 'tried': 0.27; 'function': 0.29; 'skip:p 30': 0.29; '(like': 0.30; "i'm": 0.30; '(which': 0.31; 'code': 0.31; "skip:' 10": 0.31; 'usually': 0.31; 'question:': 0.31; 'skip:s 70': 0.31; 'file': 0.32; '(i.e.': 0.33; 'guess': 0.33; 'copying': 0.34; 'device': 0.34; "i'd": 0.34; 'subject: (': 0.35; 'received:209.85': 0.35; 'created': 0.35; 'something': 0.35; 'but': 0.35; 'received:google.com': 0.35; 'there': 0.35; 'skip:f 40': 0.36; 'skip:s 60': 0.36; "didn't": 0.36; 'thanks': 0.36; 'changing': 0.37; 'received:209': 0.37; 'does': 0.39; 'skip:t 30': 0.61; "you're": 0.61; 'complete': 0.62; 'show': 0.63; 'size.': 0.65; 'latest': 0.67; 'close': 0.67; 'results': 0.69; 'power': 0.76; 'difference.': 0.84; 'imagining': 0.84; 'ncr': 0.84; 'partial': 0.84; '2013': 0.98 X-Received: by 10.49.98.65 with SMTP id eg1mr834519qeb.2.1365758297380; Fri, 12 Apr 2013 02:18:17 -0700 (PDT) Newsgroups: comp.lang.python Date: Fri, 12 Apr 2013 02:18:17 -0700 (PDT) In-Reply-To: Complaints-To: groups-abuse@google.com Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=46.233.72.51; posting-account=fjrvzQoAAAAF4F9_gcVHWLVVScTZ6_Hn References: User-Agent: G2/1.0 X-Google-Web-Client: true X-Google-IP: 46.233.72.51 MIME-Version: 1.0 Subject: Re: shutil.copyfile is incomplete (truncated) From: Rob Schneider To: comp.lang.python@googlegroups.com Content-Type: text/plain; charset=ISO-8859-1 Cc: python-list@python.org, Rob Schneider X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Message-ID: Lines: 134 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1365758300 news.xs4all.nl 2661 [2001:888:2000:d::a6]:42440 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:43435 On Friday, 12 April 2013 09:26:21 UTC+1, Cameron Simpson wrote: > > | > > Question: is the size of the incomplete file a round number? (Like > > | > > a multiple of a decent sized power of 2>) > > [...] > > | Source (correct one) is 47,970 bytes. Target after copy of 45,056 > > | bytes. I've tried changing what gets written to change the file > > | size. It is usually this sort of difference. > > > > 45046 is exactly 11 * 4096. I'd say your I/O is using 4KB blocks, > > and the last partial block (to make it up to 47970) didn't get > > written (at the OS level). > > > > Earlier you wrote: > > | I have created a file in temp space, then use the function > > | "shutil.copyfile(fn,loc+fname)" from "fn" to "loc+fname". > > and: > > | Yes, there is a close function call before the copy is launched. No other writes. > > | Does Python wait for file close command to complete before proceeding? > > > > Please show us the exact code used to make the temp file. > > > > I would guess the temp file has not been closed (or flushed) before > > the call to copyfile. > > > > If you're copying data to a tempfile, it will only have complete > > buffers (i.e. multiples of 4096 bytes) in it until the final flush > > or close. > > > > So I'm imagining something like: > > > > tfp = open(tempfilename, "w") > > ... lots of tfp.write() ... > > shutil.copyfile(tempfilename, newfilename) > > > > Note above no flush or close of tfp. So the final incomplete I/O > > buffer is still in Python's memory; it hasn't been actually written > > to the temp file because the buffer has not been filled, and the file > > has not been closed. > > > > Anyway, can you show us the relevant bits of code involved? > > > > Cheers, > > -- > > Cameron Simpson > > > > Processes are like potatoes. - NCR device driver manual Thanks for the observation. Code (simplified but results in same flaw) (which a close, far as I can tell). def CreateSpeakerList1(): import shutil import filecmp import os.path t=get_template('speaker_list.html') fn=TEMP_DIR+SOC_SPEAKER_LIST fn=tempfile.gettempdir()+"/"+SOC_SPEAKER_LIST f=open(fn,'w') speaker_list=Speaker.objects.order_by('status__order','targetmtg__date') print " Creating " + SOC_SPEAKER_LIST + " ..." html=(smart_str(t.render(Context( { 'css_include_file':CSS_INCLUDE_FILE, 'css_link':False, 'title': ORG_NAME+" Speaker List", 'speaker_list': speaker_list, })))) f.write(html) f.close print " Wrote "+fn shutil.copyfile(fn,SOC_GENERAL_OUTPUT_FOLDER+SOC_SPEAKER_LIST) print "Filecompare :",filecmp.cmp(fn,SOC_GENERAL_OUTPUT_FOLDER+SOC_SPEAKER_LIST) print "Statinfo :"+fn+":\n", os.stat(fn) print "Statinfo :"+SOC_GENERAL_OUTPUT_FOLDER+SOC_SPEAKER_LIST+"\n", os.stat(SOC_GENERAL_OUTPUT_FOLDER+SOC_SPEAKER_LIST) return Output on latest run: Creating speakers.htm ... Wrote /var/folders/p_/n5lktj2n0r938_46jyqb52g40000gn/T/speakers.htm Filecompare : True Statinfo :/var/folders/p_/n5lktj2n0r938_46jyqb52g40000gn/T/speakers.htm: posix.stat_result(st_mode=33188, st_ino=32332374, st_dev=16777218L, st_nlink=1, st_uid=501, st_gid=20, st_size=45056, st_atime=1365758139, st_mtime=1365758139, st_ctime=1365758139) Statinfo :/Users/rmschne/Documents/ScottishOilClub/Output/speakers.htm posix.stat_result(st_mode=33188, st_ino=32143886, st_dev=16777218L, st_nlink=1, st_uid=501, st_gid=20, st_size=45056, st_atime=1365758029, st_mtime=1365758139, st_ctime=1365758139)