Path: csiph.com!usenet.pasdenom.info!goblin2!goblin.stu.neva.ru!newsfeed.xs4all.nl!newsfeed1.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.005 X-Spam-Evidence: '*H*': 0.99; '*S*': 0.00; '16,': 0.03; 'cpython': 0.05; 'assuming': 0.09; 'difference,': 0.09; 'git': 0.09; 'mercurial': 0.09; 'read-only': 0.09; 'yeah,': 0.09; 'python': 0.11; '80k': 0.16; 'big,': 0.16; 'dvcs': 0.16; 'from:addr:rosuav': 0.16; 'from:name:chris angelico': 0.16; 'garbage': 0.16; 'guessing': 0.16; 'hmm.': 0.16; 'roy': 0.16; 'shallow': 0.16; 'thought.': 0.16; 'files.': 0.16; 'size,': 0.16; 'wrote:': 0.18; 'seems': 0.21; 'sort': 0.25; 'push': 0.26; 'gets': 0.27; 'header :In-Reply-To:1': 0.27; 'chris': 0.29; 'am,': 0.29; 'patch': 0.29; 'message-id:@mail.gmail.com': 0.30; "i'm": 0.30; 'easier': 0.31; 'that.': 0.31; 'lot.': 0.31; 'run': 0.32; "i'd": 0.34; 'problem.': 0.35; 'but': 0.35; 'received:google.com': 0.35; 'there': 0.35; '(we': 0.36; 'skip:~ 10': 0.36; 'done': 0.36; 'doing': 0.36; "didn't": 0.36; 'project': 0.37; 'being': 0.38; 'to:addr:python- list': 0.38; 'skip:- 10': 0.38; 'pm,': 0.38; 'that,': 0.38; 'does': 0.39; 'to:addr:python.org': 0.39; 'major': 0.40; 'everybody': 0.60; 'tell': 0.60; 'full': 0.61; 'natural': 0.68; 'building,': 0.68; 'smith': 0.68; 'bulk': 0.74; 'history,': 0.84; 'pike': 0.84; 'subject:Control': 0.84; 'rusi': 0.91; '2013': 0.98 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=gCPYMV9GKpABkfKxpg7UqzyWT6Qd2ctojJ4L3n/ywU8=; b=yg0C8RMBaOxoUgEIssJkOE9tjuOYbQeNBtgnszDhk2lsLL6F2nkLQrDBqUaHEBt3wc jqP6VyNM67oyS/DLm16sm815pvN3tTJ7ZlNVdw42bsHsXgu887vZCH1SPoi6zIDtoO6y lzjnDmQ1Or6zn1/AvrHLzpT8YZLAKxazFfh2/Ed8EK33BiKqCdOg7eAa9uVSBHrEV1AM PdNzA+2M68ZNIsb03qeUSLo+YqQMscIFRu/y6GPfgUAuIGHR96KTufZlNSjUC6nNIY4e X/G4BMDTk5tcHj1S0ts3oNW7I/wdm2i9TVP3oefZ4jVVtSvEA2Nb+pAgV/WTG29n+z8z wzkA== MIME-Version: 1.0 X-Received: by 10.220.182.193 with SMTP id cd1mr2910067vcb.32.1371355993726; Sat, 15 Jun 2013 21:13:13 -0700 (PDT) In-Reply-To: References: <98c13a55-dbf2-46a7-a2aa-8c5f052ff375@googlegroups.com> <2644d0de-9a81-41aa-b27a-cb4535964b58@googlegroups.com> <51BB8338.50006@davea.name> Date: Sun, 16 Jun 2013 14:13:13 +1000 Subject: Re: Version Control Software From: Chris Angelico To: python-list@python.org Content-Type: text/plain; charset=ISO-8859-1 X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 37 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1371355997 news.xs4all.nl 15882 [2001:888:2000:d::a6]:57039 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:48409 On Sun, Jun 16, 2013 at 1:55 PM, rusi wrote: > On Jun 16, 4:14 am, Chris Angelico wrote: >> On Sun, Jun 16, 2013 at 12:16 AM, Roy Smith wrote: >> > The advantage of DVCS is that everybody has a full copy of the repo. >> > The disadvantage of the DVCS is that every MUST have a full copy of the >> > repo. When a repo gets big, you may not want to pull all of that data >> > just to get the subtree you need. >> >> Yeah, and depending on size, that can be a major problem. While git >> _will_ let you make a shallow clone, it won't let you push from that, >> so it's good only for read-only repositories (we use git to manage >> software deployments at work - shallow clones are perfect) or for >> working with patch files. >> >> Hmm. ~/cpython/.hg is 200MB+, but ~/pike/.git is only 86MB. Does >> Mercurial compress its content? A tar.gz of each comes down, but only >> to ~170MB and ~75MB respectively, so I'm guessing the bulk of it is >> already compressed. But 200MB for cpython seems like a lot. > > [I am assuming that you have run "git gc --aggressive" before giving > those figures] They're both clones done for the purpose of building, so I hadn't run any sort of garbage collect. > Your data would tell me that python is about twice as large a project > as pike in terms of number of commits. Isn't this a natural conclusion? I didn't think there would be that much difference, tbh. Mainly, I'm just seeing cpython as not being 200MB of history, or so I'd thought. Pike has ~30K commits (based on 'git log --oneline|wc -l'); CPython has roughly 80K (based on 'hg log|grep changeset|wc -l' - there's likely an easier way but I don't know Mercurial). So yeah, okay, it's been doing more. But I still don't see 200MB in that. Seems a lot of content. ChrisA