To: python-list@python.org
From: Dennis Lee Bieber <wlfraed@ix.netcom.com>
Subject: Re: Finding size of Variable
Date: Wed, 05 Feb 2014 22:14:53 -0500
Organization: IISS Elusive Unicorn
References: <8e4c1ab1-e65d-483f-ad9d-6933ae2052c3@googlegroups.com> <lcred6$q3r$1@ger.gmane.org> <mailman.6405.1391542145.18130.python-list@python.org> <7e7d3200-a4ae-4842-ad8d-68b4435b9006@googlegroups.com> <52f219c5$0$29972$c3e8da3$5496439d@news.astraweb.com> <CAPTjJmoJUxwRaq1Xxz9kz8NK+5YxsvtLpcxCoPjU7RePWPHOKg@mail.gmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.6433.1391656482.18130.python-list@python.org>
Lines: 37
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:65504

On Wed, 5 Feb 2014 22:44:47 +1100, Chris Angelico <rosuav@gmail.com>
declaimed the following:

>On Wed, Feb 5, 2014 at 10:00 PM, Steven D'Aprano
><steve+comp.lang.python@pearwood.info> wrote:
>>> where stopWords.txt is a file of size 4KB
>>
>> My guess is that if you split a 4K file into words, then put the words
>> into a list, you'll probably end up with 6-8K in memory.
>
>I'd guess rather more; Python strings have a fair bit of fixed
>overhead, so with a whole lot of small strings, it will get more
>costly.
>
>>>> sys.version
>'3.4.0b2 (v3.4.0b2:ba32913eb13e, Jan  5 2014, 16:23:43) [MSC v.1600 32
>bit (Intel)]'
>>>> sys.getsizeof("asdf")
>29
>

>>> import sys
>>> indata = "221B or not to be seeing you again"
>>> sys.getsizeof(indata)
67
>>> worddata = indata.split()
>>> worddata
['221B', 'or', 'not', 'to', 'be', 'seeing', 'you', 'again']
>>> sys.getsizeof(worddata) + sum(sys.getsizeof(wd) for wd in worddata)
451

	That's close to a 7X expansion (451 bytes versus 67) from just
splitting a single line into a list of words.
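
	The same measurement can be wrapped in a small function, as a rough
sketch only. The name split_footprint is made up for illustration, and the
byte counts depend on the Python version and on whether the build is 32-bit
or 64-bit, so no particular output is claimed here.

```python
import sys

def split_footprint(text):
    """Return (bytes for the original string, bytes for the split word list)."""
    words = text.split()
    original = sys.getsizeof(text)
    # getsizeof() on a list counts only the list object and its array of
    # references, so the size of each word string is added separately.
    total = sys.getsizeof(words) + sum(sys.getsizeof(wd) for wd in words)
    return original, total

original, total = split_footprint("221B or not to be seeing you again")
print(original, total, round(total / original, 1))
```

	Note that this counts every list element separately; if the
interpreter happens to share one string object between several entries, the
sum overstates the real footprint somewhat.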
-- 
	Wulfraed                 Dennis Lee Bieber         AF6VN
    wlfraed@ix.netcom.com    HTTP://wlfraed.home.netcom.com/