
ANN: bcolz 0.7.0, columnar, chunked and compressed datasets at your fingertips

Date Fri, 25 Jul 2014 07:13:37 +0200
From Francesc Alted <faltet@gmail.com>
To python-list@python.org



======================
Announcing bcolz 0.7.0
======================

What's new
==========

This release adds support for Python 3, Pandas and HDF5/PyTables
conversion, support for different compressors via the latest release of
Blosc, and a new `iterblocks()` iterator.

Also, intensive benchmarking has led to an important tuning of the
buffer size parameters, so that compression and evaluation go faster
than ever.  Together, bcolz and the Blosc compressor are finally
fulfilling the promise of accelerating memory I/O, at least for some
real scenarios:

http://nbviewer.ipython.org/github/Blosc/movielens-bench/blob/master/querying-ep14.ipynb#Plots 


``bcolz`` is a renaming of the ``carray`` project.  The new goals for
the project are to create simple yet flexible compressed containers
that can live either on-disk or in-memory, with some high-performance
iterators (like `iter()`, `where()`) for querying them.

For more detailed info, see the release notes in:
https://github.com/Blosc/bcolz/wiki/Release-Notes


What it is
==========

bcolz provides columnar and compressed data containers.  Column storage
allows for efficiently querying tables with a large number of columns.
It also allows for cheap addition and removal of columns.  In addition,
bcolz objects are compressed by default for reducing memory/disk I/O
needs.  The compression process is carried out internally by Blosc, a
high-performance compressor that is optimized for binary data.
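
To make the columnar, chunked and compressed layout more concrete, here
is a minimal sketch in pure Python.  It is not bcolz's implementation or
API: the `CompressedColumn` class, its `blocks()` method and `CHUNK_LEN`
are hypothetical names used only for illustration, and zlib from the
standard library stands in for Blosc.

```python
import zlib
from array import array

CHUNK_LEN = 4096  # items per chunk; an arbitrary value chosen for this sketch


class CompressedColumn:
    """Illustrative chunked, compressed column (hypothetical, not bcolz's carray)."""

    def __init__(self, values, typecode="d"):
        self.typecode = typecode
        self.length = 0
        self.chunks = []  # each entry is one zlib-compressed chunk
        values = list(values)
        for start in range(0, len(values), CHUNK_LEN):
            block = array(typecode, values[start:start + CHUNK_LEN])
            self.chunks.append(zlib.compress(block.tobytes()))
            self.length += len(block)

    def blocks(self):
        """Decompress and yield one chunk at a time."""
        for raw in self.chunks:
            block = array(self.typecode)
            block.frombytes(zlib.decompress(raw))
            yield block

    def __iter__(self):
        for block in self.blocks():
            yield from block

    def __len__(self):
        return self.length

    def nbytes_compressed(self):
        return sum(len(chunk) for chunk in self.chunks)


# A "table" is just a mapping from column name to an independent column.
table = {
    "user_id": CompressedColumn(range(10000), typecode="q"),
    "rating": CompressedColumn((i % 5 + 1 for i in range(10000)), typecode="d"),
}

# Adding or removing a column does not rewrite the other columns.
table["flag"] = CompressedColumn((i % 2 for i in range(10000)), typecode="b")
del table["flag"]
```

Because each column is stored independently in this sketch, adding or
removing one is a dictionary operation that never touches the others,
and a scan over one column only decompresses that column's chunks.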

bcolz can use numexpr internally so as to accelerate many vector and
query operations (although it can use pure NumPy for doing so too).
numexpr optimizes memory usage and uses several cores for doing the
computations, so it is blazing fast.  Moreover, the carray/ctable
containers can be disk-based, and it is possible to use them for
seamlessly performing out-of-memory computations.
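
The idea of querying data block by block, so that only a small piece is
ever decompressed at once, can be sketched as follows.  This is an
illustration in pure Python under stated assumptions, not how bcolz or
numexpr evaluate queries: `compress_blocks`, `decompress_block` and this
`where` helper are hypothetical, zlib stands in for Blosc, and the
predicate is a plain Python callable rather than a numexpr expression.

```python
import zlib
from array import array


def compress_blocks(values, typecode, block_len=1024):
    """Split values into fixed-size blocks and compress each one with zlib."""
    values = list(values)
    return [
        zlib.compress(array(typecode, values[i:i + block_len]).tobytes())
        for i in range(0, len(values), block_len)
    ]


def decompress_block(raw, typecode):
    """Rebuild one typed block from its compressed bytes."""
    block = array(typecode)
    block.frombytes(zlib.decompress(raw))
    return block


def where(columns, typecodes, predicate):
    """Yield matching rows as dicts, decompressing one block per column at a time."""
    names = list(columns)
    for raw_blocks in zip(*(columns[name] for name in names)):
        blocks = [
            decompress_block(raw, typecodes[name])
            for raw, name in zip(raw_blocks, names)
        ]
        for row in zip(*blocks):
            record = dict(zip(names, row))
            if predicate(record):
                yield record


typecodes = {"user_id": "q", "rating": "d"}
columns = {
    "user_id": compress_blocks(range(5000), "q"),
    "rating": compress_blocks((i % 5 + 1 for i in range(5000)), "d"),
}

hits = list(
    where(columns, typecodes, lambda r: r["rating"] == 5 and r["user_id"] < 20)
)
```

Since `where` is a generator, peak memory in this sketch is bounded by
one decompressed block per column, regardless of how many rows the
table holds.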

bcolz has minimal dependencies (NumPy), comes with an exhaustive test
suite and fully supports both 32-bit and 64-bit platforms.  Also, it is
typically tested on both UNIX and Windows operating systems.


Installing
==========

bcolz is in the PyPI repository, so installing it is easy:

$ pip install -U bcolz


Resources
=========

Visit the main bcolz site repository at:
http://github.com/Blosc/bcolz

Manual:
http://bcolz.blosc.org

Home of Blosc compressor:
http://blosc.org

Users' mailing list:
bcolz@googlegroups.com
http://groups.google.com/group/bcolz

License is the new BSD:
https://github.com/Blosc/bcolz/blob/master/LICENSES/BCOLZ.txt


----

   **Enjoy data!**

-- Francesc Alted
