Path: csiph.com!newsfeed.hal-mli.net!feeder3.hal-mli.net!newsfeed.hal-mli.net!feeder1.hal-mli.net!newsreader4.netcologne.de!news.netcologne.de!newsfeed.freenet.ag!news2.euro.net!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.016 X-Spam-Evidence: '*H*': 0.97; '*S*': 0.00; 'algorithm': 0.03; 'true,': 0.04; 'modify': 0.05; 'bits': 0.07; '(first': 0.09; '16-bit': 0.09; '32-bit': 0.09; 'compression': 0.09; 'compute': 0.09; "wouldn't": 0.11; 'producing': 0.15; '"small"': 0.16; '11:09': 0.16; '24,': 0.16; 'encryption': 0.16; 'expert.': 0.16; 'hashes': 0.16; 'md5': 0.16; 'sure.': 0.16; 'wrote:': 0.17; 'byte': 0.17; 'bytes': 0.17; 'certainly': 0.17; 'thu,': 0.17; 'jan': 0.18; 'changes': 0.20; 'proposed': 0.20; 'thanks.': 0.21; 'large,': 0.22; "i've": 0.23; 'seems': 0.23; 'random': 0.24; 'header:In- Reply-To:1': 0.25; 'header:User-Agent:1': 0.26; 'url:wiki': 0.26; 'values': 0.26; 'am,': 0.27; 'change,': 0.27; "doesn't": 0.28; 'chris': 0.28; 'hash': 0.29; 'probability': 0.29; 'statements': 0.29; 'url:wikipedia': 0.29; 'definition': 0.29; 'function': 0.30; 'stuff': 0.30; '(and': 0.32; 'computing': 0.32; 'getting': 0.33; 'avoiding': 0.33; 'to:addr:python-list': 0.33; 'equal': 0.33; 'another': 0.33; "can't": 0.34; 'done': 0.34; 'needed': 0.35; 'conditions.': 0.35; 'faster': 0.35; 'especially': 0.35; 'pm,': 0.35; 'something': 0.35; 'there': 0.35; 'but': 0.36; 'url:org': 0.36; 'anything': 0.36; 'possible': 0.37; 'two': 0.37; 'being': 0.37; 'data': 0.37; 'subject:: ': 0.38; 'some': 0.38; 'url:en': 0.38; 'to:addr:python.org': 0.39; 'received:192': 0.39; 'skip:" 10': 0.40; 'received:192.168': 0.40; 'your': 0.60; 'easy': 0.60; 'skip:u 10': 0.60; 'real': 0.61; 'chance': 0.61; "you'll": 0.62; 'different': 0.63; 'perfect': 0.63; 'more': 0.63; 'worldwide': 0.64; 'making': 0.64; 'secure.': 0.65; "today's": 0.66; 'subject: & ': 0.67; 'secure': 0.67; 'received:74.208': 0.71; 'power': 0.74; '2013': 0.84; 'collision': 0.84; 'collision.': 0.84; 'cuts': 0.84; 'fascinating,': 0.84; 'received:74.208.4.194': 0.84; 'secret,': 0.84; 'checks.': 0.91; 'cryptography': 0.91; 'angel': 0.93 Date: Wed, 23 Jan 2013 19:53:28 -0500 From: Dave Angel User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130106 Thunderbird/17.0.2 MIME-Version: 1.0 To: python-list@python.org Subject: Re: Uniquely identifying each & every html template References: <8deb6f5d-ff10-4b36-bdd6-36f9eed58e1e@googlegroups.com> <03581a24-9330-4019-bde9-61a607000d3d@googlegroups.com> <187d77e0-e948-46bf-acc5-668c446cf3aa@googlegroups.com> <239abe33-fa5b-41a9-ae80-5260b9b1bd9c@googlegroups.com> <2391171e-e170-4647-8924-8e446ea1c6b1@googlegroups.com> <9d9b287c-ca2a-49c1-a16b-e42cb2a5db38@q16g2000pbt.googlegroups.com> <6f3e7d20-3005-4d1e-b949-d90a78e7bbf6@googlegroups.com> <50FFD9B8.3090304@davea.name> <51007BD0.1030105@davea.name> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Provags-ID: V02:K0:D9afbY97zxgNb8frPoeOjAA5gujXRhwZxFKckrmB+xc /ySdKp5W+uWBY2h9N1HLDqSvVa+OjxGTYqeXTpx5oKI9w7n4dm p3WDs4dRAnjFeLD5YwIuPNJ6rMtv/tiMN1SGwXvswHedbjQYsN cGthsj32kvNyvsV6Yxg68MQP0yWTaUYlIUSH3QUaqsJ2cEsC2m msz8GlNxLWp6iCr09nAxTy0WEB1Q8JKx3S/aygCdGGe0q4FUAr uXWiKxTmqTIlIIbp8BUeskVXoGgZIdepUFPQRH1Bl0ZVk3kUD0 Bzm7xZkaM/oN/N+CrNpgmQm7E8bNpHTZKRfmgBXySXdAb92XQ= = X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 65 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1358988834 news.xs4all.nl 6868 [2001:888:2000:d::a6]:52551 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:37527 On 01/23/2013 07:39 PM, Chris Angelico wrote: > On Thu, Jan 24, 2013 at 11:09 AM, Dave Angel wrote: >> I certainly can't disagree that it's easy to produce a very long hash that >> isn't at all secure. But I would disagree that longer hashes >> *automatically* reduce chances of collision. > > Sure. But by and large, longer hashes give you a better chance at > avoiding collisions. > > Caveat: I am not a cryptography expert. My statements are based on my > own flawed understanding of what's going on. I use the stuff but I > don't invent it. > >> Wikipedia - http://en.wikipedia.org/wiki/Cryptographic_hash_function >> >> seems to say that there are four requirements. >> it is easy to compute the hash value for any given message >> it is infeasible to generate a message that has a given hash >> it is infeasible to modify a message without changing the hash >> it is infeasible to find two different messages with the same hash >> >> Seems to me a small hash wouldn't be able to meet the last 3 conditions. > > True, but the definition of "small" is tricky. Of course the one-byte > hash I proposed isn't going to be difficult to break, since you can > just brute-force a bunch of message changes until you find one that > has the right hash. > > But it's more about the cascade effect - that any given message has > equal probability of having any of the possible hashes. Make a random > change, get another random hash. So for a perfect one-byte hash, you > have exactly one chance in 256 of getting any particular hash. > > By comparison, a simple/naive hash that just XORs together all the > byte values fails these checks. Even if you take the message 64 bytes > at a time (thus producing a 512-bit hash), you'll still be insecure, > because it's easy to predict what hash you'll get after making a > particular change. > > This property of the hash doesn't change as worldwide computing power > improves. A hashing function might go from being "military-grade > security" to being "home-grade security" to being "two-foot fence > around your property", while still being impossible to predict without > brute-forcing. But when an algorithm is found that generates > collisions faster than the hash size indicates, it effectively reduces > the hash size to the collision rate - MD5 is 128-bit, but (if I > understand the Wikipedia note correctly) a known attack cuts that to > 20.96 bits of "real hash size". So MD5 is still better than a perfect > 16-bit hash, but not as good as a perfect 32-bit hash. (And on today's > hardware, that's not good enough.) > > http://en.wikipedia.org/wiki/Collision_resistant > > ChrisA > Thanks. I've read a lot about encryption and data compression (two overlapping fields), and done some amateurish work (first time was 1975) that was just to keep something secret, not to be especially secure. I find the field fascinating, but have never needed to do anything particularly secure for a real product. -- DaveA