Path: csiph.com!newsfeed.hal-mli.net!feeder3.hal-mli.net!newsfeed.hal-mli.net!feeder1.hal-mli.net!newsfeed.xs4all.nl!newsfeed2.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.130 X-Spam-Level: * X-Spam-Evidence: '*H*': 0.74; '*S*': 0.00; 'algorithm': 0.03; 'one?': 0.05; 'filename': 0.07; 'suppose': 0.07; 'file,': 0.15; 'producing': 0.15; 'attributes.': 0.16; 'exchanges': 0.16; "file's": 0.16; 'fine.': 0.16; 'identifiable': 0.16; 'numbered': 0.16; 'task.': 0.16; 'string': 0.17; 'wrote:': 0.17; 'solution.': 0.18; 'define': 0.20; 'putting': 0.20; 'file.': 0.20; 'location,': 0.22; 'recognize': 0.22; 'programming': 0.23; "i've": 0.23; 'random': 0.24; 'header:In-Reply-To:1': 0.25; 'header:User- Agent:1': 0.26; 'am,': 0.27; 'decide': 0.28; 'hash': 0.29; 'parking': 0.29; 'no,': 0.29; 'probably': 0.29; 'maybe': 0.29; 'asking': 0.32; 'towards': 0.32; 'file': 0.32; 'could': 0.32; 'problem': 0.33; 'to:addr:python-list': 0.33; 'everyone': 0.33; 'another': 0.33; "can't": 0.34; 'changed': 0.34; 'or,': 0.34; 'done': 0.34; 'solving': 0.35; "won't": 0.35; 'something': 0.35; 'but': 0.36; 'totally': 0.36; 'beyond': 0.37; 'one,': 0.37; 'subject:: ': 0.38; 'some': 0.38; 'several': 0.39; 'to:addr:python.org': 0.39; 'received:192': 0.39; 'received:192.168': 0.40; 'your': 0.60; 'company': 0.60; 'identify': 0.61; 'tracking': 0.61; 'policy': 0.62; 'solve': 0.62; 'strange': 0.62; 'personal': 0.62; 'license': 0.65; 'content,': 0.65; 'records.': 0.65; 'subject: & ': 0.67; 'car': 0.69; 'obvious': 0.71; 'received:74.208': 0.71; 'day': 0.73; 'power': 0.74; '100': 0.78; 'computers.': 0.84; "employees'": 0.84; "everything's": 0.84; "it'd": 0.84; 'received:74.208.4.194': 0.84; 'sticker,': 0.84; 'cars': 0.91; 'hand.': 0.91; 'seriously,': 0.91; 'examine': 0.95; 'hand,': 0.97 Date: Mon, 21 Jan 2013 17:26:57 -0500 From: Dave Angel User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130106 Thunderbird/17.0.2 MIME-Version: 1.0 To: python-list@python.org Subject: Re: Uniquely identifying each & every html template References: <8deb6f5d-ff10-4b36-bdd6-36f9eed58e1e@googlegroups.com> <5dd4babd-716d-4542-ad36-e6a841b73ec3@googlegroups.com> <03581a24-9330-4019-bde9-61a607000d3d@googlegroups.com> <187d77e0-e948-46bf-acc5-668c446cf3aa@googlegroups.com> <239abe33-fa5b-41a9-ae80-5260b9b1bd9c@googlegroups.com> In-Reply-To: <239abe33-fa5b-41a9-ae80-5260b9b1bd9c@googlegroups.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Provags-ID: V02:K0:LXq57Tdxc04a6uIUl8GQBHjsfmcy0KygfRk8qwgPRzo BueOqOgauGO4HAB+9hRKE5wAjVibEOQlHAIkaQfLiwJDGkmksC o6NIpKp0ENi2ZW49edxEZ1rB8jkNeQt7KF4+zcNgSAmC2e05h9 FURTTWq5ejBZWYU5jB8sEBY8R8oX997Y8LF5osrgINf4Xd2LL7 MeENLtXWadh9n+yvc9Uf5ncyeUH0pyM57DjyTqW5QKhSzfWYcQ SSCkn6WveZ/GSQQv5omM+KKpyUGcu4XBi5H4SqDzV5h0G6w9E5 fLO0GOejRkSnC9qqlzY89eGOeq+w62tATRdms4wsDRzFFqEug= = X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 58 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1358807241 news.xs4all.nl 6863 [2001:888:2000:d::a6]:52762 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:37228 On 01/21/2013 07:06 AM, Ferrous Cranus wrote: > >> >> >> >> Seriously, you're asking for something that's beyond the power of >> >> humans or computers. You want to identify that something's the same >> >> file, without tracking the change or having any identifiable tag. >> >> That's a fundamentally impossible task. > > No, it is difficult but not impossible. > It just cannot be done by tagging the file by: > > 1. filename > 2. filepath > 3. hash (math algorithm producing a string based on the file's contents) > > We need another way to identify the file WITHOUT using the above attributes. > Repeating the same impossible scenario won't solve it. You need to find some other way to recognize the file. If you can't count on either name, location, or content, you can't do it. Try solving the problem by hand. If you examine the files, and a particular one has both changed names and content, how are you going to decide that it's the "same" one? Define "same" in a way that you could do it by hand, and you're halfway towards a programming solution. Maybe it'd be obvious from an analogy. Suppose you're HR for a company with 100 employees, and a strange policy of putting paychecks under the wipers of the employees' windshields. All the employee cars are kept totally clean of personal belongings, with no registration or license plates. The lot has no reserved parking places, so every car has a random location. For a while, you just memorize the make/model/color of each car, and everything's fine. But one day several of the employees buy new cars. How do you then associate each car with each employee? I've got it - you require each one to keep a numbered parking sticker, and they move the sticker when they get a new car. Or, you give everyone a marked, reserved parking place. Or you require each employee to report any car exchanges to you, so you can update your records. If you can solve this one, you can probably solve the other one. Until then, we have no spec. -- DaveA