Path: csiph.com!newsfeed.hal-mli.net!feeder3.hal-mli.net!newsfeed.hal-mli.net!feeder1.hal-mli.net!news.mixmin.net!feeder.erje.net!eu.feeder.erje.net!newsfeed.freenet.ag!news2.euro.net!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.006 X-Spam-Evidence: '*H*': 0.99; '*S*': 0.00; 'algorithm': 0.03; '21,': 0.07; 'filename': 0.07; 'pages.': 0.07; 'python': 0.09; 'alter': 0.09; 'owners.': 0.09; "they've": 0.09; 'to:addr:comp.lang.python': 0.09; 'cc:addr:python-list': 0.10; 'file,': 0.15; 'producing': 0.15; 'assortment': 0.16; 'co,': 0.16; "file's": 0.16; "he'll": 0.16; 'identifiable': 0.16; 'magic.': 0.16; 'renames': 0.16; 'task.': 0.16; 'mon,': 0.16; 'string': 0.17; 'wrote:': 0.17; 'jan': 0.18; 'shell': 0.18; 'written': 0.20; 'cc:2**0': 0.23; 'cc:no real name:2**0': 0.24; 'script': 0.24; 'cc:addr:python.org': 0.25; 'header:In-Reply-To:1': 0.25; 'header :User-Agent:1': 0.26; 'chris': 0.28; 'hash': 0.29; 'no,': 0.29; 'asking': 0.32; 'file': 0.32; 'raising': 0.33; 'anyone': 0.33; 'another': 0.33; 'received:google.com': 0.34; 'done': 0.34; 'templates': 0.35; 'pm,': 0.35; 'received:209.85': 0.35; 'something': 0.35; 'but': 0.36; 'anything': 0.36; 'beyond': 0.37; 'being': 0.37; 'received:209': 0.37; 'subject:: ': 0.38; 'some': 0.38; 'identify': 0.61; 'tracking': 0.61; 'different': 0.63; 'websites': 0.66; 'subject: & ': 0.67; 'family': 0.68; '8bit%:100': 0.70; '8bit%:92': 0.70; 'business': 0.70; 'acts': 0.71; 'power': 0.74; '2013': 0.84; 'computers.': 0.84; 'dreamweaver,': 0.84; 'moves': 0.84; 'shade': 0.84; 'seriously,': 0.91; 'investing': 0.95 X-Received: by 10.49.1.43 with SMTP id 11mr3639366qej.29.1358769983681; Mon, 21 Jan 2013 04:06:23 -0800 (PST) Newsgroups: comp.lang.python Date: Mon, 21 Jan 2013 04:06:23 -0800 (PST) In-Reply-To: Complaints-To: groups-abuse@google.com Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=94.68.70.179; posting-account=DYJQ-woAAACEPH85Au2BhUVfFTfSfVa4 References: <8deb6f5d-ff10-4b36-bdd6-36f9eed58e1e@googlegroups.com> <5dd4babd-716d-4542-ad36-e6a841b73ec3@googlegroups.com> <03581a24-9330-4019-bde9-61a607000d3d@googlegroups.com> <187d77e0-e948-46bf-acc5-668c446cf3aa@googlegroups.com> User-Agent: G2/1.0 X-Google-Web-Client: true X-Google-IP: 94.68.70.179 MIME-Version: 1.0 Subject: Re: Uniquely identifying each & every html template From: Ferrous Cranus To: comp.lang.python@googlegroups.com Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Cc: python-list@python.org X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Message-ID: Lines: 55 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1358770584 news.xs4all.nl 6947 [2001:888:2000:d::a6]:52508 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:37183 =CE=A4=CE=B7 =CE=94=CE=B5=CF=85=CF=84=CE=AD=CF=81=CE=B1, 21 =CE=99=CE=B1=CE= =BD=CE=BF=CF=85=CE=B1=CF=81=CE=AF=CE=BF=CF=85 2013 11:31:24 =CF=80.=CE=BC. = UTC+2, =CE=BF =CF=87=CF=81=CE=AE=CF=83=CF=84=CE=B7=CF=82 Chris Angelico =CE= =AD=CE=B3=CF=81=CE=B1=CF=88=CE=B5: > On Mon, Jan 21, 2013 at 8:19 PM, Ferrous Cranus w= rote: >=20 > > This python script acts upon websites other people use and >=20 > > every html templates has been written by different methods(notepad++, d= reamweaver, joomla). >=20 > > >=20 > > Renames and moves are performed, either by shell access or either by c= Panel access by website owners. >=20 > > >=20 > > That being said i have no control on HOW and WHEN users alter their htm= l pages. >=20 >=20 >=20 > Then I recommend investing in some magic. There's an old-established >=20 > business JW Wells & Co, Family Sorcerers. They've a first-rate >=20 > assortment of magic, and for raising a posthumous shade with effects >=20 > that are comic, or tragic, there's no cheaper house in the trade! If >=20 > anyone anything lacks, he'll find it all ready in stacks, if he'll >=20 > only look in on the resident Djinn, number seventy, Simmery Axe! >=20 >=20 >=20 > Seriously, you're asking for something that's beyond the power of >=20 > humans or computers. You want to identify that something's the same >=20 > file, without tracking the change or having any identifiable tag. >=20 > That's a fundamentally impossible task. No, it is difficult but not impossible. It just cannot be done by tagging the file by: 1. filename 2. filepath 3. hash (math algorithm producing a string based on the file's contents) We need another way to identify the file WITHOUT using the above attributes= .