Date: Tue, 8 Jan 2013 06:43:46 +1100
Subject: Re: [Offtopic] Line fitting [was Re: Numpy outlier removal]
From: Chris Angelico
To: python-list@python.org
Newsgroups: comp.lang.python

On Tue, Jan 8, 2013 at 4:58 AM, Steven D'Aprano wrote:
> Anyone can fool themselves into placing a line through a subset of
> non-linear data. Or, sadly more often, *deliberately* cherry picking fake
> clusters in order to fool others. Here is a real world example of what
> happens when people pick out the data clusters that they like based on
> visual inspection:
>
> http://www.skepticalscience.com/images/TempEscalator.gif

And sensible people will notice that, even drawn like that, it's only
a ~0.6 deg increase across ~30 years. Hardly statistically
significant, given that weather patterns have been known to follow
cycles at least that long.

But that's nothing to do with drawing lines through points, and more
to do with how much data you collect before you announce a conclusion,
and how easily a graph can prove any point you like.

Statistical analysis is a huge science. So is lying. And I'm not sure
most people can pick one from the other.

ChrisA
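
To make the cherry-picking point concrete, here is a minimal sketch in plain Python. Everything in it is an assumption for illustration only: the series is synthetic (a made-up 0.02-per-year trend plus a made-up 8-year cycle), not real temperature data, and `ols_slope` is a hypothetical helper, not anything from the thread. It fits a least-squares line to the whole series and then to one hand-picked short window.

```python
import math


def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


# Synthetic series (invented for this example, NOT real data):
# a steady upward trend with an 8-year cycle laid on top.
years = list(range(30))
temps = [0.02 * t + 0.2 * math.sin(2 * math.pi * t / 8) for t in years]

# Fit through all 30 points.
full_slope = ols_slope(years, temps)

# Fit through a hand-picked window (years 2..6) that happens to sit
# on the falling half of one cycle.
window = slice(2, 7)
window_slope = ols_slope(years[window], temps[window])

print(f"full-series slope: {full_slope:+.4f} per year")
print(f"years 2-6 slope:   {window_slope:+.4f} per year")
```

With these made-up numbers the full fit slopes upward while the chosen window slopes downward, which is the effect both posters are describing: the window you pick decides the line you get.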