Groups > comp.lang.python > #42722 > unrolled thread

In defence of 80-char lines

Started by	Steven D'Aprano <steve+comp.lang.python@pearwood.info>
First post	2013-04-04 01:36 +0000
Last post	2013-04-05 00:14 +0000
Articles	20 on this page of 25 — 16 participants

Back to article view | Back to comp.lang.python

  In defence of 80-char lines Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2013-04-04 01:36 +0000
    Re: In defence of 80-char lines Andrew Berg <bahamutzero8825@gmail.com> - 2013-04-03 20:59 -0500
    Re: In defence of 80-char lines Mitya Sirenef <msirenef@lightbird.net> - 2013-04-03 22:40 -0400
    Re: In defence of 80-char lines llanitedave <llanitedave@veawb.coop> - 2013-04-03 21:32 -0700
      Re: In defence of 80-char lines Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2013-04-04 06:33 +0000
      Re: In defence of 80-char lines gregor <gregor@ediwo.com> - 2013-04-04 08:40 +0200
      Re: In defence of 80-char lines Peter Otten <__peter__@web.de> - 2013-04-04 08:43 +0200
      Re: In defence of 80-char lines Tim Chase <python.list@tim.thechases.com> - 2013-04-04 06:09 -0500
      Re: In defence of 80-char lines Roy Smith <roy@panix.com> - 2013-04-04 07:52 -0400
        Re: In defence of 80-char lines llanitedave <llanitedave@veawb.coop> - 2013-04-04 08:28 -0700
      Re: In defence of 80-char lines Jason Swails <jason.swails@gmail.com> - 2013-04-04 08:18 -0400
      Re: In defence of 80-char lines Joshua Landau <joshua.landau.ws@gmail.com> - 2013-04-04 18:18 +0100
      Re: In defence of 80-char lines Dennis Lee Bieber <wlfraed@ix.netcom.com> - 2013-04-04 18:22 -0400
    Re: In defence of 80-char lines rusi <rustompmody@gmail.com> - 2013-04-03 21:56 -0700
    Re: In defence of 80-char lines Rui Maciel <rui.maciel@gmail.com> - 2013-04-04 08:15 +0100
    Re: In defence of 80-char lines Jason Swails <jason.swails@gmail.com> - 2013-04-04 08:13 -0400
      Re: In defence of 80-char lines Roy Smith <roy@panix.com> - 2013-04-04 08:39 -0400
        Re: In defence of 80-char lines Jason Swails <jason.swails@gmail.com> - 2013-04-04 09:23 -0400
        Re: In defence of 80-char lines Neil Cerutti <neilc@norwich.edu> - 2013-04-04 15:56 +0000
        Re: In defence of 80-char lines Kushal Kumaran <kushal.kumaran+python@gmail.com> - 2013-04-04 23:04 +0530
          Re: In defence of 80-char lines Roy Smith <roy@panix.com> - 2013-04-04 19:55 -0400
    Re: In defence of 80-char lines Mitya Sirenef <msirenef@lightbird.net> - 2013-04-04 12:12 -0400
    Re: In defence of 80-char lines jmfauth <wxjmfauth@gmail.com> - 2013-04-04 13:28 -0700
      Re: In defence of 80-char lines Jason Swails <jason.swails@gmail.com> - 2013-04-04 17:00 -0400
      Re: In defence of 80-char lines Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2013-04-05 00:14 +0000

Page 1 of 2 [1] 2 Next page →

#42722 — In defence of 80-char lines

From	Steven D'Aprano <steve+comp.lang.python@pearwood.info>
Date	2013-04-04 01:36 +0000
Subject	In defence of 80-char lines
Message-ID	<515cd919$0$29966$c3e8da3$5496439d@news.astraweb.com>

Although PEP 8 is only compulsory for the Python standard library, many 
users like to stick to PEP 8 for external projects. 

http://www.python.org/dev/peps/pep-0008/

With perhaps one glaring exception: many people hate, or ignore, PEP 8's 
recommendation to limit lines to 80 characters. (Strictly speaking, 79 
characters.)


Here is a good defence of 80 char lines:

http://wrongsideofmemphis.com/2013/03/25/80-chars-per-line-is-great/



-- 
Steven

[toc] | [next] | [standalone]

#42726

From	Andrew Berg <bahamutzero8825@gmail.com>
Date	2013-04-03 20:59 -0500
Message-ID	<mailman.83.1365040763.3114.python-list@python.org>
In reply to	#42722

While I agree that not having a line take up hundreds of characters is a
good thing, 80 is really arbitrary in 2013 and having any self-imposed
hard limit is silly. When you put a single 4- or 5-character word on a
new line because you don't want to go over 80 (or 120 or whatever), the
code is /less/ readable. A better guideline is to make new lines as
necessary to make things more readable rather than blindly stick to some
hard limit and say it's more readable just because.

Also, IMO, 80 is far too limiting and I find 120-130 much better. Then
again, I like small font sizes and avoid lower resolution screens like
the plague.
-- 
CPython 3.3.0 | Windows NT 6.2.9200 / FreeBSD 9.1

[toc] | [prev] | [next] | [standalone]

#42729

From	Mitya Sirenef <msirenef@lightbird.net>
Date	2013-04-03 22:40 -0400
Message-ID	<mailman.86.1365043256.3114.python-list@python.org>
In reply to	#42722

On 04/03/2013 09:59 PM, Andrew Berg wrote:
> While I agree that not having a  line take up hundreds of characters is a
 > good thing, 80 is really arbitrary in 2013 and having any self-imposed
 > hard limit is silly. When you put a single 4- or 5-character word on a
 > new line because you don't want to go over 80 (or 120 or whatever), the
 > code is /less/ readable. A better guideline is to make new lines as
 > necessary to make things more readable rather than blindly stick to some
 > hard limit and say it's more readable just because.
 >
 > Also, IMO, 80 is far too limiting and I find 120-130 much better. Then
 > again, I like small font sizes and avoid lower resolution screens like
 > the plague.

I have to agree. To some degree, it's a matter of taste: for me, 80c
limit looks ugly to the extreme, at least in Django; but 140+ looks even
uglier, and the longer line is, the uglier it looks. The optimal size
for Django code is a 105 char soft limit -- by soft limit I mean that
under 105 it's always one line, 105-110 I decide on a case-by-case basis
and over 110 is always split.

So my preference is: 105 > 120-130 > 140 > 80 > 140+

The trade-off is that on one hand, the code is more readable when a
single line is a single "operation", from a cognitive standpoint, when
you're thinking about the logic of the function as a whole, or a subset
of a function if it's too long (which it shouldn't be, right?) On the
other hand, even if your monitor is wide, you probably still want to fit
in the browser window and the terminal window, and as the blog author
rightly notes, really long lines do get harder to read.

Again, I mostly work with Django and I suspect if I worked in regular
Python I would possibly gravitate towards 95-100 limit.

I find the blog author's point about fitting more text nonsensical: you
can obviously fit more text PER LINE if lines are longer! And you can
quite easily fit two 120-130 wide gvim screens on a modern monitor with
room to spare.

I'm sure eyesight acuity also figures into this: I prefer to work
without glasses -- otherwise my eyes get tired after a couple of hours;
but this means I can't see code on my second monitor. If I could, I
might have preferred having browser and terminal on one monitor and Gvim
with slightly longer width limits than I use now.

  -m

-- 
Lark's Tongue Guide to Python: http://lightbird.net/larks/

The Bible is literature, not dogma.  George Santayana

[toc] | [prev] | [next] | [standalone]

#42734

From	llanitedave <llanitedave@veawb.coop>
Date	2013-04-03 21:32 -0700
Message-ID	<c338b844-e9ce-46a7-9daf-20374372390e@googlegroups.com>
In reply to	#42722

I also tend to prefer a maximum between 110 and 120 characters.  I find continuation lines confusing, and when you use some third-party tools, such as wxPython, for example, the boilerplate code leads to some long lines.

I would hate to have to break up this line, for instance:

self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD, faceName = "FreeSans"))

Especially if it's already indented a few levels to begin with.

With most of your code in classes, you already got most of it indented two levels right off the bat.

[toc] | [prev] | [next] | [standalone]

#42738

From	Steven D'Aprano <steve+comp.lang.python@pearwood.info>
Date	2013-04-04 06:33 +0000
Message-ID	<515d1ecd$0$29891$c3e8da3$5496439d@news.astraweb.com>
In reply to	#42734

On Wed, 03 Apr 2013 21:32:33 -0700, llanitedave wrote:

> I also tend to prefer a maximum between 110 and 120 characters.  I find
> continuation lines confusing, and when you use some third-party tools,
> such as wxPython, for example, the boilerplate code leads to some long
> lines.

Excessive boilerplate, and long lines, is a code-smell that suggests 
strongly that the code needs refactoring and/or simplifying.

> I would hate to have to break up this line, for instance:
> 
> self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD,
> faceName = "FreeSans"))

Here, let me do that for you, with two, no, three, no FOUR levels of 
indentation :-)

                self.mainLabel.SetFont(self.label_format)

Now when you want to change the font of your labels, you can change them 
all with *one* edit. Or you can add themes to your application by reading 
style info from a config file. Even if you don't want to do that, you 
still have the benefit of much more readable code.

Clearly you have to define the label format in the class. Here's one 
simple way:

class MyClass(whatever):
    label_format = wx.Font(
        12, wx.DEFAULT, wx.NORMAL, wx.BOLD, faceName="FreeSans")

    def __init__(self, label_format=None):
        if label_format is not None:
            # Over-ride the default.
            self.label_format = label_format

Even if you don't want to do that, you can still fit it into 80 char 
lines:

        # 80 characters, not 79. Oh well.
        font = wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD, faceName="FreeSans")
        self.mainLabel.SetFont(font)

And finally, when all is said and done, the most important rule from PEP 8 
applies: know when to break the rules.

-- 
Steven

[toc] | [prev] | [next] | [standalone]

#42739

From	gregor <gregor@ediwo.com>
Date	2013-04-04 08:40 +0200
Message-ID	<20130404084056.682541bd@florenz>
In reply to	#42734

Am Wed, 3 Apr 2013 21:32:33 -0700 (PDT)
schrieb llanitedave <llanitedave@veawb.coop>:

> I would hate to have to break up this line, for instance:
> 
> self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD,
> faceName = "FreeSans"))

I think this is much more readable:

self.mainLabel.SetFont(wx.Font(12,
                               wx.DEFAULT,
                               wx.NORMAL,
                               wx.BOLD,
                               faceName = "FreeSans"))  

Emacs for example does this indentation with the TAB key automatically.

--
Greg

[toc] | [prev] | [next] | [standalone]

#42740

From	Peter Otten <__peter__@web.de>
Date	2013-04-04 08:43 +0200
Message-ID	<mailman.91.1365057741.3114.python-list@python.org>
In reply to	#42734

llanitedave wrote:

> I also tend to prefer a maximum between 110 and 120 characters.  I find
> continuation lines confusing, and when you use some third-party tools,
> such as wxPython, for example, the boilerplate code leads to some long
> lines.
> 
> I would hate to have to break up this line, for instance:
> 
> self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD,
> faceName = "FreeSans"))

I'm not a wx user, but I think I would prefer

labelfont = wx.Font(
    pointSize=12,
    style=wx.DEFAULT,
    family=wx.NORMAL,
    weight=wx.BOLD,
    faceName="FreeSans")
self.mainLabel.SetFont(labelfont)

even if I knew the order of the arguments and the meaning of constants like 
DEFAULT and NORMAL by heart.

[toc] | [prev] | [next] | [standalone]

#42746

From	Tim Chase <python.list@tim.thechases.com>
Date	2013-04-04 06:09 -0500
Message-ID	<mailman.94.1365073655.3114.python-list@python.org>
In reply to	#42734

On 2013-04-04 08:43, Peter Otten wrote:
> llanitedave wrote:
>> self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD, faceName = "FreeSans"))
> 
> I think I would prefer
> 
> labelfont = wx.Font(
>     pointSize=12,
>     style=wx.DEFAULT,
>     family=wx.NORMAL,
>     weight=wx.BOLD,
>     faceName="FreeSans")
> self.mainLabel.SetFont(labelfont)

+1
The only change I'd make to this suggestion would be to add a
semi-superfluous comma+newline after the last keyword argument too:

 labelfont = wx.Font(
     pointSize=12,
     style=wx.DEFAULT,
     family=wx.NORMAL,
     weight=wx.BOLD,
     faceName="FreeSans",
     )

which makes diffs cleaner when you need to insert something after
faceName:

--- peter1.txt  2013-04-04 06:03:01.420762566 -0500
+++ peter2.txt  2013-04-04 06:03:34.736762582 -0500
@@ -3,4 +3,5 @@
     style=wx.DEFAULT,
     family=wx.NORMAL,
     weight=wx.BOLD,
-    faceName="FreeSans")
+    faceName="FreeSans",
+    otherValue=42)

vs.

--- tkc1.txt    2013-04-04 06:02:52.436762562 -0500
+++ tkc2.txt    2013-04-04 06:03:51.392762588 -0500
@@ -4,4 +4,5 @@
     family=wx.NORMAL,
     weight=wx.BOLD,
     faceName="FreeSans",
+    otherValue=42,
     )

Additionally, if there are lots of keyword parameters like this, I'd
be tempted to keep them in sorted order for ease of tracking them
down (though CSS has long-standing arguments on how properties should
be ordered, so to each their own on this).

-tkc

[toc] | [prev] | [next] | [standalone]

#42748

From	Roy Smith <roy@panix.com>
Date	2013-04-04 07:52 -0400
Message-ID	<roy-46860E.07523804042013@news.panix.com>
In reply to	#42734

In article <c338b844-e9ce-46a7-9daf-20374372390e@googlegroups.com>,
 llanitedave <llanitedave@veawb.coop> wrote:

> I would hate to have to break up this line, for instance:
> 
> self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD, faceName = 
> "FreeSans"))

I would write that as some variation on

self.mainLabel.SetFont(wx.Font(12,
                               wx.DEFAULT,
                               wx.NORMAL,
                               wx.BOLD, 
                               faceName="FreeSans"))

This lets the reader see at a glance that all the arguments go with 
wx.Font(), not with SetFont(), without having to visually parse and 
match parenthesis levels.

Actually, I would probably break it up further as:

my_font = wx.Font(12,
                  wx.DEFAULT,
                  wx.NORMAL,
                  wx.BOLD,
                  faceName="FreeSans")
self.mainLabel.SetFont(my_font)

The last thing on my mind when deciding how to format this is whether I 
would be able to punch it onto a single card.

[toc] | [prev] | [next] | [standalone]

#42757

From	llanitedave <llanitedave@veawb.coop>
Date	2013-04-04 08:28 -0700
Message-ID	<59b9df11-b5b5-430a-b37f-958cbbf9f3f0@googlegroups.com>
In reply to	#42748

On Thursday, April 4, 2013 4:52:38 AM UTC-7, Roy Smith wrote:
> In article <c338b844-e9ce-46a7-9daf-20374372390e@googlegroups.com>,
> 
>  llanitedave <llanitedave@veawb.coop> wrote:
> 
>  
> 
> > I would hate to have to break up this line, for instance:
> 
> > 
> 
> > self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD, faceName = 
> 
> > "FreeSans"))
> 
> 
> 
> I would write that as some variation on
> 
> 
> 
> self.mainLabel.SetFont(wx.Font(12,
> 
>                                wx.DEFAULT,
> 
>                                wx.NORMAL,
> 
>                                wx.BOLD, 
> 
>                                faceName="FreeSans"))
> 
> 
> 
> This lets the reader see at a glance that all the arguments go with 
> 
> wx.Font(), not with SetFont(), without having to visually parse and 
> 
> match parenthesis levels.
> 
> 
> 
> Actually, I would probably break it up further as:
> 
> 
> 
> my_font = wx.Font(12,
> 
>                   wx.DEFAULT,
> 
>                   wx.NORMAL,
> 
>                   wx.BOLD,
> 
>                   faceName="FreeSans")
> 
> self.mainLabel.SetFont(my_font)
> 
> 
> 
> The last thing on my mind when deciding how to format this is whether I 
> 
> would be able to punch it onto a single card.

To each their own, definitely.  For myself, I don't see the utility in adding a bunch of what appears to be superfluous horizontal white space at the expense of extra lines to scroll down.  I like to limit my scrolling needs in *both* directions.
(Although I do tend to be fairly generous with blank lines to break up code "paragraphs")

[toc] | [prev] | [next] | [standalone]

#42750

From	Jason Swails <jason.swails@gmail.com>
Date	2013-04-04 08:18 -0400
Message-ID	<mailman.97.1365077925.3114.python-list@python.org>
In reply to	#42734

[Multipart message — attachments visible in raw view] — view raw

On Thu, Apr 4, 2013 at 7:09 AM, Tim Chase <python.list@tim.thechases.com>wrote:

> On 2013-04-04 08:43, Peter Otten wrote:
> > llanitedave wrote:
> >> self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD,
> faceName = "FreeSans"))
> >
> > I think I would prefer
> >
> > labelfont = wx.Font(
> >     pointSize=12,
> >     style=wx.DEFAULT,
> >     family=wx.NORMAL,
> >     weight=wx.BOLD,
> >     faceName="FreeSans")
> > self.mainLabel.SetFont(labelfont)
>
> +1
> The only change I'd make to this suggestion would be to add a
> semi-superfluous comma+newline after the last keyword argument too:
>
>  labelfont = wx.Font(
>      pointSize=12,
>      style=wx.DEFAULT,
>      family=wx.NORMAL,
>      weight=wx.BOLD,
>      faceName="FreeSans",
>      )
>
> which makes diffs cleaner when you need to insert something after
> faceName:
>
> --- peter1.txt  2013-04-04 06:03:01.420762566 -0500
> +++ peter2.txt  2013-04-04 06:03:34.736762582 -0500
> @@ -3,4 +3,5 @@
>      style=wx.DEFAULT,
>      family=wx.NORMAL,
>      weight=wx.BOLD,
> -    faceName="FreeSans")
> +    faceName="FreeSans",
> +    otherValue=42)
>
> vs.
>
> --- tkc1.txt    2013-04-04 06:02:52.436762562 -0500
> +++ tkc2.txt    2013-04-04 06:03:51.392762588 -0500
> @@ -4,4 +4,5 @@
>      family=wx.NORMAL,
>      weight=wx.BOLD,
>      faceName="FreeSans",
> +    otherValue=42,
>      )
>

+1

I wasn't aware you could do this (superfluous trailing commas), although I
admit it hadn't occurred to me to try.  I use git for virtually everything,
and I regularly parse diffstats -- this would make them much easier to
grok.  (It's an incredibly helpful bug-tracking technique)

Thanks!
Jason

[toc] | [prev] | [next] | [standalone]

#42766

From	Joshua Landau <joshua.landau.ws@gmail.com>
Date	2013-04-04 18:18 +0100
Message-ID	<mailman.109.1365095929.3114.python-list@python.org>
In reply to	#42734

[Multipart message — attachments visible in raw view] — view raw

On 4 April 2013 12:09, Tim Chase <python.list@tim.thechases.com> wrote:

> On 2013-04-04 08:43, Peter Otten wrote:
> > llanitedave wrote:
> >> self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD,
> faceName = "FreeSans"))
> >
> > I think I would prefer
> >
> > labelfont = wx.Font(
> >     pointSize=12,
> >     style=wx.DEFAULT,
> >     family=wx.NORMAL,
> >     weight=wx.BOLD,
> >     faceName="FreeSans")
> > self.mainLabel.SetFont(labelfont)
>
> +1
> The only change I'd make to this suggestion would be to add a
> semi-superfluous comma+newline after the last keyword argument too:
>
>  labelfont = wx.Font(
>      pointSize=12,
>      style=wx.DEFAULT,
>      family=wx.NORMAL,
>      weight=wx.BOLD,
>      faceName="FreeSans",
>      )
>

Since we're all showing opinions, I've always prefered the typical block
indentation:

labelfont = wx.Font(
    pointSize=12,
    style=wx.DEFAULT,
    family=wx.NORMAL,
    weight=wx.BOLD,
    faceName="FreeSans",
) # Not indented here

as

A(
    B(
        C,
        D,
        E,
    )
)

reads a lot cleaner than

A(
    B(
        C,
        D,
        E
        )
    )

which makes diffs cleaner when you need to insert something after
> faceName:
>
<DIFS SNIP>

That is a very good point :).

Additionally, if there are lots of keyword parameters like this, I'd
> be tempted to keep them in sorted order for ease of tracking them
> down (though CSS has long-standing arguments on how properties should
> be ordered, so to each their own on this).
>

Personally I'd rarely be tempted to put more than 9 or so arguments
directly into a function or class. Most of the time I can imagine unpacking
(or equiv.) would look much more readable in the circumstances that apply.

[toc] | [prev] | [next] | [standalone]

#42781

From	Dennis Lee Bieber <wlfraed@ix.netcom.com>
Date	2013-04-04 18:22 -0400
Message-ID	<mailman.116.1365114154.3114.python-list@python.org>
In reply to	#42734

On Wed, 3 Apr 2013 21:32:33 -0700 (PDT), llanitedave
<llanitedave@veawb.coop> declaimed the following in
gmane.comp.python.general:

> I also tend to prefer a maximum between 110 and 120 characters.  I find continuation lines confusing, and when you use some third-party tools, such as wxPython, for example, the boilerplate code leads to some long lines.
> 
> I would hate to have to break up this line, for instance:
> 
> self.mainLabel.SetFont(wx.Font(12, wx.DEFAULT, wx.NORMAL, wx.BOLD, faceName = "FreeSans"))
>
	Whereas I would consider breaking at the commas somewhat natural

	self.mainLabel.SetFont(wx.Font(12,
										wx.DEFAULT,
										wx.NORMAL,
										wx.BOLD,
										faceName="FreeSans")	)
-- 
	Wulfraed                 Dennis Lee Bieber         AF6VN
        wlfraed@ix.netcom.com    HTTP://wlfraed.home.netcom.com/

[toc] | [prev] | [next] | [standalone]

#42735

From	rusi <rustompmody@gmail.com>
Date	2013-04-03 21:56 -0700
Message-ID	<0df4abca-211c-42e9-8589-25b1d96299b2@ps9g2000pbb.googlegroups.com>
In reply to	#42722

On Apr 4, 6:36 am, Steven D'Aprano <steve
+comp.lang.pyt...@pearwood.info> wrote:
> Although PEP 8 is only compulsory for the Python standard library, many
> users like to stick to PEP 8 for external projects.
>
> http://www.python.org/dev/peps/pep-0008/
>
> http://blog.languager.org/2012/10/layout-imperative-in-functional.htmlith perhaps one glaring exception: many people hate, or ignore, PEP 8's
> recommendation to limit lines to 80 characters. (Strictly speaking, 79
> characters.)
>
> Here is a good defence of 80 char lines:
>
> http://wrongsideofmemphis.com/2013/03/25/80-chars-per-line-is-great/

The exchange on hacker news linked from there makes for a nice read --
tnx.

I had a blog article http://blog.languager.org/2012/10/layout-imperative-in-functional.html
on this subject. It started from a python discussion, though its more
relevant to Haskell.
What does not so easily come out there is that the wide-line code
samples I posted which read ok to me were not to some readers. So I
moved it to gist, but even there some would get the horizontal scroll
bar.  Reading it 'raw' seems to remove the problem -- though I can
hardly promise that for all devices.  So from this POV the point that
was made was opposite to the one I was trying to make =)

The discussion that followed on haskell cafe
http://www.haskell.org/pipermail/haskell-cafe/2012-October/104224.html
made a number of interesting points about pros and cons of long lines.

[toc] | [prev] | [next] | [standalone]

#42742

From	Rui Maciel <rui.maciel@gmail.com>
Date	2013-04-04 08:15 +0100
Message-ID	<kjj95v$6m0$1@dont-email.me>
In reply to	#42722

Steven D'Aprano wrote:

> Although PEP 8 is only compulsory for the Python standard library, many
> users like to stick to PEP 8 for external projects.
> 
> http://www.python.org/dev/peps/pep-0008/
> 
> With perhaps one glaring exception: many people hate, or ignore, PEP 8's
> recommendation to limit lines to 80 characters. (Strictly speaking, 79
> characters.)
> 
> 
> Here is a good defence of 80 char lines:
> 
> http://wrongsideofmemphis.com/2013/03/25/80-chars-per-line-is-great/


The now arbitrary 80-column limit is a remnant of the limitations built into 
ancient terminals.

Why not let the text editor auto-wrap the lines?  They can do that now.


Rui Maciel

[toc] | [prev] | [next] | [standalone]

#42749

From	Jason Swails <jason.swails@gmail.com>
Date	2013-04-04 08:13 -0400
Message-ID	<mailman.96.1365077619.3114.python-list@python.org>
In reply to	#42722

[Multipart message — attachments visible in raw view] — view raw

On Wed, Apr 3, 2013 at 9:36 PM, Steven D'Aprano <
steve+comp.lang.python@pearwood.info> wrote:

> Although PEP 8 is only compulsory for the Python standard library, many
> users like to stick to PEP 8 for external projects.
>

But even the standard library breaks this rule on occasion.  e.g.,
/usr/lib/python2.7/urllib.py:1095 (I remember seeing others, but this is
the first example I was able to find quickly).

>
> http://www.python.org/dev/peps/pep-0008/
>
> With perhaps one glaring exception: many people hate, or ignore, PEP 8's
> recommendation to limit lines to 80 characters. (Strictly speaking, 79
> characters.)
>
>
> Here is a good defence of 80 char lines:
>
> http://wrongsideofmemphis.com/2013/03/25/80-chars-per-line-is-great/

Personally, I try my best to keep all lines at 80 character max (80 +
newline, not 79).  In addition to liking my 84-character-width gvim windows
(to allow a little leeway) side-by-side for code references, I'm definitely
not above printing out some code to lay it out on a desk---that's not
something you can do on-screen without a wall of monitors in front of you.

The only time I regularly break my rule is for regular expressions (at some
point I may embrace re.X to allow me to break those up, too).

All the best,
Jason

[toc] | [prev] | [next] | [standalone]

#42751

From	Roy Smith <roy@panix.com>
Date	2013-04-04 08:39 -0400
Message-ID	<roy-D6F29A.08394604042013@news.panix.com>
In reply to	#42749

In article <mailman.96.1365077619.3114.python-list@python.org>,
 Jason Swails <jason.swails@gmail.com> wrote:

> The only time I regularly break my rule is for regular expressions (at some
> point I may embrace re.X to allow me to break those up, too).

re.X is a pretty cool tool for making huge regexes readable.  But, it 
turns out that python's auto-continuation and string literal 
concatenation rules are enough to let you get much the same effect.  
Here's a regex we use to parse haproxy log files. This would be utter 
line noise all run together. This way, it's almost readable :-)

pattern = re.compile(r'haproxy\[(?P<pid>\d+)]: '
                     r'(?P<client_ip>(\d{1,3}\.){3}\d{1,3}):'
                     r'(?P<client_port>\d{1,5}) '
                     
r'\[(?P<accept_date>\d{2}/\w{3}/\d{4}(:\d{2}){3}\.\d{3})] '
                     r'(?P<frontend_name>\S+) '
                     r'(?P<backend_name>\S+)/'
                     r'(?P<server_name>\S+) '
                     r'(?P<Tq>(-1|\d+))/'
                     r'(?P<Tw>(-1|\d+))/'
                     r'(?P<Tc>(-1|\d+))/'
                     r'(?P<Tr>(-1|\d+))/'
                     r'(?P<Tt>\+?\d+) '
                     r'(?P<status_code>\d{3}) '
                     r'(?P<bytes_read>\d+) '
                     r'(?P<captured_request_cookie>\S+) '
                     r'(?P<captured_response_cookie>\S+) '
                     r'(?P<termination_state>[\w-]{4}) '
                     r'(?P<actconn>\d+)/'
                     r'(?P<feconn>\d+)/'
                     r'(?P<beconn>\d+)/'
                     r'(?P<srv_conn>\d+)/'
                     r'(?P<retries>\d+) '
                     r'(?P<srv_queue>\d+)/'
                     r'(?P<backend_queue>\d+) '
                     r'(\{(?P<request_id>.*?)\} )?'
                     r'(\{(?P<captured_request_headers>.*?)\} )?'
                     r'(\{(?P<captured_response_headers>.*?)\} )?'
                     r'"(?P<http_request>.+)"'
                     )

And, for those of you who go running in the other direction every time 
regex is suggested as a solution, I challenge you to come up with easier 
to read (or write) code for parsing a line like this (probably 
hopelessly mangled by the time you read it):

2013-04-03T00:00:00+00:00 localhost haproxy[5199]: 10.159.19.244:57291 
[02/Apr/2013:23:59:59.811] app-nodes next-song-nodes/web8.songza.com 
0/0/3/214/219 200 593 sessionid=NWiX5KGOdvg6dSaA 
sessionid=NWiX5KGOdvg6dSaA ---- 249/249/149/14/0 0/0 
{4C0ABFA9-515B6DEF-933229} "POST 
/api/1/station/892337/song/16024201/notify-play HTTP/1.0"

[toc] | [prev] | [next] | [standalone]

#42752

From	Jason Swails <jason.swails@gmail.com>
Date	2013-04-04 09:23 -0400
Message-ID	<mailman.98.1365081820.3114.python-list@python.org>
In reply to	#42751

[Multipart message — attachments visible in raw view] — view raw

On Thu, Apr 4, 2013 at 8:39 AM, Roy Smith <roy@panix.com> wrote:

> In article <mailman.96.1365077619.3114.python-list@python.org>,
>  Jason Swails <jason.swails@gmail.com> wrote:
>
> > The only time I regularly break my rule is for regular expressions (at
> some
> > point I may embrace re.X to allow me to break those up, too).
>
> re.X is a pretty cool tool for making huge regexes readable.  But, it
> turns out that python's auto-continuation and string literal
> concatenation rules are enough to let you get much the same effect.
> Here's a regex we use to parse haproxy log files. This would be utter
> line noise all run together. This way, it's almost readable :-)
>
> pattern = re.compile(r'haproxy\[(?P<pid>\d+)]: '
>                      r'(?P<client_ip>(\d{1,3}\.){3}\d{1,3}):'
>                      r'(?P<client_port>\d{1,5}) '
>

For some reason that never occurred to me.  I use this technique every
other time I want to break up a long string, but never for regexes...

Now I will.  I was wary of using re.X since I sometimes use meaningful
whitespace in my regexes, and I didn't want to have to figure out how to
prevent them from being ignored...  This is a much better solution.

Thanks,
Jason

[toc] | [prev] | [next] | [standalone]

#42762

From	Neil Cerutti <neilc@norwich.edu>
Date	2013-04-04 15:56 +0000
Message-ID	<as5m68Fe3ttU1@mid.individual.net>
In reply to	#42751

On 2013-04-04, Roy Smith <roy@panix.com> wrote:
> re.X is a pretty cool tool for making huge regexes readable.
> But, it turns out that python's auto-continuation and string
> literal concatenation rules are enough to let you get much the
> same effect.  Here's a regex we use to parse haproxy log files.
> This would be utter line noise all run together. This way, it's
> almost readable :-)
>
> pattern = re.compile(r'haproxy\[(?P<pid>\d+)]: '
>                      r'(?P<client_ip>(\d{1,3}\.){3}\d{1,3}):'
>                      r'(?P<client_port>\d{1,5}) '
>                      
> r'\[(?P<accept_date>\d{2}/\w{3}/\d{4}(:\d{2}){3}\.\d{3})] '
>                      r'(?P<frontend_name>\S+) '
>                      r'(?P<backend_name>\S+)/'
>                      r'(?P<server_name>\S+) '
>                      r'(?P<Tq>(-1|\d+))/'
>                      r'(?P<Tw>(-1|\d+))/'
>                      r'(?P<Tc>(-1|\d+))/'
>                      r'(?P<Tr>(-1|\d+))/'
>                      r'(?P<Tt>\+?\d+) '
>                      r'(?P<status_code>\d{3}) '
>                      r'(?P<bytes_read>\d+) '
>                      r'(?P<captured_request_cookie>\S+) '
>                      r'(?P<captured_response_cookie>\S+) '
>                      r'(?P<termination_state>[\w-]{4}) '
>                      r'(?P<actconn>\d+)/'
>                      r'(?P<feconn>\d+)/'
>                      r'(?P<beconn>\d+)/'
>                      r'(?P<srv_conn>\d+)/'
>                      r'(?P<retries>\d+) '
>                      r'(?P<srv_queue>\d+)/'
>                      r'(?P<backend_queue>\d+) '
>                      r'(\{(?P<request_id>.*?)\} )?'
>                      r'(\{(?P<captured_request_headers>.*?)\} )?'
>                      r'(\{(?P<captured_response_headers>.*?)\} )?'
>                      r'"(?P<http_request>.+)"'
>                      )
>
> And, for those of you who go running in the other direction every time 
> regex is suggested as a solution, I challenge you to come up with easier 
> to read (or write) code for parsing a line like this (probably 
> hopelessly mangled by the time you read it):
>
> 2013-04-03T00:00:00+00:00 localhost haproxy[5199]: 10.159.19.244:57291 
> [02/Apr/2013:23:59:59.811] app-nodes next-song-nodes/web8.songza.com 
> 0/0/3/214/219 200 593 sessionid=NWiX5KGOdvg6dSaA 
> sessionid=NWiX5KGOdvg6dSaA ---- 249/249/149/14/0 0/0 
> {4C0ABFA9-515B6DEF-933229} "POST 
> /api/1/station/892337/song/16024201/notify-play HTTP/1.0"

The big win from the above seems to me the groupdict result. The
parsing is also very simple, with virtually no nesting. It's a
good application of re. 

It seems easy enough to do with str methods, but would it be an
improvement?

I ran out of time before the prototype was finished, but here's a
sketch.


import re
import datetime
import pprint

s =('2013-04-03T00:00:00+00:00 localhost haproxy[5199]: 10.159.19.244:57291'
    ' [02/Apr/2013:23:59:59.811] app-nodes next-song-nodes/web8.songza.com'
    ' 0/0/3/214/219 200 593 sessionid=NWiX5KGOdvg6dSaA'
    ' sessionid=NWiX5KGOdvg6dSaA ---- 249/249/149/14/0 0/0'
    ' {4C0ABFA9-515B6DEF-933229}'
    ' "POST /api/1/station/892337/song/16024201/notify-play HTTP/1.0"')

def get_haproxy(s):
    prefix = 'haproxy['
    if s.startswith(prefix):
        return int(s[len(prefix):s.index(']')])
    return False

def get_client_info(s):
    ip, colon, port = s.partition(':')
    if colon != ':':
        return False
    else:
        return ip, int(port)

def get_accept_date(s):
    try:
        return datetime.datetime.strptime(s, '[%d/%b/%Y:%H:%M:%S.%f]')
    except ValueError:
        return False

def get_backend(s):
    name, slash, server = s.partition('/')
    if slash != '/':
        return False
    else:
        return name, server

def get_track_info(s):
    try:
        return s.split('/')
    except TypeError:
        return False

matchers = [
        (None, None),
        (None, 'localhost'),
        ('haproxy', get_haproxy),
        (('client_ip', 'client_port'), get_client_info), 
        ('accept_date', get_accept_date),
        ('frontend_name', lambda s: s),
        (('backend_name', 'server_name'), get_backend),
        (('Tq', 'Tw', 'Tc', 'Tr', 'Tt'), get_track_info),
        ]
result = {}

for i, s in enumerate(s.split()):
    if i < len(matchers): # I'm not finished writing matchers yet.
        key, matcher = matchers[i]
        if matcher is None:
            pass
        else:
            if isinstance(matcher, str):
                value = matcher == s
            else:
                value = matcher(s)
            if value is False:
                raise ValueError('Parse error {}: {} "{}"'.format(
                    key, matcher, s))
            if isinstance(key, tuple):
                result.update(zip(*[key, value]))
            elif key is not None:
                result[key] = value
pprint.pprint(result)

The engine would need to be improved in implementation and made
more flexible once it's working and tested. I think the error
handling is a good feature and the ability to customize parsing
and return custom types is cool.

-- 
Neil Cerutti

[toc] | [prev] | [next] | [standalone]

#42770

From	Kushal Kumaran <kushal.kumaran+python@gmail.com>
Date	2013-04-04 23:04 +0530
Message-ID	<mailman.111.1365096868.3114.python-list@python.org>
In reply to	#42751

Roy Smith <roy@panix.com> writes:

> In article <mailman.96.1365077619.3114.python-list@python.org>,
>  Jason Swails <jason.swails@gmail.com> wrote:
>
>> The only time I regularly break my rule is for regular expressions (at some
>> point I may embrace re.X to allow me to break those up, too).
>
> re.X is a pretty cool tool for making huge regexes readable.  But, it 
> turns out that python's auto-continuation and string literal 
> concatenation rules are enough to let you get much the same effect.  
> Here's a regex we use to parse haproxy log files. This would be utter 
> line noise all run together. This way, it's almost readable :-)
>
> pattern = re.compile(r'haproxy\[(?P<pid>\d+)]: '
>                      r'(?P<client_ip>(\d{1,3}\.){3}\d{1,3}):'
>                      r'(?P<client_port>\d{1,5}) '
>                      
> r'\[(?P<accept_date>\d{2}/\w{3}/\d{4}(:\d{2}){3}\.\d{3})] '
>                      r'(?P<frontend_name>\S+) '
>                      r'(?P<backend_name>\S+)/'
>                      r'(?P<server_name>\S+) '
>                      r'(?P<Tq>(-1|\d+))/'
>                      r'(?P<Tw>(-1|\d+))/'
>                      r'(?P<Tc>(-1|\d+))/'
>                      r'(?P<Tr>(-1|\d+))/'
>                      r'(?P<Tt>\+?\d+) '
>                      r'(?P<status_code>\d{3}) '
>                      r'(?P<bytes_read>\d+) '
>                      r'(?P<captured_request_cookie>\S+) '
>                      r'(?P<captured_response_cookie>\S+) '
>                      r'(?P<termination_state>[\w-]{4}) '
>                      r'(?P<actconn>\d+)/'
>                      r'(?P<feconn>\d+)/'
>                      r'(?P<beconn>\d+)/'
>                      r'(?P<srv_conn>\d+)/'
>                      r'(?P<retries>\d+) '
>                      r'(?P<srv_queue>\d+)/'
>                      r'(?P<backend_queue>\d+) '
>                      r'(\{(?P<request_id>.*?)\} )?'
>                      r'(\{(?P<captured_request_headers>.*?)\} )?'
>                      r'(\{(?P<captured_response_headers>.*?)\} )?'
>                      r'"(?P<http_request>.+)"'
>                      )
>
> And, for those of you who go running in the other direction every time 
> regex is suggested as a solution, I challenge you to come up with easier 
> to read (or write) code for parsing a line like this (probably 
> hopelessly mangled by the time you read it):
>
> 2013-04-03T00:00:00+00:00 localhost haproxy[5199]: 10.159.19.244:57291 
> [02/Apr/2013:23:59:59.811] app-nodes next-song-nodes/web8.songza.com 
> 0/0/3/214/219 200 593 sessionid=NWiX5KGOdvg6dSaA 
> sessionid=NWiX5KGOdvg6dSaA ---- 249/249/149/14/0 0/0 
> {4C0ABFA9-515B6DEF-933229} "POST 
> /api/1/station/892337/song/16024201/notify-play HTTP/1.0"

Is using csv.DictReader with delimiter=' ' not sufficient for this?  I
did not actually read the regular expression in its entirety.

-- 
regards,
kushal

[toc] | [prev] | [next] | [standalone]

Page 1 of 2 [1] 2 Next page →

csiph-web

In defence of 80-char lines

Contents

#42722 — In defence of 80-char lines

#42726

#42729

#42734

#42738

#42739

#42740

#42746

#42748

#42757

#42750

#42766

#42781

#42735

#42742

#42749

#42751

#42752

#42762

#42770