Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > comp.lang.python > #101714
| Path | csiph.com!fu-berlin.de!uni-berlin.de!not-for-mail |
|---|---|
| From | Nathan Hilterbrand <nhilterbrand@gmail.com> |
| Newsgroups | comp.lang.python |
| Subject | Re: ignoring or replacing white lines in a diff |
| Date | Thu, 14 Jan 2016 15:57:08 -0500 |
| Lines | 143 |
| Message-ID | <mailman.169.1452805039.13488.python-list@python.org> (permalink) |
| References | <mailman.159.1452786947.13488.python-list@python.org> <5697f08f$0$23822$e4fe514c@news.xs4all.nl> <5698118F0200001B0003B68D@gwsmtp1.astron.nl> |
| Mime-Version | 1.0 |
| Content-Type | text/plain; charset=windows-1252; format=flowed |
| Content-Transfer-Encoding | 7bit |
| X-Trace | news.uni-berlin.de kcDLHcUKi50XptRc6j0hTAWkO9saZg1IUwmrF2+xdm6g== |
| Return-Path | <nhilterbrand@gmail.com> |
| X-Original-To | python-list@python.org |
| Delivered-To | python-list@mail.python.org |
| X-Spam-Status | OK 0.000 |
| X-Spam-Evidence | '*H*': 1.00; '*S*': 0.00; 'diff': 0.05; 'fixes': 0.05; 'below)': 0.07; 'rewrite': 0.07; '101': 0.09; 'behave': 0.09; 'cmd': 0.09; 'here?': 0.09; 'long"': 0.09; 'python:': 0.09; 'regression': 0.09; 'xml.': 0.09; 'python': 0.10; 'template': 0.11; 'output': 0.13; 'suggest': 0.15; 'options.': 0.15; 'properly': 0.15; '(diff': 0.16; "(it's": 0.16; 'bits,': 0.16; 'fluent': 0.16; 'ignores': 0.16; 'loops': 0.16; 'observatory': 0.16; 'osx': 0.16; 'received:io': 0.16; 'received:psf.io': 0.16; 'url:gnu': 0.16; 'wrote:': 0.16; 'version.': 0.18; '(on': 0.22; 'file:': 0.22; 'file.': 0.22; 'trying': 0.22; 'needed.': 0.23; 'replacing': 0.23; 'tried': 0.24; 'import': 0.24; 'xml': 0.24; 'header:In-Reply-To:1': 0.24; 'module': 0.25; 'script': 0.25; "i've": 0.25; 'header:User-Agent:1': 0.26; "doesn't": 0.26; 'command': 0.26; 'appreciated.': 0.27; 'earlier': 0.27; 'point.': 0.27; 'compare': 0.27; 'skip:f 40': 0.27; "skip:' 10": 0.28; '---': 0.28; 'skip:/ 80': 0.29; 'url:software': 0.29; 'whitespace': 0.29; "i'm": 0.30; 'print': 0.30; 'work.': 0.30; 'maybe': 0.33; 'problem': 0.33; 'option.': 0.33; 'quotes': 0.33; 'phone:': 0.33; 'message-id:@gmail.com': 0.34; '(for': 0.34; 'running': 0.34; 'received:google.com': 0.35; 'quite': 0.35; 'according': 0.36; 'should': 0.36; 'url:org': 0.36; 'lines': 0.36; 'received:209.85': 0.36; 'to:addr:python-list': 0.36; 'pm,': 0.36; 'subject:: ': 0.37; 'received:10': 0.37; 'two': 0.37; 'thanks': 0.37; 'seem': 0.37; 'starting': 0.37; 'things': 0.38; 'version': 0.38; 'received:209': 0.38; 'wrong': 0.38; 'someone': 0.38; 'files': 0.38; 'received:209.85.220': 0.38; 'test': 0.39; 'does': 0.39; 'skip:x 10': 0.40; 'to:addr:python.org': 0.40; 'some': 0.40; 'software': 0.40; 'email:': 0.62; 'charset:windows-1252': 0.62; 'between': 0.65; 'differences': 0.66; 'here': 0.66; 'results': 0.66; 'reply': 0.68; 'received:10.10': 0.76; '100': 0.79; 'escaping': 0.84; 'nathan': 0.84; 'netherlands': 0.84; 'received:10.10.10': 0.84 |
| DKIM-Signature | v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-type:content-transfer-encoding; bh=cGdlBrfnMA2ngo8JyeMDNwyBePvGVtSGSmPZY7e9xRc=; b=iy72WPGROahAyOzUJ7gpnszqTde61JDqx0J0SQIgbHeCNFuuRLe7qFaf1rxUsycUsg OvliNpJjHkNWseZ1UjqI8yLFC24ClcllELacpFqY+ZcjuXTqiycx9GK00ZOSz0FBd1ao Fv6CQuMfvaiJ0NbZDsRbbntVQnzD5Zxpo9ylZVuL2Y7kZJzuz523y0hMfJA6Q2u6dDEe yVm2S1sEWXKMY/fVDQq8F9peih5bTb3rn3c4A9NDIsoC2B9YY4+oT3ChEkDI9gh/bsPz 6mB/NlGpuUP806hrTUbXt5ST8ZVIOih6kBeoW0my1KNjHfr0xSICfcwsfgC0Disev+Qi vR5w== |
| X-Google-DKIM-Signature | v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:subject:to:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-type :content-transfer-encoding; bh=cGdlBrfnMA2ngo8JyeMDNwyBePvGVtSGSmPZY7e9xRc=; b=ieLxE7JHAM8p4ZVu3cl54v0o3qK0O+r7g98zgg+T8aAPfYQ7g87pikgwTJbfyeZA28 jF7oXbAsQD+fZnaz6kU4Unm5Eo47Ym+77HnUySRjEDzo2W+I/pdJclKh1Ep56Pm4T3up njSE9cVX1Y/yvQFwdWTaPXSakryr/9mvlXRrK2CAHGt4JCOfUoMScI+cWMCbVlqDV+97 wa5C18YM6RM+xFqnQcxSydHypcDfO7G703/k2HugGx+KvPZRa3dB2VMHdbYWnEouBPfi v7NhMMcoyI/bSm0/PYaRK7gcyTnhwMixwmFEAFRwPzAxKVDGtFAkrwPD5cnA2vmrTLMB 5B7Q== |
| X-Gm-Message-State | ALoCoQnKtycDwsfQSlUzVh2PpUGeONXpe6gIssi430akZGIpTlazP5A5UejbI3WyvXsMFCHhcrGxHiQXR9qz1QjrGTiCC0QlSQ== |
| X-Received | by 10.55.27.98 with SMTP id b95mr8744325qkb.51.1452805030084; Thu, 14 Jan 2016 12:57:10 -0800 (PST) |
| User-Agent | Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.4.0 |
| In-Reply-To | <5698118F0200001B0003B68D@gwsmtp1.astron.nl> |
| X-BeenThere | python-list@python.org |
| X-Mailman-Version | 2.1.20+ |
| Precedence | list |
| List-Id | General discussion list for the Python programming language <python-list.python.org> |
| List-Unsubscribe | <https://mail.python.org/mailman/options/python-list>, <mailto:python-list-request@python.org?subject=unsubscribe> |
| List-Archive | <http://mail.python.org/pipermail/python-list/> |
| List-Post | <mailto:python-list@python.org> |
| List-Help | <mailto:python-list-request@python.org?subject=help> |
| List-Subscribe | <https://mail.python.org/mailman/listinfo/python-list>, <mailto:python-list-request@python.org?subject=subscribe> |
| Xref | csiph.com comp.lang.python:101714 |
Show key headers only | View raw
On 01/14/2016 03:22 PM, Adriaan Renting wrote:
> Maybe someone here has a clue what is going wrong here? Any help is
> appreciated.
>
> I'm writing a regression test for a module that generates XML.
>
> I'm using diff to compare the results with a pregenerated one from an
> earlier version.
>
> I'm running into two problems:
>
> The diff doesn't seem to behave properly with the -B option. (diff (GNU
> diffutils) 2.8.1 on OSX 10.9)
>
> Replacing -B with -I '^[[:space:]]*$' fixes it on the command line,
> which should be exactly the same according to:
> http://www.gnu.org/software/diffutils/manual/html_node/Blank-Lines.html#Blank-Lines
>
> (for Python problem continue below)
>
> MacRenting 21:00-159> diff -w -B test.xml xml/Ticket_6923.xml
> 3,5c3,5
> < <version>2.15.0</version>
> < <template version="2.15.0" author="Alwin de Jong,Adriaan Renting"
> changedBy="Adriaan Renting">
> < <description>XML Template generator version 2.15.0</description>
> ---
>> <version>2.6.0</version>
>> <template version="2.6.0" author="Alwin de Jong"
> changedBy="Alwin de Jong">
>> <description>XML Template generator version
> 2.6.0</description>
> 113d112
> <
> 163d161
> <
> 213d210
> <
> 258d254
> <
> 369d364
> <
> 419d413
> <
> 469d462
> <
> 514d506
> <
> 625d616
> <
> 675d665
> <
> 725d714
> <
> 770d758
> <
> 881d868
> <
> 931d917
> <
> 981d966
> <
> 1026d1010
> <
> 1137d1120
> <
> 1187d1169
> <
> 1237d1218
> <
> 1282d1262
> <
>
> /Users/renting/src/CEP4-DevelopClusterModel-Story-Task8432-SAS/XML_generator/test
> MacRenting 21:00-160> diff -w -I '^[[:space:]]*$' test.xml
> xml/Ticket_6923.xml
> 3,5c3,5
> < <version>2.15.0</version>
> < <template version="2.15.0" author="Alwin de Jong,Adriaan Renting"
> changedBy="Adriaan Renting">
> < <description>XML Template generator version 2.15.0</description>
> ---
>> <version>2.6.0</version>
>> <template version="2.6.0" author="Alwin de Jong"
> changedBy="Alwin de Jong">
>> <description>XML Template generator version
> 2.6.0</description>
>
>
> Now I try to use this in Python:
>
> cmd = ["diff", "-w", "-I '^[[:space:]]*$'", "./xml/%s.xml" %
> name, "test.xml"]
> ## -w ignores differences in whitespace
> ## -I '^[[:space:]]*$' because -B doesn't work for blank lines
> (on OSX?)
> p = subprocess.Popen(cmd, stdin=open('/dev/null'),
> stdout=subprocess.PIPE, stderr=subprocess.PIPE)
> logs = p.communicate()
> diffs = logs[0].splitlines() #stdout
> print "diff reply was %i lines long" % len(diffs)
>
> This doesn't work. I've tried escaping the various bits, like the * and
> $, even though with single quotes that should not be needed.
>
> I tried first removing the blank lines from the file:
>
> import fileinput
> for line in fileinput.FileInput("test.xml",inplace=1):
> if line.rstrip():
> print line
>
> This makes it worse, as it adds and empty line for each line in the
> file.
>
> I've tried various other options. The only thing I can think of, is
> ditching Python and trying to rewrite the whole script in Bash.
> (It's quite complicated, as it loops over various things and does some
> pretty output in between and I'm not very fluent in Bash)
>
> Any suggestions?
>
> Thanks for any help provided.
>
> Adriaan Renting.
>
>
> Adriaan Renting | Email: renting@astron.nl
> Software Engineer Radio Observatory
> ASTRON | Phone: +31 521 595 100 (797 direct)
> P.O. Box 2 | GSM: +31 6 24 25 17 28
> NL-7990 AA Dwingeloo | FAX: +31 521 595 101
> The Netherlands | Web: http://www.astron.nl/~renting/
>
>
Without having the files handy to test with, I would suggest that you
try replacing "-I '^[[:space:]]*$'" with r"-I '^[[:space:]]*$'" as a
starting point.
Nathan
Back to comp.lang.python | Previous | Next — Previous in thread | Next in thread | Find similar | Unroll thread
problem Shivam Gupta <mailtoshivamgupta@gmail.com> - 2016-01-14 21:23 +0530
Re: problem Irmen de Jong <irmen.NOSPAM@xs4all.nl> - 2016-01-14 20:01 +0100
ignoring or replacing white lines in a diff "Adriaan Renting" <renting@astron.nl> - 2016-01-14 21:22 +0100
Re: ignoring or replacing white lines in a diff Zachary Ware <zachary.ware+pylist@gmail.com> - 2016-01-14 14:54 -0600
Re: ignoring or replacing white lines in a diff Nathan Hilterbrand <nhilterbrand@gmail.com> - 2016-01-14 15:57 -0500
Re: ignoring or replacing white lines in a diff "Martin A. Brown" <martin@linux-ip.net> - 2016-01-14 12:57 -0800
Re: ignoring or replacing white lines in a diff Peter Otten <__peter__@web.de> - 2016-01-14 22:05 +0100
Re: problem Chris Angelico <rosuav@gmail.com> - 2016-01-15 08:10 +1100
Re: problem eryk sun <eryksun@gmail.com> - 2016-01-14 15:39 -0600
Re: ignoring or replacing white lines in a diff "Adriaan Renting" <renting@astron.nl> - 2016-01-15 10:44 +0100
Re: ignoring or replacing white lines in a diff Peter Otten <__peter__@web.de> - 2016-01-15 11:20 +0100
csiph-web