Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > gnu.utils.bug > #2184

Diff doesn't properly ignore whitespace for this input

From Tyler Bletsch <tkbletsc@ncsu.edu>
Newsgroups gnu.utils.bug
Subject Diff doesn't properly ignore whitespace for this input
Date 2015-07-14 15:01 -0400
Message-ID <mailman.6949.1436953025.904.bug-gnu-utils@gnu.org> (permalink)

Show all headers | View raw


I believe I've found a bug in diff's handling of "ignore whitespace" 
mode. I have two test files that differ only in whitespace and newlines; 
I've verified this using a separate tool (WinMerge) plus doing a diff on 
the files after doing s/\s*/ / on the whole file. When I ask for the 
diff using "-wb", it reports a spurious difference only in whitespace if 
I give the arguments in one order, but correctly reports no differences 
if I give it the reverse order.  Further, I get consistently correct 
behavior if I add the "-d" option.

Example:

$ diff -wB in1.txt in2.txt
3946c4201,4203
< Exits:
---
 >
 >
 > Exits:
$ diff -wB in2.txt in1.txt
$ diff -dwB in1.txt in2.txt
$ diff -dwB in2.txt in1.txt

This came up while using diff to automatically grade a text adventure 
I'm having students do in my class -- this is the ONLY file pair out of 
over 3000 that appears to exhibit the problem. This leads me to believe 
that it must be a fairly rare issue. I'm fixing it on my end by always 
using -d, but I think this should be classified as a bug, because it 
reports a non-whitespace difference in files where none exists.

I'm not sure if this mailing list allows attachments, so I've put the 
files in question here:

https://dl.dropboxusercontent.com/u/68643317/diff-bug-test-files.zip

I tried paring the files down to just demonstrate the bug and nothing 
else, but the behavior would seemingly go away at random as I removed 
content from the files. Therefore, I'm including the files in their 
original form. The files represent test output of the text adventure, 
specifically navigation of the default world from the ROM 2.4b6 MUD 
(after having been converted to a format for my class's assignment). 
This content is safe to share.

I've confirmed that this behavior is present in the following builds of 
diff:
- diff (GNU diffutils) 2.8.1 on Red Hat Enterprise Linux Server release 
6.5 (Santiago)
- diff (GNU diffutils) 3.2 on Ubuntu 12.04.4 LTS
- diff (GNU diffutils) 2.9 on Cygwin 32-bit (Windows 7 x64)

Let me know if there's any further information I can provide that might 
assist. Thanks for producing quality utilities used the world over!

Regards,
Dr. Tyler Bletsch
Adjunct Professor, NC State University

Back to gnu.utils.bug | Previous | Next | Find similar


Thread

Diff doesn't properly ignore whitespace for this input Tyler Bletsch <tkbletsc@ncsu.edu> - 2015-07-14 15:01 -0400

csiph-web