From: Chet Ramey
Newsgroups: gnu.bash.bug
Subject: Re: Bash removes unrequested characters in bracket expressions (not a range).
Date: Sat, 24 Nov 2018 14:06:21 -0500
Organization: ITS, Case Western Reserve University
Reply-To: chet.ramey@case.edu
To: Bize Ma, bug-bash, bash@packages.debian.org
Cc: chet.ramey@case.edu

On 11/23/18 6:09 PM, Bize Ma wrote:

> Bash Version: 4.4
> Patch Level: 12
> Release Status: release
>
> Description:
>
> Bash is removing characters not explicitly listed in a bracket
> expression (character range).
> In this example, it is removing digits from other languages.
>
> Also tested (and it fails) in bash 3.{0,1,3} 4.{1,2,3} and 5.0
> Not a problem in bash 2.{0,1}

I can't reproduce this:

$ cat ./x4
a='0123456789 ٠١٢٣٤٥٦٧٨٩ ۰۱۲۳۴۵۶۷۸۹ ߀߁߂߃߄߅߆߇߈߉ ०१२३४५६७८९'
recho "${a}"
recho "${a//[0123456789]}"
$ ../bash-5.0-beta2/bash ./x4
argv[1] = <0123456789 ٠١٢٣٤٥٦٧٨٩ ۰۱۲۳۴۵۶۷۸۹ ߀߁߂߃߄߅߆߇߈߉ ०१२३४५६७८९>
argv[1] = < ٠١٢٣٤٥٦٧٨٩ ۰۱۲۳۴۵۶۷۸۹ ߀߁߂߃߄߅߆߇߈߉ ०१२३४५६७८९>
$ ../bash-4.4-patched/bash ./x4
argv[1] = <0123456789 ٠١٢٣٤٥٦٧٨٩ ۰۱۲۳۴۵۶۷۸۹ ߀߁߂߃߄߅߆߇߈߉ ०१२३४५६७८९>
argv[1] = < ٠١٢٣٤٥٦٧٨٩ ۰۱۲۳۴۵۶۷۸۹ ߀߁߂߃߄߅߆߇߈߉ ०१२३४५६७८९>

-- 
``The lyf so short, the craft so long to lerne.'' - Chaucer
``Ars longa, vita brevis'' - Hippocrates
Chet Ramey, UTech, CWRU chet@case.edu http://tiswww.cwru.edu/~chet/
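The x4 script relies on recho, a small helper built from the bash source tree that may not be installed elsewhere. A rough equivalent using only printf is sketched below, for anyone who wants to repeat the test against several bash binaries. The helper name strip_digits and the shortened sample string are assumptions made for illustration; they are not part of bash or of the original report.

```shell
# Sample string: ASCII digits, then Arabic-Indic and Devanagari digits
# (a shortened version of the string used in x4).
a='0123456789 ٠١٢٣٤٥٦٧٨٩ ०१२३४५६७८९'

# strip_digits BASH_BINARY STRING
# Hypothetical helper: runs the same ${var//[0123456789]} substitution under
# the given bash binary and prints the result in recho's "argv[1] = <...>" style.
strip_digits() {
    "$1" -c 'printf "argv[1] = <%s>\n" "${1//[0123456789]}"' bash "$2"
}

# Report which locale is in effect, since results may differ between systems.
printf 'locale: %s\n' "${LC_ALL:-${LC_CTYPE:-${LANG:-unset}}}"

# Run against whichever bash is first in PATH; pass another path to compare builds.
strip_digits bash "$a"
```

If only the ASCII digits disappear, the output matches the runs shown above. If the other digits disappear as well, the bash version and the locale line are worth including in a follow-up report.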