Path: csiph.com!aioe.org!.POSTED!not-for-mail From: Fillmore Newsgroups: comp.lang.python Subject: Re: non printable (moving away from Perl) Date: Fri, 11 Mar 2016 09:23:09 -0500 Organization: Aioe.org NNTP Server Lines: 16 Message-ID: References: NNTP-Posting-Host: QDVbxQc8gIT0P2duTbJvwA.user.gioia.aioe.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Complaints-To: abuse@aioe.org User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.4.0 X-Notice: Filtered by postfilter v. 0.8.2 Xref: csiph.com comp.lang.python:104618 On 03/11/2016 07:13 AM, Wolfgang Maier wrote: > One lesson for Perl regex users is that in Python many things can be solved without regexes. > How about defining: > > printable = {chr(n) for n in range(32, 127)} > > then using: > > if (set(my_string) - set(printable)): > break seems computationally heavy. I have a file with about 70k lines, of which only 20 contain "funny" chars. ANy idea on how I can create a script that compares Perl speed vs. Python speed in performing the cleaning operation?