Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #107485

Re: delete from pattern to pattern if it contains match

From Peter Otten <__peter__@web.de>
Newsgroups comp.lang.python
Subject Re: delete from pattern to pattern if it contains match
Date 2016-04-22 11:24 +0200
Organization None
Message-ID <mailman.11.1461317067.2861.python-list@python.org> (permalink)
References <20c0b0fe-136b-4b01-b004-c55c6d47b299@googlegroups.com> <91432d7b-7233-4504-a725-22bc81637ea3@googlegroups.com> <lf5fuufqe81.fsf@ling.helsinki.fi> <991c5867-27d1-4e75-aa52-a7d47e626b74@googlegroups.com> <nfcqjs$guu$1@ger.gmane.org>

Show all headers | View raw


harirammanohar@gmail.com wrote:

> On Thursday, April 21, 2016 at 7:03:00 PM UTC+5:30, Jussi Piitulainen
> wrote:
>> harirammanohar@gmail.com writes:
>> 
>> > On Monday, April 18, 2016 at 12:38:03 PM UTC+5:30,
>> > hariram...@gmail.com wrote:
>> >> HI All,
>> >> 
>> >> can you help me out in doing below.
>> >> 
>> >> file:
>> >> <start>
>> >>  guava
>> >> fruit
>> >> <end>
>> >> <start>
>> >>  mango
>> >> fruit
>> >> <end>
>> >> <start>
>> >>  orange
>> >> fruit
>> >> <end>
>> >> 
>> >> need to delete from start to end if it contains mango in a file...
>> >> 
>> >> output should be:
>> >> 
>> >> <start>
>> >>  guava
>> >> fruit
>> >> <end>
>> >> <start>
>> >>  orange
>> >> fruit
>> >> <end>
>> >> 
>> >> Thank you
>> >
>> > any one can guide me ? why xml tree parsing is not working if i have
>> > root.tag and root.attrib as mentioned in earlier post...
>> 
>> Assuming the real consists of lines between a start marker and end
>> marker, a winning plan is to collect a group of lines, deal with it, and
>> move on.
>> 
>> The following code implements something close to the plan. You need to
>> adapt it a bit to have your own source of lines and to restore the end
>> marker in the output and to account for your real use case and for
>> differences in taste and judgment. - The plan is as described above, but
>> there are many ways to implement it.
>> 
>> from io import StringIO
>> 
>> text = '''\
>> <start>
>>   guava
>> fruit
>> <end>
>> <start>
>>   mango
>> fruit
>> <end>
>> <start>
>>   orange
>> fruit
>> <end>
>> '''
>> 
>> def records(source):
>>     current = []
>>     for line in source:
>>         if line.startswith('<end>'):
>>             yield current
>>             current = []
>>         else:
>>             current.append(line)
>> 
>> def hasmango(record):
>>     return any('mango' in it for it in record)
>> 
>> for record in records(StringIO(text)):
>>     hasmango(record) or print(*record)
> 
> Hi,
> 
> not working....this is the output i am getting...
> 
> \

This means that the line

>> text = '''\

has trailing whitespace in your copy of the script.

>  <start>
>    guava
>  fruit
> 
> <start>
>    orange
>  fruit

Jussi forgot to add the "<end>..." line to the group. To fix this change the 
generator to

def records(source):
    current = []
    for line in source:
        current.append(line)
        if line.startswith('<end>'):
            yield current
            current = []


>>     hasmango(record) or print(*record)

The

print(*record)

inserts spaces between record entries (i. e. at the beginning of all lines 
except the first) and adds a trailing newline. You can avoid this by 
specifying the delimiters explicitly:

if not hasmango(record):
    print(*record, sep="", end="")

Even with these changes code still looks somewhat brittle...

Back to comp.lang.python | Previous | NextPrevious in thread | Next in thread | Find similar | Unroll thread


Thread

delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-18 00:07 -0700
  RE: delete from pattern to pattern if it contains match Joaquin Alzola <Joaquin.Alzola@lebara.com> - 2016-04-18 07:49 +0000
    Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-18 01:52 -0700
    Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-18 21:01 -0700
  Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-21 03:17 -0700
    Re: delete from pattern to pattern if it contains match Peter Otten <__peter__@web.de> - 2016-04-21 13:24 +0200
      Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-22 02:00 -0700
        Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-22 02:14 -0700
          Re: delete from pattern to pattern if it contains match Peter Otten <__peter__@web.de> - 2016-04-22 11:50 +0200
            Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-24 23:24 -0700
    Re: delete from pattern to pattern if it contains match Jussi Piitulainen <jussi.piitulainen@helsinki.fi> - 2016-04-21 16:32 +0300
      Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-22 01:59 -0700
        Re: delete from pattern to pattern if it contains match Peter Otten <__peter__@web.de> - 2016-04-22 11:24 +0200
          Re: delete from pattern to pattern if it contains match Jussi Piitulainen <jussi.piitulainen@helsinki.fi> - 2016-04-22 14:10 +0300
            Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-24 23:29 -0700
              Re: delete from pattern to pattern if it contains match Jussi Piitulainen <jussi.piitulainen@helsinki.fi> - 2016-04-25 10:17 +0300
                Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-25 02:49 -0700
                Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-25 02:53 -0700
                Re: delete from pattern to pattern if it contains match Jussi Piitulainen <jussi.piitulainen@helsinki.fi> - 2016-04-25 13:37 +0300
                Re: delete from pattern to pattern if it contains match Peter Otten <__peter__@web.de> - 2016-04-25 12:13 +0200
                Re: delete from pattern to pattern if it contains match Jussi Piitulainen <jussi.piitulainen@helsinki.fi> - 2016-04-25 13:39 +0300
                Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-25 04:02 -0700
                Re: delete from pattern to pattern if it contains match Jussi Piitulainen <jussi.piitulainen@helsinki.fi> - 2016-04-25 14:28 +0300
                Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-25 04:40 -0700
                Re: delete from pattern to pattern if it contains match Jussi Piitulainen <jussi.piitulainen@helsinki.fi> - 2016-04-25 15:00 +0300
                Re: delete from pattern to pattern if it contains match Peter Otten <__peter__@web.de> - 2016-04-25 14:33 +0200
                Re: delete from pattern to pattern if it contains match harirammanohar@gmail.com - 2016-04-26 03:31 -0700
                Re: delete from pattern to pattern if it contains match Jussi Piitulainen <jussi.piitulainen@helsinki.fi> - 2016-04-25 13:24 +0300
                RE: delete from pattern to pattern if it contains match Joaquin Alzola <Joaquin.Alzola@lebara.com> - 2016-04-25 10:19 +0000

csiph-web