MIME-Version: 1.0
In-Reply-To: <7e82becd-77c1-4800-8f4e-7624b19de82b@googlegroups.com>
References: <08ae2828-1532-47b6-a9cb-208549189467@googlegroups.com> <b3gnnaFbe60U1@mid.individual.net> <8ea32ea7-2cee-4e61-8cbd-066721d88d4a@googlegroups.com> <I9GAt.794$ct1.646@newsfe07.iad> <b3gqi7FbvfsU1@mid.individual.net> <7e82becd-77c1-4800-8f4e-7624b19de82b@googlegroups.com>
From: Joshua Landau <joshua.landau.ws@gmail.com>
Date: Tue, 2 Jul 2013 21:56:55 +0100
Subject: Re: Parsing Text file
To: sas429s@gmail.com
Content-Type: text/plain; charset=UTF-8
Cc: python-list <python-list@python.org>
Precedence: list
Newsgroups: comp.lang.python
Message-ID: <mailman.4126.1372798662.3114.python-list@python.org>
Lines: 65
NNTP-Posting-Host: 2001:888:2000:d::a6
Xref: csiph.com comp.lang.python:49656

On 2 July 2013 21:28,  <sas429s@gmail.com> wrote:
> Here I am looking for the line that contains: "WORK_MODE_MASK", I want to print that line as well as the file name above it: config/meal/governor_mode_config.h
> or config/meal/components/source/ceal_PackD_kso_aic_core_config.h.
>
> SO the output should be something like this:
> config/meal/governor_mode_config.h
>
> #define GOVERNOR_MODE_WORK_MODE_MASK    (CEAL_MODE_WORK_MASK_GEAR| \
>                                            CEAL_MODE_WORK_MASK_PARK_BRAKE | \
>                                            CEAL_MODE_WORK_MASK_VEHICLE_SPEED)
>
> config/meal/components/source/kso_aic_core_config.h
> #define CEAL_KSO_AIC_WORK_MODE_MASK   (CEAL_MODE_WORK_MASK_GEAR       | \
>                                    CEAL_MODE_WORK_MASK_PARK_BRAKE | \
>                                    CEAL_MODE_WORK_MASK_VEHICLE_SPEED)

(Please don't top-post.)

    filename = None

    with open("tmp.txt") as file:
        # Lines keep their trailing "\n", so test the stripped line
        nonblanklines = (line for line in file if line.strip())

        for line in nonblanklines:
            if line.lstrip().startswith("#define"):
                defn, name, *other = line.split()
                if name.endswith("WORK_MODE_MASK"):
                    print(filename, line, sep="")

            else:
                # Anything else is taken to be a file name
                filename = line

Basically, you loop through the lines, remembering the last line that
isn't a #define as the file name, skipping blank lines, and printing any
#define whose name ends in WORK_MODE_MASK. If that doesn't match your
file format, you'll 'ave to tell me more about the edge-cases.
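
To make that concrete, here's the same loop as a function you can run
on an in-memory sample. The name find_masks and the sample text are my
own invention (I'm guessing at what your tmp.txt looks like), so treat
it as a sketch rather than the final answer:

```python
def find_masks(lines, suffix="WORK_MODE_MASK"):
    """Return (filename, define_line) pairs for #defines whose name ends with suffix."""
    found = []
    filename = None
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        if line.lstrip().startswith("#define"):
            parts = line.split()
            # parts[1] is the macro name, when there is one
            if len(parts) > 1 and parts[1].endswith(suffix):
                found.append((filename, line.rstrip("\n")))
        else:
            # any other non-blank line is taken to be a file name
            filename = line.strip()
    return found


# Hypothetical input: a file name followed by the #defines found in it.
sample = """\
config/a.h

#define A_WORK_MODE_MASK (X | Y)
#define A_OTHER 1

config/b.h
#define B_WORK_MODE_MASK (Z)
"""

for name, define in find_masks(sample.splitlines(keepends=True)):
    print(name)
    print(define)
```

Returning pairs instead of printing straight away makes it easier to
check the result against a few hand-written cases.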

You said that

> #define CEAL_KSO_AIC_WORK_MODE_MASK   (CEAL_MODE_WORK_MASK_GEAR       | \
>                                    CEAL_MODE_WORK_MASK_PARK_BRAKE | \
>                                    CEAL_MODE_WORK_MASK_VEHICLE_SPEED)

was one line. If it is not, I suggest a pre-processing pass to "wrap"
lines with trailing "\"s (join each one to the line after it) before
running the algorithm:

    def wrapped(lines):
        wrap = ""
        for line in lines:
            if line.rstrip().endswith("\\"):
                # Continuation line: hold on to it
                wrap += line

            else:
                yield wrap + line
                wrap = ""

        if wrap:
            # Input ended on a continuation line
            yield wrap

...
        nonblanklines = (line for line in wrapped(file) if line.strip())
...


This doesn't handle all wrapped lines properly: it leaves the "\"
in, which may interfere with matching. That's easily fixable, and there
are many other ways to do this.
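
One possible fix, as a sketch: drop the trailing "\" and join the
pieces with single spaces. The name joined is mine, and note that this
flattens each definition onto one line, so what gets printed will no
longer look like the multi-line original:

```python
def joined(lines):
    """Yield logical lines, merging lines that end in a backslash with the next one."""
    parts = []
    for line in lines:
        stripped = line.rstrip()
        if stripped.endswith("\\"):
            # drop the backslash and keep collecting
            parts.append(stripped[:-1].strip())
        else:
            parts.append(line.strip())
            yield " ".join(parts) + "\n"
            parts = []
    if parts:
        # input ended on a continuation line; emit what we have
        yield " ".join(parts) + "\n"


# Hypothetical sample: one #define spread over three physical lines.
lines = [
    "#define M (A | \\\n",
    "           B | \\\n",
    "           C)\n",
    "\n",
]
for logical in joined(lines):
    print(repr(logical))
```

Blank lines come out as a bare "\n", so a filter that tests
line.strip() still skips them.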

What did you try?