Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #91942 > unrolled thread

Re: What sort of data structure to use?

Started byPeter Otten <__peter__@web.de>
First post2015-06-03 12:59 +0200
Last post2015-06-03 12:59 +0200
Articles 1 — 1 participant

Back to article view | Back to comp.lang.python

This discussion starts older than the indexed window; earlier articles aren't shown. The article labeled Started by below is the oldest one visible, not the original post.


Contents

  Re: What sort of data structure to use? Peter Otten <__peter__@web.de> - 2015-06-03 12:59 +0200

#91942 — Re: What sort of data structure to use?

FromPeter Otten <__peter__@web.de>
Date2015-06-03 12:59 +0200
SubjectRe: What sort of data structure to use?
Message-ID<mailman.100.1433329169.13271.python-list@python.org>
David Aldrich wrote:

> Hi
> 
> I have written a Python utility that performs a certain activity on some
> predefined sets of files.  Here is the outline of what I have written:
> 
> # File Set A
> pathA = 'pathA'
> fileListA = ['fileA1.txt', 'fileA2.txt']
> 
> # File Set B
> pathB = 'pathB'
> fileListB = ['fileB1.txt', 'fileB2.txt', 'fileB3.txt']
> 
> myFunc1(pathA, fileListA)
> myFunc2(pathA, fileListA)
> 
> myFunc1(pathB, fileListB)
> myFunc2(pathB, fileListB)
> 
> I want to add more file sets, so I really want to add the sets to a list
> and iterate over the list, calling myFunc1 & myFunc2 for each item.
> 
> My question is: what sort of data structure could I use to organise this,
> given that I want to associate a set of files with each path and that, for
> each set, there is an arbitrary number of files?

I'd start simple and put (path, files) pairs into a list:

path_files_pairs = [
    ("pathA", ["fileA1.txt", "fileA2.txt", ...]),
    ("pathB", ["fileB1.txt", ...]),
]

for path, files in path_files_pairs:
    func1(path, files)
    func2(path, files)

You can always add complications later:

import glob
import os

class VirtualFileset:
    def __init__(self, folder, pattern):
        self.folder = folder
        self.pattern = pattern
    def __iter__(self):
        yield self.folder
        yield glob.glob(os.path.join(self.folder, self.pattern))

path_files_pairs = [
    ("pathA", ["fileA1.txt", "fileA2.txt", ...]),
    ("pathB", ["fileB1.txt", ...]),
    VirtualFileset("pathC", "*.py"), # all python files in directory "pathC"
]

for path, files in path_files_pairs:
    func1(path, files)
    func2(path, files)

[toc] | [standalone]


Back to top | Article view | comp.lang.python


csiph-web