From: Robert Kern
Subject: Re: Getting a list of all modules
Date: Wed, 30 Jul 2014 11:35:41 +0100
Newsgroups: comp.lang.python

On 2014-07-30 09:46, Peter Otten wrote:
> Steven D'Aprano wrote:
>
>> I'm looking for a programmatic way to get a list of all Python modules
>> and packages. Not just those already imported, but all those which
>> *could* be imported.
>>
>> I have a quick-and-dirty function which half does the job:
>>
>>
>> def get_modules():
>>     extensions = ('.py', '.pyc', '.pyo', '.so', '.dll')
>>     matches = set()
>>     for location in sys.path:
>>         if location == '': location = '.'
>>         if os.path.isdir(location):
>>             for name in os.listdir(location):
>>                 base, ext = os.path.splitext(name)
>>                 if ext in extensions:
>>                     matches.add(base)
>>     return sorted(matches)
>>
>>
>>
>> but I know it's wrong (it doesn't handle packages correctly, or zip
>> files, doesn't follow .pth files, has a very naive understanding of
>> cross-platform issues, fails to include built-in modules that don't
>> live in the file system, and probably more).
>>
>> Is this problem already solved? Can anyone make any suggestions?
>
> $ python3 -m pydoc -b
>
> shows a page with modules that I think is more complete than what you have.
> A quick glance at the implementation suggests that the hard work is done by
>
> pkgutil.iter_modules()

There are two niggles to this answer: it omits builtin modules, but those are
easily discovered through sys.builtin_module_names. It can also include
spurious script .py files that cannot be imported because their names are not
Python identifiers: e.g. check-newconfigs.py. Those are easy to filter out,
fortunately.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
 that is made terrible by our own mad attempt to interpret it as though it had
 an underlying truth."
  -- Umberto Eco
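
The two adjustments described above can be sketched roughly as follows. This is only an illustration, not pydoc's own code: it assumes Python 3 (for str.isidentifier()), the function name get_importable_modules is made up for the example, and it lists top-level names only, without descending into subpackages.

```python
import pkgutil
import sys


def get_importable_modules():
    """Return a sorted list of top-level module names that could be imported.

    Hypothetical helper for illustration; the name is not from any library.
    """
    # Built-in modules are compiled into the interpreter and have no file on
    # sys.path, so pkgutil.iter_modules() does not report them.
    names = set(sys.builtin_module_names)

    # With no arguments, iter_modules() scans the entries of sys.path and
    # yields (module_finder, name, ispkg) for each top-level module/package.
    for _finder, name, _ispkg in pkgutil.iter_modules():
        # Skip script files such as check-newconfigs.py: their names are not
        # valid identifiers, so they cannot be used in an import statement.
        if name.isidentifier():
            names.add(name)

    return sorted(names)


if __name__ == "__main__":
    modules = get_importable_modules()
    print(len(modules), "importable top-level modules found")
```

The identifier check is the simplest filter that matches the description; it probably still lets through names that are Python keywords, which would need an extra keyword.iskeyword() test if that matters.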