Path: csiph.com!usenet.pasdenom.info!weretis.net!feeder4.news.weretis.net!rt.uk.eu.org!newsfeed.xs4all.nl!newsfeed3a.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.093 X-Spam-Evidence: '*H*': 0.82; '*S*': 0.00; 'preferably': 0.05; 'subject:Python': 0.06; '22,': 0.09; 'subject:software': 0.09; 'cc:addr:python-list': 0.11; 'jumping': 0.16; 'received:mac.com': 0.16; 'relevance': 0.16; 'wrote:': 0.18; 'received:10.0.1': 0.19; 'not,': 0.20; 'help.': 0.21; 'cc:addr:python.org': 0.22; 'cc:2**1': 0.23; 'guys': 0.24; 'cc:no real name:2**0': 0.24; 'sort': 0.25; 'script': 0.25; 'academic': 0.26; 'am,': 0.29; 'words': 0.29; "i'm": 0.30; 'url:mailman': 0.30; 'serve': 0.31; 'keywords,': 0.31; 'open': 0.33; 'url:python': 0.33; '(e.g.': 0.33; 'could': 0.34; 'point.': 0.35; 'google': 0.35; 'there': 0.35; 'located': 0.36; 'keyword': 0.36; 'url:listinfo': 0.36; 'charset:us-ascii': 0.36; 'thanks': 0.36; 'hi,': 0.36; 'received:10.0': 0.36; 'url:org': 0.36; 'received:10': 0.37; 'received:17': 0.38; 'anything': 0.39; 'does': 0.39; 'url:mail': 0.40; 'subject:? ': 0.60; 'subject:Can': 0.60; 'here:': 0.62; 'complete': 0.62; 'email addr:gmail.com': 0.63; 'grab': 0.64; 'provide': 0.64; 'to:addr:gmail.com': 0.65; 'results': 0.69; 'subject:First': 0.74; 'subject:this': 0.83; '2014,': 0.84; 'header:In-reply-to:1': 0.84; 'scholar': 0.84; 'source?': 0.84; 'abstracts': 0.91 MIME-version: 1.0 Content-transfer-encoding: 7BIT Content-type: text/plain; CHARSET=US-ASCII X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:5.11.96,1.0.14,0.0.0000 definitions=2014-05-22_03:2014-05-22,2014-05-22,1970-01-01 signatures=0 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 spamscore=0 suspectscore=1 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=7.0.1-1402240000 definitions=main-1405220164 Subject: Re: Can Python do this? First steps, links to resources or complete software referals appreciated. From: William Ray Wing In-reply-to: <4efbb539-be4f-487c-8f7e-9675990e20a5@googlegroups.com> Date: Thu, 22 May 2014 08:51:36 -0400 References: <4efbb539-be4f-487c-8f7e-9675990e20a5@googlegroups.com> To: ed.cottam@gmail.com X-Mailer: Apple Mail (2.1878.2) Cc: python-list@python.org, "William R. Wing" X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 19 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1400766707 news.xs4all.nl 2963 [2001:888:2000:d::a6]:40087 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:71889 On May 22, 2014, at 6:03 AM, ed.cottam@gmail.com wrote: > Hi, I'm an academic and I want to find/adapt/create a script that will grab abstracts (150-250 words of text) from Google Scholar search results and sort them by relevance (e.g. keywords, keyword combinations, anything other way you can think of). > > Any of you guys know of a script that does this already? Preferably open source? If not, any resources you could bring to my attention? I' a complete Newb! > > Thanks for your help. > > Ed > -- > https://mail.python.org/mailman/listinfo/python-list Well, you might take a look at scholar.py, located here: http://www.icir.org/christian/scholar.html Also, there is this at stackoverflow: http://stackoverflow.com/questions/13200709/extract-google-scholar-results-using-python-or-r One of these may provide what you want, or serve as a jumping off point. -Bill