Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #104381

Review Request of Python Code

Newsgroups comp.lang.python
Date 2016-03-08 20:18 -0800
Message-ID <f0973a0d-62ba-402b-ab23-cb68bdd15323@googlegroups.com> (permalink)
Subject Review Request of Python Code
From subhabangalore@gmail.com

Show all headers | View raw


Dear Group,

I am trying to write a code for pulling data from MySQL at the backend and annotating words and trying to put the results as separated sentences with each line. The code is generally running fine but I am feeling it may be better in the end of giving out sentences, and for small data sets it is okay but with 50,000 news articles it is performing dead slow. I am using Python2.7.11 on Windows 7 with 8GB RAM. 

I am trying to copy the code here, for your kind review. 

import MySQLdb
import nltk
def sql_connect_NewTest1():
    db = MySQLdb.connect(host="localhost",
                     user="*****",         
                     passwd="*****",  
                     db="abcd_efgh")
    cur = db.cursor()
    #cur.execute("SELECT * FROM newsinput limit 0,50000;") #REPORTING RUNTIME ERROR
    cur.execute("SELECT * FROM newsinput limit 0,50;")
    dict_open=open("/python27/NewTotalTag.txt","r") #OPENING THE DICTIONARY FILE 
    dict_read=dict_open.read() 
    dict_word=dict_read.split()
    a4=dict_word #Assignment for code. 
    list1=[]
    flist1=[]
    nlist=[]
    for row in cur.fetchall():
        #print row[2]
        var1=row[3]
        #print var1 #Printing lines
        #var2=len(var1) # Length of file
        var3=var1.split(".") #SPLITTING INTO LINES
        #print var3 #Printing The Lines 
        #list1.append(var1)
        var4=len(var3) #Number of all lines
        #print "No",var4
        for line in var3:
            #print line
            #flist1.append(line)
            linew=line.split()
            for word in linew:
                if word in a4:
                    windex=a4.index(word)
                    windex1=windex+1
                    word1=a4[windex1]
                    word2=word+"/"+word1
                    nlist.append(word2)
                    #print list1
                    #print nlist
                elif word not in a4:
                    word3=word+"/"+"NA"
                    nlist.append(word3)
                    #print list1
                    #print nlist
                else:
                    print "None"
            
    #print "###",flist1
    #print len(flist1)
    #db.close()
    #print nlist
    lol = lambda lst, sz: [lst[i:i+sz] for i in range(0, len(lst), sz)] #TRYING TO SPLIT THE RESULTS AS SENTENCES 
    nlist1=lol(nlist,7)
    #print nlist1
    for i in nlist1:
        string1=" ".join(i)
        print i
        #print string1
    
   
Thanks in Advance.
        
        
    
    
    
    

Back to comp.lang.python | Previous | NextNext in thread | Find similar | Unroll thread


Thread

Review Request of Python Code subhabangalore@gmail.com - 2016-03-08 20:18 -0800
  Re: Review Request of Python Code Steven D'Aprano <steve+comp.lang.python@pearwood.info> - 2016-03-09 16:10 +1100
  Re: Review Request of Python Code INADA Naoki <songofacandy@gmail.com> - 2016-03-09 16:52 +0900
  Re: Review Request of Python Code Friedrich Rentsch <anthra.norell@bluewin.ch> - 2016-03-09 10:06 +0100
  Re: Review Request of Python Code Matt Wheeler <m@funkyhat.org> - 2016-03-09 12:06 +0000
  Re: Review Request of Python Code Matt Wheeler <m@funkyhat.org> - 2016-03-09 12:33 +0000
  Re: Review Request of Python Code subhabangalore@gmail.com - 2016-03-10 10:12 -0800
    Re: Review Request of Python Code BartC <bc@freeuk.com> - 2016-03-10 18:36 +0000
    Re: Review Request of Python Code Matt Wheeler <m@funkyhat.org> - 2016-03-10 18:51 +0000
      Re: Review Request of Python Code subhabangalore@gmail.com - 2016-03-10 12:14 -0800
    RE: Review Request of Python Code Joaquin Alzola <Joaquin.Alzola@lebara.com> - 2016-03-10 19:12 +0000
  Re: Review Request of Python Code Mark Lawrence <breamoreboy@yahoo.co.uk> - 2016-03-10 19:56 +0000

csiph-web