Path: csiph.com!v102.xanadu-bbs.net!xanadu-bbs.net!feeder.erje.net!eu.feeder.erje.net!xlned.com!feeder3.xlned.com!newsfeed.xs4all.nl!newsfeed1.news.xs4all.nl!xs4all!newsgate.cistron.nl!newsgate.news.xs4all.nl!post.news.xs4all.nl!not-for-mail Return-Path: X-Original-To: python-list@python.org Delivered-To: python-list@mail.python.org X-Spam-Status: OK 0.006 X-Spam-Evidence: '*H*': 0.99; '*S*': 0.00; 'duplicate': 0.07; 'definition,': 0.09; 'received:80.91': 0.09; 'received:80.91.229': 0.09; 'received:gmane.org': 0.09; 'received:list': 0.09; 'random': 0.14; 'times,': 0.14; '4921': 0.16; 'message-id:@4ax.com': 0.16; 'pairs': 0.16; 'pairs,': 0.16; 'received:80.91.229.3': 0.16; 'received:plane.gmane.org': 0.16; 'subject:generation': 0.16; 'subject:random': 0.16; 'unlikely': 0.16; 'aug': 0.22; 'finally,': 0.24; 'url:home': 0.24; 'connected': 0.24; 'header:X-Complaints- To:1': 0.27; 'have,': 0.30; 'matching': 0.30; 'node': 0.31; 'figure': 0.32; 'supposed': 0.32; 'cases': 0.33; "i'd": 0.34; 'really': 0.36; '+0200,': 0.36; 'data,': 0.36; 'subject:data': 0.36; 'charset:us-ascii': 0.36; 'seconds': 0.37; 'to:addr:python- list': 0.38; 'to:addr:python.org': 0.39; 'enough': 0.39; 'received:org': 0.40; 'times': 0.62; 'refer': 0.63; 'different': 0.65; 'paper': 0.75; 'subject:Network': 0.84; 'subject:World': 0.91; 'received:108': 0.93 X-Injected-Via-Gmane: http://gmane.org/ To: python-list@python.org From: Dennis Lee Bieber Subject: Re: Small World Network model random data generation Date: Tue, 26 Aug 2014 09:23:18 -0400 Organization: IISS Elusive Unicorn References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Gmane-NNTP-Posting-Host: adsl-108-79-219-144.dsl.klmzmi.sbcglobal.net X-Newsreader: Forte Agent 6.00/32.1186 X-No-Archive: YES X-BeenThere: python-list@python.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: General discussion list for the Python programming language List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.python Message-ID: Lines: 53 NNTP-Posting-Host: 2001:888:2000:d::a6 X-Trace: 1409059394 news.xs4all.nl 2964 [2001:888:2000:d::a6]:39839 X-Complaints-To: abuse@xs4all.nl Xref: csiph.com comp.lang.python:77039 On Tue, 26 Aug 2014 12:16:33 +0200, lavanya addepalli declaimed the following: >How can i generate a random data that is identical to my realworld data > By definition, "random data" will be unlikely to ever be "identical" to your "realworld data". >i am supposed to refer the attached paper > >Real Data > >node pairs and the time they spend together connected > >node node time in seconds >4391 2814 16.0 >4945 3545 386.0 >5045 4921 63078.0 >5045 3545 896.0 >4921 3545 896.0 >5045 1683 1108.0 >4921 1683 1108.0 >5454 4391 4161.0 >5454 5070 2755.0 >5070 4391 2935.0 >1158 305 1.0 >5045 3140 48767.0 >4921 3140 58405.0 >5372 2684 460.0 >1885 1158 351.0 >1349 1174 6375.0 > Since I see no cases of duplicate node /pairs/ it is difficult to figure out just what that data really represents... With enough data, with duplicate pairs having different times, I'd likely group by pairs, generate mean and standard deviation for the times of the matching pairs, then generate some count of the pairs to develop weights... Finally, using the weights I'd attempt to generate random node pairs and then use the mean/SD of the result pair to generate a time from the gaussian distribution. With only the data you have, I'd end up with a sparse 2D matrix M[first_node, second_node] = time And then selecting random samples from that... -- Wulfraed Dennis Lee Bieber AF6VN wlfraed@ix.netcom.com HTTP://wlfraed.home.netcom.com/