Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #92696

Re: python financial data cleaning

Path csiph.com!usenet.pasdenom.info!weretis.net!feeder1.news.weretis.net!feeder4.news.weretis.net!feeds.phibee-telecom.net!newsfeed.xs4all.nl!newsfeed1a.news.xs4all.nl!xs4all!post.news.xs4all.nl!not-for-mail
Return-Path <python-python-list@m.gmane.org>
X-Original-To python-list@python.org
Delivered-To python-list@mail.python.org
X-Spam-Status OK 0.015
X-Spam-Evidence '*H*': 0.97; '*S*': 0.00; 'from:addr:yahoo.co.uk': 0.05; 'data:': 0.07; 'received:80.91': 0.09; 'received:80.91.229': 0.09; 'received:gmane.org': 0.09; 'received:list': 0.09; 'python': 0.11; 'assume': 0.11; 'subject:python': 0.14; 'data)': 0.16; 'mark,': 0.16; 'mylist': 0.16; 'nan': 0.16; 'received:80.91.229.3': 0.16; 'received:plane.gmane.org': 0.16; 'sebastian': 0.16; 'wrote:': 0.16; 'language': 0.19; '>>>': 0.20; 'lawrence': 0.22; '2015': 0.23; 'header:In-Reply-To:1': 0.24; 'header:User-Agent:1': 0.26; 'header:X-Complaints-To:1': 0.26; 'data,': 0.27; "i'm": 0.29; 'excel': 0.29; '15,': 0.31; 'implement': 0.32; 'skip:p 30': 0.32; 'language.': 0.32; 'open': 0.33; 'previous': 0.34; 'could': 0.35; 'to:addr:python-list': 0.35; 'knowledge': 0.35; 'list': 0.35; 'but': 0.36; 'url:org': 0.36; 'thanks': 0.36; 'monday,': 0.36; 'subject:: ': 0.37; 'missing': 0.37; 'skip:g 20': 0.37; 'received:org': 0.38; 'say': 0.38; 'to:addr:python.org': 0.39; 'data': 0.40; 'mark': 0.40; 'close': 0.61; 'skip:n 10': 0.63; 'our': 0.64; 'series': 0.65; 'charset:windows-1252': 0.65; 'python-list': 0.66; 'price': 0.68; 'finance': 0.74; 'low': 0.83; 'cheung': 0.84; 'high,': 0.84; 'open,': 0.84; 'pythonistas,': 0.84
X-Injected-Via-Gmane http://gmane.org/
To python-list@python.org
From Mark Lawrence <breamoreboy@yahoo.co.uk>
Subject Re: python financial data cleaning
Date Tue, 16 Jun 2015 20:40:13 +0100
References <b0cbc75c-cc0b-4f27-a8d6-e4c20c356e6d@googlegroups.com> <mailman.480.1434363576.13271.python-list@python.org> <55a0ccda-572c-4324-9169-08028d4fb619@googlegroups.com>
Mime-Version 1.0
Content-Type text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding 7bit
X-Gmane-NNTP-Posting-Host host-78-146-5-125.as13285.net
User-Agent Mozilla/5.0 (Windows NT 6.3; WOW64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0
In-Reply-To <55a0ccda-572c-4324-9169-08028d4fb619@googlegroups.com>
X-BeenThere python-list@python.org
X-Mailman-Version 2.1.20+
Precedence list
List-Id General discussion list for the Python programming language <python-list.python.org>
List-Unsubscribe <https://mail.python.org/mailman/options/python-list>, <mailto:python-list-request@python.org?subject=unsubscribe>
List-Archive <http://mail.python.org/pipermail/python-list/>
List-Post <mailto:python-list@python.org>
List-Help <mailto:python-list-request@python.org?subject=help>
List-Subscribe <https://mail.python.org/mailman/listinfo/python-list>, <mailto:python-list-request@python.org?subject=subscribe>
Newsgroups comp.lang.python
Message-ID <mailman.525.1434483634.13271.python-list@python.org> (permalink)
Lines 37
NNTP-Posting-Host 2001:888:2000:d::a6
X-Trace 1434483634 news.xs4all.nl 2911 [2001:888:2000:d::a6]:53674
X-Complaints-To abuse@xs4all.nl
Xref csiph.com comp.lang.python:92696

Show key headers only | View raw


On 15/06/2015 22:01, Sebastian M Cheung via Python-list wrote:
> On Monday, June 15, 2015 at 11:19:48 AM UTC+1, Mark Lawrence wrote:
>> On 15/06/2015 11:12, Sebastian M Cheung via Python-list wrote:
>>> How to do financial data cleaning ? Say I assume a list of 1000 finance series data in myList = Open, High, Low and Close. For missing Close Price data, What is best practice to clean data in Python
>>>
>>
>> http://pandas.pydata.org/
>>
>> --
>> My fellow Pythonistas, ask not what our language can do for you, ask
>> what you can do for our language.
>>
>> Mark Lawrence
>
>
> Hi Mark,
>
> Below I read in DirtyData (financial data) from Excel and then find the number of NaN missing Closed Pricing data:
>
> xls = pd.ExcelFile('DirtyData.xlsm')
> df = xls.parse('Dirty Data', index_col=None, na_values=['NA'])
> print(df.isnull().astype(int).sum())
>
> So if I were to clean missing Open Price data, I could copy from previous or row's Close Price data, but how would I implement it? Thanks
>

I'm sorry but my knowledge of pandas is limited, I just know it's pretty 
much best of breed.  Try stackoverflow or 
https://groups.google.com/forum/#!forum/pydata which is gated to 
gmane.comp.python.pydata

-- 
My fellow Pythonistas, ask not what our language can do for you, ask
what you can do for our language.

Mark Lawrence

Back to comp.lang.python | Previous | NextPrevious in thread | Next in thread | Find similar | Unroll thread


Thread

python financial data cleaning Sebastian M Cheung <minscheung@googlemail.com> - 2015-06-15 03:12 -0700
  Re: python financial data cleaning Mark Lawrence <breamoreboy@yahoo.co.uk> - 2015-06-15 11:19 +0100
    Re: python financial data cleaning Sebastian M Cheung <minscheung@googlemail.com> - 2015-06-15 14:01 -0700
      Re: python financial data cleaning Mark Lawrence <breamoreboy@yahoo.co.uk> - 2015-06-16 20:40 +0100
  Re: python financial data cleaning Sebastian M Cheung <minscheung@googlemail.com> - 2015-06-15 03:59 -0700
  Re: python financial data cleaning Laura Creighton <lac@openend.se> - 2015-06-15 13:34 +0200
    Re: python financial data cleaning Sebastian M Cheung <minscheung@googlemail.com> - 2015-06-15 06:42 -0700

csiph-web