Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]


Groups > comp.lang.python > #27703

Data cleaning workouts

Date 2012-08-23 00:52 -0700
From Fg Nu <fgnu32@yahoo.com>
Subject Data cleaning workouts
Newsgroups comp.lang.python
Message-ID <mailman.3697.1345708456.4697.python-list@python.org> (permalink)

Show all headers | View raw


List folk,

I am a newbie trying to get used to Python. I was wondering if anyone knows of web resources that teach good practices in data cleaning and management for statistics/analytics/machine learning, particularly using Python.

Ideally, these would be exercises of the form: here is some horrible raw data --> here is what it should look like after it has been cleaned. Guidelines about steps that should always be taken, practices that should be avoided; basically, workflow of data analysis in Python with special emphasis on the cleaning part.

Back to comp.lang.python | Previous | NextNext in thread | Find similar | Unroll thread


Thread

Data cleaning workouts Fg Nu <fgnu32@yahoo.com> - 2012-08-23 00:52 -0700
  Re: Data cleaning workouts rusi <rustompmody@gmail.com> - 2012-08-23 23:01 -0700
    Re: Data cleaning workouts Fg Nu <fgnu32@yahoo.com> - 2012-08-23 23:48 -0700
    Re: Data cleaning workouts Mark Lawrence <breamoreboy@yahoo.co.uk> - 2012-08-24 09:16 +0100

csiph-web