Groups | Search | Server Info | Keyboard shortcuts | Login | Register [http] [https] [nntp] [nntps]
Groups > sci.stat.math > #10743
| Newsgroups | sci.stat.math |
|---|---|
| Date | 2021-10-08 12:04 -0700 |
| References | <ffd48125-4014-47c2-b057-ec16a61f2906n@googlegroups.com> <nahmlgd0uptm4sq84taobiautommb556gv@4ax.com> <1c284cb3-f20e-48d1-872a-7ec2d5fb9e66n@googlegroups.com> <88rplg924778lsd73ng6kk2km9gousdsil@4ax.com> |
| Message-ID | <69d5a7df-1b43-48c2-aef1-eff44e8a0cf8n@googlegroups.com> (permalink) |
| Subject | Re: unemployment stats |
| From | RichD <r_delaney2001@yahoo.com> |
On October 5, Rich Ulrich wrote: >>>> Given a population of unemployed persons, i.e. names >>>> and phone numbers. You wish to construct a histogram >>>> of # of persons vs. time (# of days) out of work. >>>> Stats 101, right? >>>> Call some random subset of the list, ask them: when >>>> were you laid off? Assuming the sample is unbiased, >>>> it will satisfy the conditions. >>>> No, this method is flawed. Because the person out of >>>> work a long time, has a greater chance of receiving >>>> multiple calls (or at least one call) than one who is >>>> shortly re-employed. This biases the sample, skews >>>> the numbers on the long side. > >>> Well, the number represents what it represents. >>> It is only a mis-report of you mis-report it. > >>>> Therefore, officially published statistics are unreliable. > >>> I think you mean "invalid". And you are wrong, mainly. >>> Technically, in statistics, we have both "reliability" and >>> "validity"... good validity says that it measures >>> what it purports to measure. You should complain about >>> validity: the statistics imply something untrue. > >>Given the goal of the study, is the objection mentioned above, justified? >>i.e. is the methodology flawed? > > My position is that you can collect and report information for > any numbers that might be interesting. > The initial problem is, "Where do these data come from?" - That > might put hard limits on what you can infer. > You are jumping ahead to "bad inference." Showing a > histogram of a cross-section of a stated population (sample) > is not "drawing an inference." > Assuming a simplified, instantaneous cross-sectional sample from > that population, you might use your observations above, > about the implicit weighting, to compute a weighted mean -- > Each person would be weighted by their TIME (as the > "probability of being sampled") and you compute that weighted > mean... as an estimate of ... hmm. The goal isn't to estimate the chance a person might receive a call. The goal is to estimate the distribution of population vs. time unemployed, given a histogram of samples of the unemployed. Then, perhaps, one might predict, probabilistically, how much time a newly unemployed will require to find new work. Intuitively, the distribution should match the sample histogram. That's the desired inference. Very simple. Given all that, review the objection mentioned above: those longer unemployed, will have a greater chance of getting a call. Therefore, the methodology is flawed; the sample isn't unbiased. I have an ulterior motive for posting this - -- Rich
Back to sci.stat.math | Previous | Next — Previous in thread | Next in thread | Find similar
unemployment stats RichD <r_delaney2001@yahoo.com> - 2021-10-03 16:35 -0700
Re: unemployment stats Rich Ulrich <rich.ulrich@comcast.net> - 2021-10-04 14:35 -0400
Re: unemployment stats RichD <r_delaney2001@yahoo.com> - 2021-10-04 19:37 -0700
Re: unemployment stats Rich Ulrich <rich.ulrich@comcast.net> - 2021-10-05 20:40 -0400
Re: unemployment stats RichD <r_delaney2001@yahoo.com> - 2021-10-08 12:04 -0700
Re: unemployment stats RichD <r_delaney2001@yahoo.com> - 2021-10-08 12:10 -0700
Re: unemployment stats Rich Ulrich <rich.ulrich@comcast.net> - 2021-10-09 19:10 -0400
Re: unemployment stats RichD <r_delaney2001@yahoo.com> - 2021-10-23 17:18 -0700
Re: unemployment stats Rich Ulrich <rich.ulrich@comcast.net> - 2021-10-24 13:39 -0400
csiph-web