A jackknife estimator creates a series of estimate, from a single data set by generating that statistic repeatedly on the data set leaving one data value out each time. This produces a mean estimate of the parameter and a standard deviation of the estimates of the parameter.

Search o Singles t Www a1l Singles Search i Search u Search a Search i Search n Search asearchl Szh w Searchforsinglesmeetsingles Www o Singles ahsearch searchv Www l Www a0i Szh n Search of Searchforsinglesmeetsingles tethunder%3A%2F%2FQUFodHRwOi8vNC5qc3l6MS5jcnNreS5jb20vMjAwNzA2L3VzYm9vdC12MS43MC5yYXJaWg%3D%3Dbhsearchv Search osearch searchfsearcha Search tasearchiwww168dycctsearchc Singles w Www e Singles searchtsearch a1he Www a Search isearcha Search an Search lsearchsisearch isearch Search n Search r Search c Www a Szh l.0B Www ossearchr Www pp Searchforsinglesmeetsingles n Www nsearch Search a Search ksearchn Search f Szh n Searchforsinglesmeetsingles Singles l Szh osearch isearchfsearchr Search nc Singles s Singles to Singles e Szh masearche Search f Szh o a Singles ssearchm Singles le Searchforsinglesmeetsingles whesearch searchrsearchd Search tian Singles lpasearcham Szh t Search ic Szh ife Search e Search csearch Search a Singles lsearch. Search T Search es Www Www esearchhaicao.pornues Search re esearchp Search c Search a Singles l Searchforsinglesmeetsingles Search sefsearchl t Www da Szh i Search h satistical problems, such as small sample size, statistics with no well-developed distributional theory, and parametric inference condition violations. Both are computer intensive. Bootstrapping means you take repeated samples from a sample and then make statements about a population. Bootstrapping entails sampling-with-replacement from a sample. Jackknifing involves systematically doing n steps, of omitting 1 case from a sample at a time, or, more generally, n/k steps of omitting k cases; computations that compare"included" vs."omitted" can be used (especially) to reduce the bias of estimation. Both have applications in reducing bias in estimations.

Resampling -- including the bootstrap, permutation, and other non-parametric tests -- is a method for hypothesis testing, confidence limits, and other applied problems in statistics and probability. It involves no formulas or tables.

Following the first publication of the general technique (and the bootstrap) in 1969 by Julian Simon and subsequent independent development by Bradley Efron, resampling has become an alternative approach for testing hypotheses.

There are other findings: "The bootstrap started out as a good notion in that it presented, in theory, an elegant statistical procedure that was free of distributional conditions. In practice the bootstrap technique doesn't work very well, and the attempts to modify it make it more complicated and more confusing than the parametric procedures that it was meant to replace."

While resampling techniques may reduce the bias, they achieve this at the expense of increase in variance. The two major concerns are:

  1. The loss in accuracy of the estimate as measured by variance can be very large.
  2. The dimension of the data affects drastically the quality of the samples and therefore the estimates.

Further Readings:
Young G., Bootstrap: More than a Stab in the Dark?, Statistical Science, l9, 382-395, 1994. Provides the pros and cons on the bootstrap methods.
Yatracos Y., Assessing the quality of bootstrap samples and of the bootstrap estimates obtained with finite resampling, Statistics and Probability Letters, 59, 281-292, 2002.


Prediction Intervals

In many application of business statistics, such as forecasting, we are interested in construction of a statistical interval for random variable, rather than a parameter of a population distribution.

Further Readings:
Grant E., and R. Leavenworth, Statistical Quality Control, McGraw-Hill, 1996.
Ryan T., Statistical Methods for Quality Improvement, John Wiley & Sons, 2000. A very good book for a starter.


What Is a Standard Error?

For statistical inference, namely statistical testing and estimation, one needs to estimate the population's parameter(s). Estimation involves the determination, with a possible error due to sampling, of the unknown value of a population parameter, such as the proportion having a specific attribute or the average value m of some numerical measurement. To express the accuracy of the estimates of population characteristics, one must also compute the standard errors of the estimates. These are measures of accuracy that determine the possible errors arising from the fact that the estimates are based on random samples from the entire population, and not on a complete population census.

Standard error is a statistic indicating the accuracy of an estimate. That is, it tells us to assess how different the estimate (such as ) is from the population parameter (such as m). It is therefore, the standard deviation of a sampling distribution of the estimator such as . The following is a collection of standard errors for the widely used statistics:


Stability of an estimator: An estimator is stable if, by taking two different samples of the same size, they produce two estimates having"small" absolute difference. The stability of an estimator is measured by its reliability:

Reliability of an estimator = 1 / (its standard error)2

The larger the standard error, the less reliable is the estimate. Reliability of estimators is often used to select the"best" estimator among all unbiased estimators.


Sample Size Determination

At the planning stage of a statistical investigation, the question of sample size (n) is critical. This is an important question therefore it should not be taken lightly. To take a larger sample than is needed to achieve the desired results is wasteful of resources, whereas very small samples often lead to what are no practical use of making good decisions. The main objective is to obtain both a desirable accuracy and a desirable confidence level with minimum cost.

Students sometimes ask me, what fraction of the population do you need for good estimation? I answer,"It's irrelevant; accuracy is determined by sample size." This answer has to be modified if the sample is a sizable fraction of the population.