M·C, IST: estimates & confidence intervals

PE & CI's for μ & σ, optimized

Computes point estimate & CI's from unequal size samples' sums.

2025.Jul.01 07:24:53

Table
of sizes, sums

Table of sample sizes and their sums.
In Amer. or Euro. format. • (To simulate sums…)

Confidence level

Level for the confidence intervals. •

Trials, points, seed

Monte Carlo trials, histogram points, seed. •

Candidate "equal"

av. min max (> 0 overrides previous.)

Equal size equival.: aver. | min | max (in table) | other. •

Scale factor

Verify Optimize

Factor (1.st iff 'Verify'; both (bracket) iff 'Optimize'). •

x-limits for plots

x axis limits for plots (±i for infinity).

Show values

Show the graph coordinates.

Computes (analytically) point estimates and (by Monte Carlo) confidence intervals (CI) for μ and σ (Gaussian items) from sample sums alone.

As a conjecture, the average sample size leads to: a Student's t with n − 1 (nearest integer) degrees of freedom; and a chi-square with T − 1 df, and an adjustable scale factor.

Simulates N (trials) sets of T samples, each with (generally) unequal n_t (t = 1..T) elements (as in the Table above, T = 4), in order to compute “exact” confidence intervals. Many (analytical) point estimates are computed, and their (simulated) behavior leads to the confidence intervals.

Example data can be created (or using this Excel.xlsm, with a UDF, no macros) and inserted (copy-paste) in the Table textarea, above. Other suggested data:
(a) equal size, T = 4 samples (n ≡ 20; total items, 80; average size ≡ 20; scale, 0.0140);
(b) "industrial" unequal 50, T = 50 samples (144 ≤ n ≤ 1296; total items, 43632; average size, 873; scale, 0.002);
(c) "industrial" unequal 25, T = 25 samples (432 ≤ n ≤ 1296; total items, 21672; average size, 867; scale, 0.0032)

In industry, the sample sizes are usually unequal (different customers' orders, often multiples of 72, pallet), so the real (simulated) curve, f (with unequal n), and the candidate, 'cand', curves obviously do not match. An open question is whether a candidate n exists for that purpose (proposed: arithmetic mean of the n s). For the default data, the average n (25) makes the curves for the mean coincide. Same for variance, but a rule for the scale has not been found. (For the basis problem, scale is 0.0135.)

Warning: large data sets may lead to excede a typical (Web) time limit (~2 min) with error message as '502 Bad Gateway'. Speed: ~400 million Gaussians/min.

Draws plots of: the estimated value of μ; and the estimated value of σ².

References:

Plate: BagsPECIopt

• 1.3.6.4. Location and Scale Parameters ("formulas for") (NIST/SEMATECH e-Handbook…).

• Keisan Online Calculator: Chi-square distribution Calculator (Casio Computer Co., Ltd.).

• 1796-06-01: Carnot, Sadi Nicolas Léonard († 1832-08-24, 36 yrs.).

http://web.ist.utl.pt/~mcasquilho/compute/qc/Fx-BagsPECIopt.php
Created: 2022-06-02 — Last modified: 2022-06-04