2021-03-13

区間推定（Interval estimation）って何？

f:id:ts0818:20210313155330j:plain

www.itmedia.co.jp

　英老舗オークションハウスのChristie'sは3月11日（現地時間）、初めて主催したデジタルアートのNFT（Non-Fungible Token、代替不可能なトークン）のオンライン競売で、作品が6934万6250ドル（約75億円）で落札されたと発表した。このオークションは同社にとって、イーサリアムでの支払いを受け入れる初のオークションでもあった。

老舗Christie's初のNFTオークション、デジタルアートが約75億円で落札 - ITmedia NEWS

⇧「芸術」も「デジタル」化が進んでいるんですかね、「芸術は爆発だ！」っていう「岡本太郎」氏の言葉がありましたが、「デジタル」化の「爆発」が促進して止まない今日この頃でしょうか、どうもボクです。

というわけで、「区間推定（Interval estimation）」について調べてみました。

レッツトライ～。

区間推定（Interval estimation）って何？

改めまして、Wikipediaさんにお聞きしてみましょう。

In statistics, interval estimation is the use of sample data to calculate an interval of possible values of an unknown population parameter; this is in contrast to point estimation, which gives a single value. Jerzy Neyman (1937) identified interval estimation ("estimation by interval") as distinct from point estimation ("estimation by unique estimate"). In doing so, he recognized that then-recent work quoting results in the form of an estimate plus-or-minus a standard deviation indicated that interval estimation was actually the problem statisticians really had in mind.

https://en.wikipedia.org/wiki/Interval_estimation

⇧『1937年に、Jerzy Neymanさんという方が「点推定（point estimation）」とは異なるものとして「区間推定（interval estimation）」を識別』とあるので、なかなかに歴史ありですね。

「標本（サンプルデータ）」を使用して、未知の「母集団」の「母数（パラメーター）」が取り得る値の「区間（interval）」を「推定（estimation）」すること、それが「区間推定（Interval estimation）」であると。

その「区間（interval）」については、求める方法によっていろんな呼ばれ方をするっぽいんだけど、最も一般的な形式が、

The most prevalent forms of interval estimation are:

confidence intervals (a frequentist method); and
credible intervals (a Bayesian method).

https://en.wikipedia.org/wiki/Interval_estimation

⇧「confidence intervals (a frequentist method)」「credible intervals (a Bayesian method)」の2つあるらしく、Google翻訳によると、「頻度主義的手法（frequentist method）」「ベイズ主義的手法（Bayesian method）」で求める場合は、「信頼区間（confidence intervals）」「信用区間（credible intervals）」って訳されてますな。

その他にも、

Other forms include:

likelihood intervals (a likelihoodist method); and
fiducial intervals (a fiducial method).

https://en.wikipedia.org/wiki/Interval_estimation

⇧「likelihood intervals (a likelihoodist method)」「fiducial intervals (a fiducial method)」の2つがあり、Google翻訳によると、「尤度主義的手法（likelihoodist method）」「基準手法（fiducial method）」で求める場合は、「尤度間隔（likelihood intervals）」「基準間隔（fiducial intervals）」って訳されるそうな。

そもそもとして「母数（パラメーター）」を使わない場合は、

Other forms of statistical intervals, which do not estimate parameters, include:

tolerance intervals (an estimate of a population); and
prediction intervals (an estimate of a future observation, used mainly in regression analysis).

https://en.wikipedia.org/wiki/Interval_estimation

⇧「tolerance intervals (an estimate of a population)」「prediction intervals (an estimate of a future observation, used mainly in regression analysis)」の2つがあり、「母集団の推定（estimate of a population）」「主に回帰分析で使用される、将来の観測の推定（estimate of a future observation, used mainly in regression analysis）」で求める場合は、「許容間隔（tolerance intervals）」「予測区間（prediction intervals）」って訳されるそうな。

そして、「区間推定（interval estimate）」自体の位置づけはというと、

Non-statistical methods that can lead to interval estimates include fuzzy logic. An interval estimate is one type of outcome of a statistical analysis. Some other types of outcome are point estimates and decisions.

https://en.wikipedia.org/wiki/Interval_estimation

⇧ 上記の説明によりますと、

非統計的手法（Non-statistical methods）
- ファジー論理を含む区間推定（interval estimates include fuzzy logic）
統計的分析（statistical analysis）
- 区間推定（interval estimate）
- 点推定（point estimates）
- 決定（decisions）

ってな感じで、「区間推定（interval estimate）」は、「統計的分析（statistical analysis）」にグルーピングされるらしいですが、「ファジー論理を含む区間推定（interval estimates include fuzzy logic）」ってのが「非統計的手法（Non-statistical methods）」として存在するっぽい、という感じなので、「区間推定（interval estimate）」って言ったら、「統計的分析（statistical analysis）」って考えれば良いのかな？

まぁ、何が言いたいかというと、「区間推定（interval estimate）」っていうのは、

頻度主義的手法（frequentist method）
confidence intervals
ベイズ主義的手法（Bayesian method）
credible intervals
尤度主義的手法（likelihoodist method）
likelihood intervals
基準手法（fiducial method）
fiducial intervals
母集団の推定（estimate of a population）
tolerance intervals
主に回帰分析で使用される、将来の観測の推定（estimate of a future observation, used mainly in regression analysis）
prediction intervals

で「区間（interval）」の名前とかが変わってくるっぽいですと。

「頻度主義的手法（frequentist method）」の「信頼区間（confidence intervals）」と「ベイズ主義的手法（Bayesian method）」の「信用区間（credible intervals）」

「区間推定（interval estimate）」の最も一般的な形式が「信頼区間（confidence intervals）」「信用区間（credible intervals）」ということみたいなので、どういうものなのか確認してみますか。

ここで、悲報。

Wikipediaさんの記事の信用性が保証されないぃぃぃぃぃ～！

■頻度主義的手法（frequentist method）

信頼区間（confidence intervals）

This article needs attention from an expert in statistics. The specific problem is: Many reverts and fixes indicate the language of the article needs to be checked carefully. WikiProject Statistics may be able to help recruit an expert. (November 2018)

https://en.wikipedia.org/wiki/Confidence_interval

This article may require cleanup to meet Wikipedia's quality standards. The specific problem is: Prose is confusing, cluttered, and I'm not sure about the accuracy of some things Please help improve this article if you can. (September 2020) (Learn how and when to remove this template message)

https://en.wikipedia.org/wiki/Confidence_interval

⇧「信頼区間（confidence intervals）」については、記述されてることのいくつかに確信性が持てません、だそうです...

Oh, my gosh...

一方、「信用区間（credible intervals）」には、特に注意書きの記載はないっぽい。

Wikipediaさん、信じて良いんだよね？

■ベイズ主義的手法（Bayesian method）

信用区間（credible intervals）

In Bayesian statistics, a credible interval is an interval within which an unobserved parameter value falls with a particular probability. It is an interval in the domain of a posterior probability distribution or a predictive distribution.

https://en.wikipedia.org/wiki/Credible_interval

The generalisation to multivariate problems is the credible region. Credible intervals are analogous to confidence intervals in frequentist statistics, although they differ on a philosophical basis: Bayesian intervals treat their bounds as fixed and the estimated parameter as a random variable, whereas frequentist confidence intervals treat their bounds as random variables and the parameter as a fixed value. Also, Bayesian credible intervals use (and indeed, require) knowledge of the situation-specific prior distribution, while the frequentist confidence intervals do not.

https://en.wikipedia.org/wiki/Credible_interval

For example, in an experiment that determines the distribution of possible values of the parameter $\mu$ , if the subjective probability that $\mu$ lies between 35 and 45 is 0.95, then $35\leq \mu \leq 45$ is a 95% credible interval.

https://en.wikipedia.org/wiki/Credible_interval

⇧ 上記の赤線を引いた部分をGoogle翻訳したところ、

【ベイズ主義的手法（Bayesian method）】信用区間（credible intervals）
- 境界を固定変数として扱い、推定パラメーターを確率変数として扱う。
- 状況固有の事前分布の知識を使用します（実際、必要とします）。
【頻度主義的手法（frequentist method）】信頼区間（confidence intervals）
- 境界を確率変数として扱い、パラメーターを固定値として扱う。
- 状況固有の事前分布の知識を使用しません。

って違いになるらしい。

「推定パラメーター」ってのは原文だと「estimated parameter」ってなってて、何かしらの「分布」の「母数（パラメーター）」って考えて良いのかが分からん...

「推定論」によると、

"Parameter estimation" redirects here. It is not to be confused with Point estimation or Interval estimation.

https://en.wikipedia.org/wiki/Estimation_theory

⇧「Parameter estimation」についてのページが「Estimation theory」ってことらしいんで、「estimated parameter」のことも載ってるのかな？

ページを見てみた感じ、

Estimation theory is a branch of statistics that deals with estimating the values of parameters based on measured empirical data that has a random component. The parameters describe an underlying physical setting in such a way that their value affects the distribution of the measured data. An estimator attempts to approximate the unknown parameters using the measurements.

https://en.wikipedia.org/wiki/Estimation_theory

In estimation theory, two approaches are generally considered.

The probabilistic approach (described in this article) assumes that the measured data is random with probability distribution dependent on the parameters of interest
The set-membership approach assumes that the measured data vector belongs to a set which depends on the parameter vector.

https://en.wikipedia.org/wiki/Estimation_theory

⇧ ってな感じで、2つアプローチがあるって言ってるんだけど、「estimated parameter（推定パラメーター）」については分からず...

もうちょい読み進めると、

For a given model, several statistical "ingredients" are needed so the estimator can be implemented. The first is a statistical sample – a set of data points taken from a random vector (RV) of size N. Put into a vector,

\mathbf {x} ={\begin{bmatrix}x[0]\\x[1]\\\vdots \\x[N-1]\end{bmatrix}}.

Secondly, there are M parameters

\mathbf {\theta } ={\begin{bmatrix}\theta _{1}\\\theta _{2}\\\vdots \\\theta _{M}\end{bmatrix}},

https://en.wikipedia.org/wiki/Estimation_theory

whose values are to be estimated. Third, the continuous probability density function (pdf) or its discrete counterpart, the probability mass function (pmf), of the underlying distribution that generated the data must be stated conditional on the values of the parameters:

p(\mathbf {x} |\mathbf {\theta } ).\,

It is also possible for the parameters themselves to have a probability distribution (e.g., Bayesian statistics). It is then necessary to define the Bayesian probability

\pi (\mathbf {\theta } ).\,

https://en.wikipedia.org/wiki/Estimation_theory

⇧ ってあるので、「estimated parameter（推定パラメーター）」ってのは、何かしらの「分布」の「母数（パラメーター）」って考えて良さそうですかね？

「信頼区間（confidence intervals）」と「信用区間（credible intervals）」の違いについては、

ysk24ok.github.io

⇧ 上記サイト様がまとめてくださっています。

「頻度主義的手法（frequentist method）」の「信頼区間（confidence intervals）」で「区間推定（interval estimate）」について確認

残念ながら、Wikipediaさんの情報の信頼性が高くないみたいなのですが、かといって私自身にアカデミックなバックボーンがあるわけでもないので、「論文」とかで正確な情報を見つける術とかも持ち合わせてないし、致し方ないですが、Wikipediaさんに頼ることにしますかね。（こういうところから「情報格差」が始まって来るよね～）

信頼区間（しんらいくかん、英: Confidence interval, CI）とは、統計学で母集団の真の値が含まれることが、かなり確信 (confident) できる数値範囲のことである。例えば95%CIとは、この範囲に母集団の値が存在すると、95%確信できることを意味する。

信頼区間 - Wikipedia

信頼区間とは、母数空間 Θ 上の関数 g : Θ → R が母数 θ ∈ Θ でとる値 g(θ) を統計的に推定するために用いられる区間。実数 0 < α < 1 と（観測できない）母数 θ により定まる確率分布 P = P_θ をもつ母集団からの標本 X₁, …, X_n に関する統計量 a, b が不等式

P(a\leq g(\theta )\leq b)\geq 1-\alpha

を満たすとき、閉区間 [a, b] を g(θ) の 100(1 − α)% 信頼区間という。

信頼区間 - Wikipedia

値 1 − α（または 100(1 − α)%）は、信頼水準（英: confidence level）または信頼係数（英: confidence coefficient）と呼ばれ、慣習的には95%や99%（つまり α = 0.05, 0.01）などの数値を用いる。これを

100(1 − α)% CI [a, b]

と表記することもある。

信頼区間 - Wikipedia

⇧ まぁ、「論理分布」を仮に決めてあげて、「母数（パラメーター）」で表現できる関数の値が、「区間（interval）」に収まるかどうかってのが、「頻度主義的手法（frequentist method）」の「区間推定（interval estimate）」ってことだとして、この「区間（interval）」のことを「信頼区間（confidence intervals）」って言うそうな。

ここで、衝撃なのが、

具体例

$X 1, \dots, X n$ を、平均 $μ$ 、分散 $σ 2 > 0$ の正規分布に従う母集団から抽出した独立な標本とする。そこで標本平均と不偏分散をそれぞれ

{\begin{aligned}{\overline {X}}&={\frac {1}{n}}\textstyle \sum \limits _{i=1}^{n}X_{i}\\S^{2}&={\frac {1}{n-1}}\textstyle \sum \limits _{i=1}^{n}(X_{i}-{\overline {X}})^{2}\end{aligned}}

とおけば

T={\frac {{\overline {X}}-\mu }{S/{\sqrt {n}}}}

は自由度 n − 1 のt分布に従う。

ここで T が従う分布は（観測できない）母数 θ = (μ, σ²) にはよらないことに注意。

信頼区間 - Wikipedia

t_n−1(α) をこの分布の上側100α%点とすれば

P{\big (}-t_{n-1}(\alpha /2)\leq T\leq t_{n-1}(\alpha /2){\big )}=1-\alpha

となる。したがって

P\left({\overline {X}}-t_{n-1}(\alpha /2){\frac {S}{\sqrt {n}}}\leq \mu \leq {\overline {X}}+t_{n-1}(\alpha /2){\frac {S}{\sqrt {n}}}\right)=1-\alpha

が成り立ち、平均 g(θ) = μ の 100(1 − α)% 信頼区間

\left[{\overline {X}}-t_{n-1}(\alpha /2){\frac {S}{\sqrt {n}}},\quad {\overline {X}}+t_{n-1}(\alpha /2){\frac {S}{\sqrt {n}}}\right]

が得られる。

信頼区間 - Wikipedia

f:id:ts0818:20210313142707p:plain

信頼区間 - Wikipedia

⇧「信頼区間（confidence intervals）」の具体例で、思いっきり「仮説検定」の「t検定」をブッコんで来てるところね。

いや、間違ってないとは思うんですけど、私自身アカデミックなバックボーンがあるわけでもないので何とも判断が付かん...。

ただ、Wikipediaさんの分類がね、

f:id:ts0818:20210313143027p:plain

信頼区間 - Wikipedia

⇧ どう見ても、

推測統計学
- 仮説検定
  t検定
- 区間推定
  信頼区間（confidence intervals）

ってな感じで、「仮説検定」と「区間推定」って別々の方法で何かするんかな～みたいに見える感じで仕分けされてるし、そもそも「区間推定（interval estimate）」の説明の中で「信頼区間（confidence intervals）」ってのが出てきたのに、「区間推定（interval estimate）」での具体例を示さずに、「仮説検定」の「t検定」を具体例とする、その心は？

それとも、あれかな、

推計統計学は、頻度主義に基づいたものとベイズ主義に基づいたものに分けられる。

頻度主義における統計学的推論は、母集団を規定する量＝パラメータ（母数）を既定の固定値としてそれを推定するという方法（パラメトリック推定）に基いて発展してきた。基礎的なパラメトリック推定における統計学的推測は、以下のように細分される。

点推定: ${\hat {\theta }}=\arg \max _{\theta }Estimator(\theta )$
区間推定
仮説検定

最近は、不確実性を確率分布として表現するベイズ統計学が注目されている。

推計統計学 - Wikipedia

⇧「点推定」以外は「信頼区間（confidence intervals）」を使うことが当たり前なんだよ、的な暗黙なルールが存在するってことなのかね？

どうでもいいけど、「ご当地ルール」みたいな感じで、内輪だけで共有されてるみたいな情報でないことを祈りますよ。

結局のところ、よく分からんけども、「頻度主義的手法（frequentist method）」の「区間推定（interval estimate）」の際の「信頼区間（confidence intervals）」ってのは、「仮説検定」の「t検定」で具体例が示されることから、「仮説検定」で出てくる「信頼区間（confidence intervals）」について調べておけば良いってことなんですかね。

何て言うか、定義については、然るべき団体とかを設立してちゃんと共通のものを文書かなんかで定めておいて、誰でも参照できるような体制を確立しておいてもらえると助かるんですけどね...。

毎度毎度モヤモヤ感が半端ない...

今回はこのへんで。