- Quantity of interest or parameter of a model
Frequentist: is described as unknown and deterministic
Example: Throwing a coin for 100 times, there are 60 times that coin head appears, then . When sample data approaches infinity, this method from frequentist is able to give an accurate estimation of . However, if sample data is very scarce, then severe bias could occur. To conclude, more data, more accurate estimation of with frequentist method.Bayesian: is described as random. There are two inputs: prior , likelihood (似然) . There is one output: Posterior
Bayesian estimation is based on Bayesian rule:
Therefore, we have . Because, as observation (or sample data) is given as a condition in , so .
Example: Considering this example of flipping a coin again. is a distribution, instead of a deterministic value of 0.6 in this example. With the increase of sample data, trusts measurement more than prior.
Note: if prior is uniform distribution, then Bayesian method is equal to frequentist method.Maximum likelihood Estimation (MLE) - frequentist method
A given set of observations, random sample data , which is independent and identical distribution. The estimation of using MLE method can be expressed below:
The last line of above equation is called Negative Log likelihood (NLL)Maximum A Posteriori (MAP) - Bayesian method
A given set of observations, random sample data , which is independent and identical distribution. The estimation of using MAP method can be expressed below:
Given that prior is a Gaussian distribution:
Then