The Bayesian approach to finite population prediction often assumes a parametric model, but it aims to find the posterior distribution of T given \(y_s\). Point estimates can be obtained by setting a loss function, although in many practical problems, the posterior mean is often considered and its associated variance is given by the posterior variance, i.e.:
\[\begin{equation} \tag{2.3} E(T | y_s) = 1^{'}_s y_s + 1^{'}_{\bar{s}} E(y_{\bar{s}} | y_s) \hspace{0.7cm} \text{and} \hspace{0.7cm} V(T | y_s) = 1^{'}_{\bar{s}} V(y_{\bar{s}} | y_s) 1_{\bar{s}} \end{equation}\]
It is possible to obtain an approximation to the quantities in (2.3) by using a Bayes linear estimation approach. Here, we will particularly obtain the estimators by assuming a general two-stage hierarchical model for finite population, specified only by its mean and variance-covariance matrix, presented in Bolfarine and Zacks (1992), page 76. Particular cases describing usual population structures found in practice are easily derived from (2.4). The general model can be written as:
\[\begin{equation} \tag{2.4} Y \hspace{0.1cm} | \hspace{0.1cm} \beta \sim [X \beta, V] \hspace{0.7cm} \text{and} \hspace{0.7cm} \beta \sim [a,R] \end{equation}\]
where \(X\) is a covariate matrix of dimension \(N \times p\), with rows \(X_i = (x_{i1}, ..., x_{ip})\), \(i = 1, ..., N\);
\(\beta = (\beta_1, ..., \beta_p)'\) is a \(p \times 1\) vector of unknown parameters; and \(y\), given \(\beta\), is a random vector with mean \(X\beta\) and known covariance matrix \(V\) of dimension \(N \times N\). Analogously \(a\) and \(R\) are the respective \(p \times 1\) prior mean vector and \(p \times p\) prior covariance matrix of \(\beta\).
Since the response vector \(y\) is partitioned into \(y_s\) and \(y_\bar{s}\), the matrix \(X\), which is assumed to be known, is analogously partitioned into \(X_s\) and \(X_\bar{s}\), and \(V\) is partitioned into \(V_s\), \(V_\bar{s}\), \(V_{s \bar{s}}\) and \(V_{\bar{s} s}\). The first aim is to predict \(y_\bar{s}\) given the observed sample \(y_s\) and then the total \(T\). We did this in the following steps: first, we used a joint prior distribution that is only partially specified in terms of moments, as follows:
\[\begin{equation} \begin{pmatrix} y_{\bar{s}}\\ y_s \end{pmatrix} \Big| \beta \hspace{0.1cm} \sim \hspace{0.1cm} \begin{bmatrix} \begin{pmatrix} X_{\bar{s}} \beta\\ X_s \beta \end{pmatrix},\begin{pmatrix} V_{\bar{s}} & V_{\bar{s} s}\\ V_{s \bar{s}} & V_s \end{pmatrix} \end{bmatrix} \end{equation}\]
(…)
The BLE_Reg() function will apply this methodology to the given sample, calculate the Bayes Linear Estimator (and its associate variance) to the parameter \(\beta\) and for the individuals not in the sample, given the auxiliary variable values. In a simple model the auxiliary variable will have value \(1\) for every individual.
xs <- matrix(c(1,1,1,1,2,3,5,0),nrow=4,ncol=2)
ys <- c(12,17,28,2)
x_nots <- matrix(c(1,1,1,0,1,4),nrow=3,ncol=2)
a <- c(1.5,6)
R <- matrix(c(10,2,2,10),nrow=2,ncol=2)
Vs <- diag(c(1,1,1,1))
V_nots <- diag(c(1,1,1))
Estimator <- BLE_Reg(ys,xs,a,R,Vs,x_nots,V_nots)
Estimator
#> $est.beta
#>       Beta
#> 1 1.723363
#> 2 5.206675
#> 
#> $Vest.beta
#>           V1          V2
#> 1  0.6708234 -0.17568311
#> 2 -0.1756831  0.07225381
#> 
#> $est.mean
#>      y_nots
#> 1  1.723363
#> 2  6.930039
#> 3 22.550064
#> 
#> $Vest.mean
#>            V1         V2          V3
#> 1  1.67082340 0.49514029 -0.03190904
#> 2  0.49514029 1.39171098  0.08142307
#> 3 -0.03190904 0.08142307  1.42141940
#> 
#> $est.tot
#> [1] 90.20347
#> 
#> $Vest.tot
#> [1] 5.573262