FAQs: Choosing survey weights for gllamm

www.gllamm.org

Which survey weights should I use?

Title		Choosing survey weights for gllamm
Author		Minjeong Jeon, University of California, Berkeley
Date		July 2012

The answer depends on what kind of sampling weights you have in your data set. Suppose you have a dataset consisting of two levels. We can think of the total weights (w_ij) as product of leve1-2 weights (w_j) and level-1 weights (w_i|j):

w_ij = w_j × w_i|j where w_j is the inverse of the probability that the level-2 unit (or primary sampling unit) was selected and w_i|j is the inverse of the conditional probability that the level-1 unit was selected given that the level-2 unit that it belongs to was selected. Use the level-1 and level-2 weights if you have both of them. If only level-2 weights are available (and the level-1 units were sampled from level-2 units with equal probabilities), use the level-2 weights only. Similarly, if you have level-1 weights only and the level-2 units were sampled with equal probabilities, use level-1 weights only.

If you have level-1 weights, do not use them without proper rescaling of the weights. Scaling weights is still an on-going research area; see for example Rabe-Hesketh and Skrondal (2006).

A problem arises when you have only total weights. If you have only total weights, do not use the total weights for pweight() option. Remember that pweight() allows weights for individual levels only. When weights at individual levels are not available, one alternative way is to utilize design variables. By using the design variables as covariates in your regression model, you can still obtain consistent estimates. Another alternative is not to use multilevel modeling but to use ordinary regression models with the svy prefix command.

Examples and documentation

Exercise 10.8 on PISA data in Rabe-Hesketh, S. and Skrondal, A. (2012). Multilevel and Longitudinal Modeling Using Stata (Third Edition). Volume II: Categorical Responses, Counts, and Survival. College Station, TX: Stata Press.
- Download Chapter 10
- Solution to exercise 10.8
Rabe-Hesketh, S. and Skrondal, A. (2006). Multilevel modeling of complex survey data. Journal of the Royal Statistical Society, Series A 169, 805-827. Local
- PISA data
- Do-file

References

Rabe-Hesketh, S. and Skrondal, A. (2006). Multilevel modeling of complex survey data. Journal of the Royal Statistical Society, Series A 169, 805-827. Local