German Microdata Lab

Variance Estimation using Microcensus Data under consideration of Sampling Design

The microcensus (MZ) is carried out as a stratified area sample with sample districts as a primary unit. Not all stratification variables are available in the factually anonymized microcensus (FAMZ). Moreover, the selection of the FAMZ from the MZ represents another stage of the survey. The consequence of this complex design is that no standard assumption of a simple random sample can be made for variance estimates on the basis of factually anonymized microcensus data. To date, only design-effect factors published by the Federal Statistical Office could be used for projections or statements about the population. However, these factors are not available for every unit of analysis and do not take into consideration the sampling of a 70 percent survey at household level for the factually anonymized file. The 1996 microcensus was the first time that the building category as well as identification of the primary unit (system-independent sample district number) was made available for the purposes of projection under consideration of sample design, so it was subsequently possible to examine the question in close cooperation with Prof. Dr. Ulrich Rendtel (J.W. Goethe University, Frankfurt/M.) as to how users of the factually anonymized microcensus data can adequately take into account the sample design for variance estimation. For variance estimation, programs are developed for the most common statistical packages (SAS, SPSS, Stata).

  • Schimpl-Neimanns, B., 2010: Varianzschätzung für Mikrozensus Scientific Use Files ab 2005. GESIS-Technical Report Nr. 2010/03. [.pdf] (1.22 MB)
  • Schimpl-Neimanns, B., 2009: Schätzung des Stichprobenfehlers im Mikrozensus mit Stata – Eine Einführung mit Beispielen zum Campus File Mikrozensus 2002. GESIS-Methodenbericht 2009/02. Mannheim: GESIS. [.pdf] (1.26 MB)
  • Rendtel, U.; Schimpl-Neimanns, B., 2000: Varianzschätzungen für den faktisch anonymisierten Mikrozensus. In: Jahrbücher für Nationalökonomie und Statistik 220/6, 2000, S. 759-776.