Harmonizing and synthesizing partnership histories

Extensive harmonization of survey and especially biographical data has not been common in the social sciences. HaSpaD provides a tool for harmonizing and accumulating, and thus comprehensively analyzing, survey-based longitudinal data sets on partnership biographies.

The following studies were harmonized and merged for a joint analysis for the third-party funded project "Harmonizing and synthesizing partnership histories from different research data infrastructures" (HaSpaD):

  • the panel studies pairfam, SOEP and SHARE,
  • the cross-section studies German General Social Survey, Mannheim Divorce Study and the Fertility and Family Survey,
  • as well as the cross-section studies combined with partially repeated surveys Family Surveys, German Life History Studies and the Generations and Gender Surveys.

The project provides syntax-based harmonization routines through the HaSpaD Harmonization Wizard, which lets users select survey programs and variables. After users have downloaded the source data sets from their repositories, the HaSpaD syntax package generates an individually customized, harmonized data set from them. In addition to biographical data on partnerships, the HaSpaD target dataset may include other variables such as age, gender, citizenship, and education level. If all surveys are selected, the target data set will contain approximately 182,000 partnership biographies.

Title: Harmonizing and synthesizing partnership histories from different research data infrastructures (HaSpaD)
Funded by: Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)
Project lead: Dr. Sonja Schulz, Dr. Bernd Weiß
Technical lead: Dr. Brigitte Mathiak
Staff: Sebastian Sterl, Anna-Carolina Haensch, Lisa Schmid, Antonia May and Matthäus Zloch
Student assistants: Franziska Kern, Dastan Jasim, Marcel Klemm, Lea Kroener, Jan Siepermann and Jan Heinicke
GESIS departments: Survey Data Curation (SDC), Survey Design and Methodology (SDM), technical support by Knowledge Technologies for the Social Sciences (KTS)
Data version: 1.0.0
Contact:
  • Dr. Sonja Schulz | Email: sonja.schulz(at)gesis(dot)org | Tel: +49 (0221) 47694-440
  • Dr. Bernd Weiß | Email: bernd.weiss(at)gesis(dot)org | Tel: +49 (0621) 1246-557
Study title Description and link to repository

Panel Analysis of Intimate Relationships and Family Dynamics (pairfam), Data Release 12.0 

The survey "Panel Analysis of Intimate Relationships and Family Dynamics" (pairfam) is a panel survey in multi-actor design. The dataset contains information on about 37,000 Living-Apart-Together relationships (LAT), nonmarital cohabiting couples (NEL) and marriages (survey years 2008-2019, annually).

RDC pairfam (https://www.pairfam.de/en/)

Version 12.0

German General Social Survey (GGSS/ALLBUS) – Cumulation 1980 – 2016

 

The German General Social Survey (ALLBUS-Cumulation) is a cross-sectional survey and partially panel in single-actor design. The dataset contains information on about 21,500 marriages (survey years 1980, 1982, 1984, 1986, 1988, 1991, 2000 and 2010).

RDC ALLBUS (https://www.gesis.org/en/allbus/allbus-home)

Version 1.0.0

Family Surveys

 

The Family Surveys (FS) are cross-sectional surveys, partially panel surveys in single-actor design. The dataset contains information on about 34,000 Living-Apart-Together relationships (LAT), nonmarital cohabiting couples (NEL) and marriages (survey years 1988, 1991, 1994/1995 and 2000).

RDC DJI-German Youth Institute (https://surveys.dji.de/index.php?m=msg,0&fID=2)

Family Survey I. Wave West 1988 Version 1.0.0

Family Survey I. Wave East 1990 Version 1.1.0

Family Survey II. Wave 1994 Version 1.1.0

Family Survey III. Wave 2000 Version 1.1.0

Mannheim Divorce Study

 

The Mannheim Divorce Study (MSS) is a cross-sectional survey in single-actor design. The dataset contains information on about 5,000 first marriages (survey year 1996).

Mannheim Centre for European Social Research (MZES) (http://www.mzes.uni-mannheim.de/d7/en/projects/determinants-of-divorce)

Version 2.0.0

German Fertility and Family Survey

 

The Fertility and Family Survey (FFS) is a cross-sectional survey in single-actor design. The dataset contains information on about 8,000 nonmarital cohabiting couples (NEL) and marriages (survey year 1992).

Federal Institute for Population Research (BiB) (https://www.bib.bund.de/EN/Research/Surveys/FFS/family-and-fertility-survey.html;jsessionid=0292024A4996684F17E2C310BDFD4C5A.2_cid389)

Version 1.0.0

German Life History Study

 

The German Life History Study (GLHS) is a cross-sectional survey, partially panel survey in single-actor design. The dataset contains information on about 13,700 Living-Apart-Together relationships (LAT), nonmarital cohabiting couples (NEL) and marriages (survey years 1981-83, 1985-1986, 1987-1988, 1988-1989, 1991-1992, 1996-1997, 1998-1999 and 2005-2005).

Max Planck Institute for Human Development (https://www.mpib-berlin.mpg.de/research/concluded-areas/center-for-sociology-and-the-study-of-the-life-course)

GLHS West I Version 1.1.0

GLHS West II A - Personal Survey Version 1.1.0

GLHS West II T - Telephone Survey Version 1.0.0

GLHS West III Version 1.1.0

GLHS GDR Version 1.0.0

GLHS East 71 Version 1.0.0

GLHS East Panel, birth cohorts 1939-41, 1951-52, 1959-61 Version 1.0.0

GLHS East Panel, birth cohort 1929-31 Version 1.0.0

GLHS West 64/71 Version 1.0.0

GLHS Panel 71 Version 1.0.0

Generations & Gender Survey (Sub sample Germany)

 

The Generations and Gender Survey (GGS) is a cross-sectional survey and partially panel survey in single-actor design. The dataset contains information on about 10,100 Living-Apart-Together relationships (LAT), nonmarital cohabiting couples (NEL) and marriages (survey years 2005 and 2008/2009).

Federal Institute for Population Research (BiB), Generations and Gender Programme (GGP) (https://www.ggp-i.org/): doi:10.17026/dans-z5z-xn8g, GGS Wave 1

Federal Institute for Population Research (BiB), Generations and Gender Programme (GGP) (https://www.ggp-i.org/): doi:10.17026/dans-xm6-a262, GGS Wave 2

Wave 1 Version 4.3. - Consolidated

Wave 2 Version 1.3. - Consolidated

The Survey of Health, Ageing and Retirement in Europe (SHARE)

 

The Survey of Health, Ageing and Retirement in Europe (SHARE) is a panel study in multi-actor design. The dataset contains information on about 2,500 Living-Apart-Together relationships (LAT), nonmarital cohabiting couples (NEL) and marriages (SHARELIFE, survey years 2008/2009).

RDC SHARE (http://www.share-project.org/home0.html)

Version 6-0-0

Please Note: Currently, version 7-1-0 (2021-09-22) is available. To obtain version 6-0-0, please contact share-rdc(at)centerdata(dot)nl.

Socio-Economic Panel (SOEP), data of the years 1984-2018

 

The Socio-Economic Panel (SOEP) is a panel study in multi-actor design. The dataset contains information on about 38,500 marriages (survey years 1984-2018).

Research Data Center SOEP, German Institute for Economic Research (DIW Berlin) (https://www.diw.de/en/diw_02.c.222518.en/research_data_center_of_the_soep.html)

Version 35

Please Note: Currently, version 36 (2021-09-22) is available. To obtain version 35, please contact soepmail(at)diw(dot)de.

The HaSpaD project offers a syntax package that creates a customized and harmonized dataset based on partnership biographies from surveys in Germany. The HaSpaD syntax package is available through the HaSpaD Harmonization Wizard, where survey programs and additional variables can be selected. After the selection, users download and unpack the haspad.zip file; the harmonization syntax is run inside the unpacked folder. Users must individually sign user agreements with the research repositories that provide the source data sets they use. The original source files must be downloaded manually and saved in the respective folder within the HaSpaD environment (the ~haspad/2_source folder). Subsequently, the HaSpaD harmonization syntax can be run to generate the customized and harmonized data set.

Below is an overview of system requirements, notes on data acquisition and a short step-by-step guide to obtaining the HaSpaD target dataset.

System requirements

To create a HaSpaD dataset, at least 15 gigabytes of storage and Stata version 14.0 or newer are required. We furthermore suggest using at least Stata SE.
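The storage requirement can be checked before any source data is downloaded. The following is a minimal Python sketch and not part of the HaSpaD package; the function name has_enough_space is purely illustrative, and the sketch assumes decimal gigabytes (10^9 bytes).

```python
import shutil

REQUIRED_GB = 15  # storage requirement stated in the system requirements


def has_enough_space(path=".", required_gb=REQUIRED_GB):
    """Return True if the drive holding `path` has at least `required_gb` GB free.

    Assumption: 1 GB is counted as 10**9 bytes.
    """
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= required_gb * 10**9


# Check the drive that holds the current working directory.
print("Enough free space for HaSpaD:", has_enough_space("."))
```

Pass the path where the haspad folder will be stored instead of "." to check a different drive. The Stata version still has to be checked separately inside Stata.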
Data acquisition

The HaSpaD project does not provide datasets. The original data sets that are harmonized and merged via the HaSpaD syntax package must be obtained from other data providers or research data centers, with whom users need to enter into individual use agreements. More detailed information on the original source data sets as well as contact information for the respective research repositories can be found in section "Included survey programs". When using the HaSpaD harmonization syntax, users must cite the original data sources in addition to the HaSpaD project. Citation rules can be found in section "Citations".

Please Note: The HaSpaD syntax package relies on exact versions of the original data sets. The successful generation of the HaSpaD dataset can only be guaranteed if exactly the same versions of the source data are used which were used to generate the data of the HaSpaD project in the first place.
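Because the syntax depends on exact source versions, it can help to compare the versions actually downloaded against the required ones before running anything. The sketch below is a hypothetical Python helper, not part of the HaSpaD package. The required versions are copied from the survey overview for a few programs only, and the dictionary of downloaded versions is something users would have to fill in by hand.

```python
# Versions the HaSpaD syntax expects, copied from the survey overview.
# Only a subset of the survey programs is listed in this sketch.
REQUIRED_VERSIONS = {
    "pairfam": "12.0",
    "ALLBUS": "1.0.0",
    "Mannheim Divorce Study": "2.0.0",
    "FFS": "1.0.0",
    "SHARE": "6-0-0",
    "SOEP": "35",
}


def version_mismatches(downloaded, required=REQUIRED_VERSIONS):
    """Return {survey: (found, expected)} for every survey whose version differs."""
    problems = {}
    for survey, found in downloaded.items():
        expected = required.get(survey)
        if expected is None:
            continue  # survey is not covered by this sketch
        if found != expected:
            problems[survey] = (found, expected)
    return problems


# Hypothetical example: a newer SHARE release was downloaded by mistake.
print(version_mismatches({"SHARE": "7-1-0", "SOEP": "35"}))
# prints {'SHARE': ('7-1-0', '6-0-0')}
```

An empty result means that every listed survey matches the version the syntax was built for.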

Quickstart
  1. Select survey programs and additional variables using the HaSpaD-Harmonization Wizard at https://haspad.gesis.org/wizard.
  2. Download ZIP-folder haspad.zip and unzip the folder. The folder structure has to be kept as follows:

    haspad
      1_documentation
      2_source
      3_harmonization
      4_target
      main.do
      README.txt

  3. Access original data sources through the GESIS Data Catalog (DBK) and/or the respective research data centres. In case the data is issued via Zip-folders: please do not unzip yet.
  4. Save the original data sources or ZIP-folders of the original data sources from step 3 in the directory ~haspad/2_source.
  5. In case the original data sources were issued via ZIP-folders, unpack these ZIP-folders from step 4 in ~haspad/2_source. Please unpack the data into a separate subfolder with the name of the original ZIP file, e.g. ~2_source\ZA2645_LV1.zip has to be unpacked as ~2_source\ZA2645_LV1\.
  6. Open the do-file main.do in the folder haspad.
  7. In line 40 of main.do, insert the file path of the location where the haspad folder has been stored.
     

    Example: unpacked folder haspad was stored in c:/mydata

    >40 cd "c:/mydata/haspad"

  8. Run the complete do-file main.do.
  9. The target data set target.dta including the selected variables can now be opened in ~haspad/4_target.
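Steps 2 and 5 of the quickstart are easy to get wrong by hand, so they can be scripted. The following is a minimal Python sketch under stated assumptions, not part of the HaSpaD package: the function names missing_entries and unpack_archives are illustrative, archives are assumed to end in lowercase .zip, and the sketch does not check whether an archive already contains a top-level folder of its own.

```python
import zipfile
from pathlib import Path

# Entries of the haspad folder as listed in step 2 of the quickstart.
EXPECTED_ENTRIES = [
    "1_documentation",
    "2_source",
    "3_harmonization",
    "4_target",
    "main.do",
    "README.txt",
]


def missing_entries(haspad_root):
    """Return the expected files and folders that are absent from haspad_root."""
    root = Path(haspad_root)
    return [name for name in EXPECTED_ENTRIES if not (root / name).exists()]


def unpack_archives(source_dir):
    """Unpack every ZIP file in source_dir into a subfolder named after it (step 5).

    Example: 2_source/ZA2645_LV1.zip is unpacked to 2_source/ZA2645_LV1/.
    """
    unpacked = []
    for archive in sorted(Path(source_dir).glob("*.zip")):
        target = archive.with_suffix("")  # drop the .zip extension
        target.mkdir(exist_ok=True)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        unpacked.append(target)
    return unpacked
```

A possible call, assuming the folder was stored in c:/mydata as in the example of step 7, would be missing_entries("c:/mydata/haspad") followed by unpack_archives("c:/mydata/haspad/2_source"). An empty list from missing_entries indicates that the folder structure is intact.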

The HaSpaD project does not provide data sets for download. Instead, it provides a syntax package that harmonizes source datasets from survey programs provided by data repositories. For detailed information on how to generate the HaSpaD target dataset, see the section "The HaSpaD Harmonization" and the HaSpaD Harmonization Wizard pages (https://haspad.gesis.org/wizard).

The HaSpaD Data Manual provides a detailed overview of the entire project, information on all integrated survey programs, and the harmonized partnership biography variables.

HaSpaD Datenhandbuch (German): HaSpaD Datenhandbuch (1.40 MB)
HaSpaD Data Manual (English): HaSpaD Data Manual (1.38 MB)

 

When using the HaSpaD preparation files, the HaSpaD project should be cited as follows:

HaSpaD Project: Schulz, Sonja; Weiß, Bernd; Sterl, Sebastian; Haensch, Anna-Carolina; Schmid, Lisa; May, Antonia (2021): HaSpaD - Harmonizing and synthesizing partnership histories from different research data infrastructures. A model project for linking research data from various infrastructures. Preparation files for the creation of a couple biography data set. Version 1.0.0. GESIS – Leibniz Institute for the Social Sciences, Cologne and Mannheim. doi: (tba).
HaSpaD Data Manual: Schulz, Sonja; Weiß, Bernd; Sterl, Sebastian; Haensch, Anna-Carolina; Schmid, Lisa; May, Antonia (2021): HaSpaD – Data Manual. Version 1. GESIS – Leibniz Institute for the Social Sciences. doi: (tba).

 

When using the HaSpaD syntax, in addition to citing the HaSpaD project, the selected and used original data sources must be cited as well.

Source data of the survey programs

Short study title Survey citation
pairfam

Brüderl, Josef, Sonja Drobnič, Karsten Hank, Franz. J. Neyer, Sabine Walper, Philipp Alt, Elisabeth Borschel, Christiane Bozoyan, Madison Garrett, Svenja Geissler, Tita Gonzalez Avilés, Nicolai Gröpler, Kristin Hajek, Michel Herzig, Bernadette Huyer-May, Rüdiger Lenke, Renate Lorenz, Katharina Lutz, Lara Minkus, Timo Peter, Trang Phan, Richard Preetz, Julia Reim, Barbara Sawatzki, Claudia Schmiedeberg, Philipp Schütze, Nina Schumann, Carolin Thönnissen, Katharina Timmermann & Martin Wetzel (2021). The German Family Panel (pairfam). GESIS Data Archive, Cologne. ZA5678 Data file Version 12.0.0, doi:10.4232/pairfam.5678.12.0.0

ALLBUS - Cumulation GESIS - Leibniz Institute for the Social Sciences (2018): ALLBUS/GGSS 1980-2016 (Kumulierte Allgemeine Bevölkerungsumfrage der Sozialwissenschaften / Cumulated German General Social Survey 1980-2016). GESIS Data Archive, Cologne. ZA4586 Data file Version 1.0.0, https://doi.org/10.4232/1.13029
Family Survey I. Wave West 1988 Deutsches Jugendinstitut (DJI), München (1992): Change and Development of Forms of Family Life in West Germany (Survey of Families). GESIS Data Archive, Cologne. ZA2245 Data file Version 1.0.0, https://doi.org/10.4232/1.2245
Family Survey I. Wave East 1990 Deutsches Jugendinstitut (DJI), München (2018): Family and Partner Relations in Eastern Germany (Survey of Families). GESIS Data Archive, Cologne. ZA2392 Data file Version 1.1.0, https://doi.org/10.4232/1.13196
Family Survey II. Wave 1994 Deutsches Jugendinstitut (DJI), München (2018): Change and Development of Ways of Family Life - 2nd Wave (Survey of Families). GESIS Data Archive, Cologne. ZA2860 Data file Version 1.1.0, https://doi.org/10.4232/1.13197
Family Survey III. Wave 2000 Deutsches Jugendinstitut (DJI), München (2018): Change and Development of Families` Way of Life - 3rd Wave (Family Survey). GESIS Data Archive, Cologne. ZA3920 Data file Version 1.1.0, https://doi.org/10.4232/1.13198
Mannheim Divorce Study Esser, Hartmut; Gostomski, Christian Babka von; Hartmann, Josef (2018): Mannheim Divorce Study 1996. GESIS Data Archive, Cologne. ZA3188 Data file Version 2.0.0, https://doi.org/10.4232/1.13056
German Fertility and Family Survey Bundesinstitut für Bevölkerungsforschung, Wiesbaden (2002): German Fertility and Family Survey 1992. GESIS Data Archive, Cologne. ZA3400 Data file Version 1.0.0, https://doi.org/10.4232/1.3400
GLHS West I Mayer, Karl U. (2018): Courses of Life and Social Change: Courses of Life and Welfare Development (Life History Study LV-West I). GESIS Data Archive, Cologne. ZA2645 Data file Version 1.1.0, https://doi.org/10.4232/1.13193
GLHS West II A - Personal Survey Mayer, Karl U. (2018): Courses of Life and Social Change: The Between-the-War Cohort in Transition to Retirement (Life History Study LV-West II A - Personal Interview). GESIS Data Archive, Cologne. ZA2646 Data file Version 1.1.0, https://doi.org/10.4232/1.13194
GLHS West II T - Telephone Survey Mayer, Karl U. (1995): Courses of Life and Social Change: The Between-the-War Cohort in Transition to Retirement (Life History Study LV-West II T - Telephone Interview). GESIS Data Archive, Cologne. ZA2647 Data file Version 1.0.0, https://doi.org/10.4232/1.2647
GLHS West III Mayer, Karl U. (2018): Courses of Life and Social Change: Access to Occupation in Employment Crisis (Life History Study LV-West III). GESIS Data Archive, Cologne. ZA2648 Data file Version 1.1.0, https://doi.org/10.4232/1.13195
GLHS GDR Mayer, Karl U. (1995): Courses of Life and Historical Change in East Germany (Life History Study LV DDR). GESIS Data Archive, Cologne. ZA2644 Data file Version 1.0.0, https://doi.org/10.4232/1.2644
GLHS East 71 Mayer, Karl U. (2004): East German Life Courses After Unification (Life History Study LV Ost 71). GESIS Data Archive, Cologne. ZA3926 Data file Version 1.0.0, https://doi.org/10.4232/1.3926
GLHS East Panel Mayer, Karl U. (2004): East German Life Courses After Unification (Life History Study LV-Ost Panel). GESIS Data Archive, Cologne. ZA3925 Data file Version 1.0.0, https://doi.org/10.4232/1.3925
GLHS West 64/71 Mayer, Karl U.; Kleinhenz, Gerhard (2004): Education, Training, and Occupation: Life Courses of the 1964 and 1971 Birth Cohorts in West Germany (Life History Study LV-West 64/71). GESIS Data Archive, Cologne. ZA3927 Data file Version 1.0.0, https://doi.org/10.4232/1.3927
GLHS Panel 71 Mayer, Karl U. (2014): Early Careers and Starting a Family: Life Courses of the 1971 Birth Cohorts in East and West Germany (Life History Study LV-Panel 71). GESIS Data Archive, Cologne. ZA5099 Data file Version 1.0.0, https://doi.org/10.4232/1.5099
Generations & Gender Survey (2005)

Generations & Gender Survey (2005) (German subsample). This paper uses data from the GGS Waves 1 and 2 (DOIs: 10.17026/dans-z5z-xn8g, 10.17026/dans-xm6-a262), see Gauthier, A. H. et al. (2018) or visit the GGP website (https://www.ggp-i.org/) for methodological details. Working Version: GGS Wave 1 Version 4.3. - Consolidated. identification: GGSW1.Cons.

Generations & Gender Survey (2008)

Generations & Gender Survey (2008) (German subsample). This paper uses data from the GGS Waves 1 and 2 (DOIs: 10.17026/dans-z5z-xn8g, 10.17026/dans-xm6-a262), see Gauthier, A. H. et al. (2018) or visit the GGP website (https://www.ggp-i.org/) for methodological details. Working Version: GGS Wave 2 Version 1.3. - Consolidated. identification: GGSW2.Cons.

SHARE

The Survey of Health, Ageing and Retirement in Europe (SHARE) / SHARE Waves 1, 2, and 3. This paper uses data from SHARE Waves 1, 2 and 3 (DOIs: 10.6103/SHARE.w1.600, 10.6103/SHARE.w2.600, 10.6103/SHARE.w3.600), see Börsch-Supan et al. (2013) for methodological details. (1)
The SHARE data collection has been funded by the European Commission through FP5 (QLK6-CT-2001-00360), FP6 (SHARE-I3: RII-CT-2006-062193, COMPARE: CIT5-CT-2005-028857, SHARELIFE: CIT4-CT-2006-028812), FP7 (SHARE-PREP: GA N°211909, SHARE-LEAP: GA N°227822, SHARE M4: GA N°261982) and Horizon 2020 (SHARE-DEV3: GA N°676536, SERISS: GA N°654221) and by DG Employment, Social Affairs & Inclusion. Additional funding from the German Ministry of Education and Research, the Max Planck Society for the Advancement of Science, the U.S. National Institute on Aging (U01_AG09740-13S2, P01_AG005842, P01_AG08291, P30_AG12815, R21_AG025169, Y1-AG-4553-01, IAG_BSR06-11, OGHA_04-064, HHSN271201300071C) and from various national funding sources is gratefully acknowledged (see www.share-project.org).

(1) Usually this information will be part of the text of a scientific publication already. If this is the case, there is no need to include this information in the acknowledgement once more.

  • Börsch-Supan, A. (2017). Survey of Health, Ageing and Retirement in Europe (SHARE) Wave 1. Release version: 6.0.0. SHARE-ERIC. Data set. DOI: 10.6103/SHARE.w1.600
  • Börsch-Supan, A. (2017). Survey of Health, Ageing and Retirement in Europe (SHARE) Wave 2. Release version: 6.0.0. SHARE-ERIC. Data set. DOI: 10.6103/SHARE.w2.600
  • Börsch-Supan, A. (2017). Survey of Health, Ageing and Retirement in Europe (SHARE) Wave 3 – SHARELIFE. Release version: 6.0.0. SHARE-ERIC. Data set. DOI: 10.6103/SHARE.w3.600

 

  • Börsch-Supan, A., M. Brandt, C. Hunkler, T. Kneip, J. Korbmacher, F. Malter, B. Schaan, S. Stuck, S. Zuber (2013). Data Resource Profile: The Survey of Health, Ageing and Retirement in Europe (SHARE). International Journal of Epidemiology. DOI: 10.1093/ije/dyt088
SOEP Liebig, Stefan; Goebel, Jan; Kroh, Martin; Schröder, Carsten; Grabka, Markus; Schupp, Jürgen; Bartels, Charlotte; Fedorets, Alexandra; Franken, Andreas; Jacobsen, Jannes; Kara, Selin; Krause, Peter; Kröger, Hannes; Metzing, Maria; Richter, David; Schacht, Diana; Schmelzer, Paul; Schmitt, Christian; Schnitzlein, Daniel; Siegers, Rainer; Wenzig, Knut; Zimmermann, Stefan (2019): Sozio-oekonomisches Panel (SOEP), Daten für die Jahre 1984-2018, Version 35, SOEP, 2019, doi: 10.5684/soep.v35.