Cleansing procedures for overlaps and inconsistencies in administrative data. The case of German labour market data

Research output: Journal contributionsJournal articlesResearchpeer-review

Standard

Cleansing procedures for overlaps and inconsistencies in administrative data. The case of German labour market data. / Scioch, Patrycja; Oberschachtsiek, Dirk.

In: Historical Social Research, Vol. 34, No. 3, 2009, p. 242-259.

Research output: Journal contributionsJournal articlesResearchpeer-review

Harvard

APA

Vancouver

Bibtex

@article{46e99fcd53604927a58a89d91a82dcf9,
title = "Cleansing procedures for overlaps and inconsistencies in administrative data. The case of German labour market data",
abstract = "»Die Bedeutung von Bereinigungsprozeduren fur {\"U}berschneidungen und Inkonsistenzen in administrativen Daten. Am Beispiel deutscher Arbeitsmarktdaten«. Process-generated and administrative datasets have become increasingly important for labour market research over the past ten years. Major advantages of this data are large sample sizes, absence of retrospective gaps and unit nonresponses. Nevertheless, the quality and validity of the information remain unclear. This paper contributes to this subject, focusing on the variation of research results due to alternative data cleansing procedures. In particular, the paper uses the general set up for data cleaning proposed by Wunsch/Lechner (2008) in evaluating the outcome of training programmes in Germany. First results are limited to the sensitivity of the construction of the sample populations used for the counterfactuals analysis. The results emphasize that sample construction seems to be robust to the scenario used for the data cleansing.",
keywords = "Data fusion, Data management record linkage, Labour market data, Longitudinal analysis, Process-generated data, Public administrational data, Social bookkeeping data, Economics",
author = "Patrycja Scioch and Dirk Oberschachtsiek",
year = "2009",
language = "English",
volume = "34",
pages = "242--259",
journal = "Historical Social Research",
issn = "0172-6404",
publisher = "Zentrum fur Historische Sozialforschung e.V",
number = "3",

}

RIS

TY - JOUR

T1 - Cleansing procedures for overlaps and inconsistencies in administrative data. The case of German labour market data

AU - Scioch, Patrycja

AU - Oberschachtsiek, Dirk

PY - 2009

Y1 - 2009

N2 - »Die Bedeutung von Bereinigungsprozeduren fur Überschneidungen und Inkonsistenzen in administrativen Daten. Am Beispiel deutscher Arbeitsmarktdaten«. Process-generated and administrative datasets have become increasingly important for labour market research over the past ten years. Major advantages of this data are large sample sizes, absence of retrospective gaps and unit nonresponses. Nevertheless, the quality and validity of the information remain unclear. This paper contributes to this subject, focusing on the variation of research results due to alternative data cleansing procedures. In particular, the paper uses the general set up for data cleaning proposed by Wunsch/Lechner (2008) in evaluating the outcome of training programmes in Germany. First results are limited to the sensitivity of the construction of the sample populations used for the counterfactuals analysis. The results emphasize that sample construction seems to be robust to the scenario used for the data cleansing.

AB - »Die Bedeutung von Bereinigungsprozeduren fur Überschneidungen und Inkonsistenzen in administrativen Daten. Am Beispiel deutscher Arbeitsmarktdaten«. Process-generated and administrative datasets have become increasingly important for labour market research over the past ten years. Major advantages of this data are large sample sizes, absence of retrospective gaps and unit nonresponses. Nevertheless, the quality and validity of the information remain unclear. This paper contributes to this subject, focusing on the variation of research results due to alternative data cleansing procedures. In particular, the paper uses the general set up for data cleaning proposed by Wunsch/Lechner (2008) in evaluating the outcome of training programmes in Germany. First results are limited to the sensitivity of the construction of the sample populations used for the counterfactuals analysis. The results emphasize that sample construction seems to be robust to the scenario used for the data cleansing.

KW - Data fusion

KW - Data management record linkage

KW - Labour market data

KW - Longitudinal analysis

KW - Process-generated data

KW - Public administrational data

KW - Social bookkeeping data

KW - Economics

UR - http://www.scopus.com/inward/record.url?scp=70350304881&partnerID=8YFLogxK

M3 - Journal articles

AN - SCOPUS:70350304881

VL - 34

SP - 242

EP - 259

JO - Historical Social Research

JF - Historical Social Research

SN - 0172-6404

IS - 3

ER -