Anonymized Firm Data under Test: Evidence from a Replication Study

Research output: Journal contributions > Journal articles > Research > peer-review

Authors

This paper contributes to the literature on the use of anonymized firm-level data by reporting results from a replication study. To test the practical usefulness of anonymized data, I selected two of my published papers based on different cross sections of firm data. The data used in those papers were anonymized by micro aggregation. I replicated the analyses reported in the papers with the anonymized data and then compared the results to those produced with the original data. The reported levels of statistical significance frequently differ. Furthermore, statistically significant coefficients sometimes differ by an order of magnitude. Therefore, at least for the moderate sample sizes used here, micro-aggregated firm data should not be considered a tool for empirical research.

Original language: English
Journal: Jahrbücher für Nationalökonomie und Statistik
Volume: 225
Issue number: 5
Pages (from-to): 584-591
Number of pages: 8
ISSN: 0021-4027
Publication status: Published - 09.2005

    Research areas

  • Economics - Replicative studies, Datasets, Statistical estimation, Statistical significance, Aggregation, Business entities, Research tools, Applied econometrics, Industry
  • Anonymized firm data, Micro aggregation, Replication study
