How to get really smart: Modeling retest and training effects in ability testing using computer-generated figural matrix items

Research output: Journal contributions › Journal articles › Research › peer-review

Standard

How to get really smart: Modeling retest and training effects in ability testing using computer-generated figural matrix items. / Freund, Philipp Alexander; Holling, Heinz.
In: Intelligence, Vol. 39, No. 4, 07.2011, p. 233-243.

Bibtex

@article{3f131713a30c4416813a82a38b05ad28,
title = "How to get really smart: Modeling retest and training effects in ability testing using computer-generated figural matrix items",
abstract = "The interpretation of retest scores is problematic because they are potentially affected by measurement and predictive bias, which impact construct validity, and because their size differs as a function of various factors. This paper investigates the construct stability of scores on a figural matrices test and models retest effects at the level of the individual test taker as a function of covariates (simple retest vs. training, use of identical vs. parallel retest forms, and general mental ability). A total of N=189 subjects took two tests of matrix items that were automatically generated according to a strict construction rationale. Between test administrations, participants in the intervention groups received training, while controls did not. The Rasch model fit the data at both time points, but there was a lack of item difficulty parameter invariance across time. Training increased test performance beyond simple retesting, but there was no large difference between the identical and parallel retest forms at the individual level. Individuals varied greatly in how they profited from retest experience, training, and the use of identical vs. parallel retest forms. The results suggest that even with carefully designed tasks, it is problematic to directly compare scores from initial tests and retests. Test administrators should emphasize learning potential instead of state level assessment, and inter-individual differences with regard to test experience should be taken into account when interpreting test results.",
keywords = "Economics, empirical/statistics, Figural matrix items, Individual change, Rational item construction, Retest effects, Training effects",
author = "Freund, {Philipp Alexander} and Heinz Holling",
year = "2011",
month = jul,
doi = "10.1016/j.intell.2011.02.009",
language = "English",
volume = "39",
pages = "233--243",
journal = "Intelligence",
issn = "0160-2896",
publisher = "Elsevier Ltd",
number = "4",

}

RIS

TY - JOUR

T1 - How to get really smart: Modeling retest and training effects in ability testing using computer-generated figural matrix items

AU - Freund, Philipp Alexander

AU - Holling, Heinz

PY - 2011/7

Y1 - 2011/7

N2 - The interpretation of retest scores is problematic because they are potentially affected by measurement and predictive bias, which impact construct validity, and because their size differs as a function of various factors. This paper investigates the construct stability of scores on a figural matrices test and models retest effects at the level of the individual test taker as a function of covariates (simple retest vs. training, use of identical vs. parallel retest forms, and general mental ability). A total of N=189 subjects took two tests of matrix items that were automatically generated according to a strict construction rationale. Between test administrations, participants in the intervention groups received training, while controls did not. The Rasch model fit the data at both time points, but there was a lack of item difficulty parameter invariance across time. Training increased test performance beyond simple retesting, but there was no large difference between the identical and parallel retest forms at the individual level. Individuals varied greatly in how they profited from retest experience, training, and the use of identical vs. parallel retest forms. The results suggest that even with carefully designed tasks, it is problematic to directly compare scores from initial tests and retests. Test administrators should emphasize learning potential instead of state level assessment, and inter-individual differences with regard to test experience should be taken into account when interpreting test results.

AB - The interpretation of retest scores is problematic because they are potentially affected by measurement and predictive bias, which impact construct validity, and because their size differs as a function of various factors. This paper investigates the construct stability of scores on a figural matrices test and models retest effects at the level of the individual test taker as a function of covariates (simple retest vs. training, use of identical vs. parallel retest forms, and general mental ability). A total of N=189 subjects took two tests of matrix items that were automatically generated according to a strict construction rationale. Between test administrations, participants in the intervention groups received training, while controls did not. The Rasch model fit the data at both time points, but there was a lack of item difficulty parameter invariance across time. Training increased test performance beyond simple retesting, but there was no large difference between the identical and parallel retest forms at the individual level. Individuals varied greatly in how they profited from retest experience, training, and the use of identical vs. parallel retest forms. The results suggest that even with carefully designed tasks, it is problematic to directly compare scores from initial tests and retests. Test administrators should emphasize learning potential instead of state level assessment, and inter-individual differences with regard to test experience should be taken into account when interpreting test results.

KW - Economics, empirical/statistics

KW - Figural matrix items

KW - Individual change

KW - Rational item construction

KW - Retest effects

KW - Training effects

UR - http://www.scopus.com/inward/record.url?scp=79957661124&partnerID=8YFLogxK

U2 - 10.1016/j.intell.2011.02.009

DO - 10.1016/j.intell.2011.02.009

M3 - Journal articles

VL - 39

SP - 233

EP - 243

JO - Intelligence

JF - Intelligence

SN - 0160-2896

IS - 4

ER -
