SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
On a Reparameterization of the MC-DINA Model 11 March 2025 at 10:40

On a Reparameterization of the MC-DINA Model

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Lawrence T. DeCarlo116042Teachers College， Columbia University， New York， NY， USA

11 March 2025 at 10:40

Applied Psychological Measurement, Ahead of Print.
The MC-DINA model is a cognitive diagnosis model (CDM) for multiple-choice items that was introduced by de la Torre (2009). The model extends the usual CDM in two basic ways: it allows for nominal responses instead of only dichotomous responses, and it ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
A Two-Step Q-Matrix Estimation Method 10 October 2024 at 11:00

A Two-Step Q-Matrix Estimation Method

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Hans-Friedrich Köhn， Chia-Yi Chiu， Olasumbo Oluwalana， Hyunjoo Kim， Jiaxi Wang114589University of Illinois at Urbana-Champaign， IL， USA25930Teachers College， Columbia University， NY， USA36729Educational Testing Service， NJ， USA4242612Independent Researcher

10 October 2024 at 11:00

Applied Psychological Measurement, Volume 49, Issue 1-2, Page 3-28, March 2025.
Cognitive Diagnosis Models in educational measurement are restricted latent class models that describe ability in a knowledge domain as a composite of latent skills an examinee may have mastered or failed. Different combinations of skills define distinct ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Optimal Test Design for Estimation of Mean Ability Growth 15 October 2024 at 01:37

Optimal Test Design for Estimation of Mean Ability Growth

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Jonas Bjermo1Linköping University， Sweden27675Stockholm University， Sweden

15 October 2024 at 01:37

Applied Psychological Measurement, Volume 49, Issue 1-2, Page 29-49, March 2025.
The design of an achievement test is crucial for many reasons. This article focuses on a population’s ability growth between school grades. We define design as the allocating of test items concerning the difficulties. The objective is to present an ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
The Improved EMS Algorithm for Latent Variable Selection in M3PL Model 21 October 2024 at 01:37

The Improved EMS Algorithm for Latent Variable Selection in M3PL Model

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Laixu Shang， Ping-Feng Xu， Na Shan， Man-Lai Tang， Qian-Zhen Zheng166344Zhejiang Normal University， Jinhua， China247821Northeast Normal University， Changchun， China3Shanghai Zhangjiang Institute of Mathematics， Shanghai， China43769University of Hertfordshire， Hertfordshire， UK

21 October 2024 at 01:37

Applied Psychological Measurement, Volume 49, Issue 1-2, Page 50-70, March 2025.
One of the main concerns in multidimensional item response theory (MIRT) is to detect the relationship between items and latent traits, which can be treated as a latent variable selection problem. An attractive method for latent variable selection in ...

R Package for Calculating Estimators of the Proportion of Explained Variance and Standardized Regression Coefficients in Multiply Imputed Datasets

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Joost R. van Ginkel， Julian D. Karch1100575Leiden University， Leiden， The Netherlands

27 January 2025 at 02:12

Applied Psychological Measurement, Ahead of Print.

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
An Experimental Design to Investigate Item Parameter Drift 24 January 2025 at 08:41

An Experimental Design to Investigate Item Parameter Drift

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Peter Baldwin， Irina Grabovsky， Kimberly A. Swygert， Thomas Fogle， Pilar Reid， Brian E. ClauserNBME， Philadelphia， PA， USA

24 January 2025 at 08:41

Applied Psychological Measurement, Ahead of Print.
Methods for detecting item parameter drift may be inadequate when every exposed item is at risk for drift. To address this scenario, a strategy for detecting item parameter drift is proposed that uses only unexposed items deployed in a stratified random ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Application of Bayesian Decision Theory in Detecting Test Fraud 28 January 2025 at 12:07

Application of Bayesian Decision Theory in Detecting Test Fraud

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Sandip Sinharay， Matthew S. Johnson16729Educational Testing Service， NJ， USA

28 January 2025 at 12:07

Applied Psychological Measurement, Ahead of Print.
This article suggests a new approach based on Bayesian decision theory (e.g., Cronbach & Gleser, 1965; Ferguson, 1967) for detection of test fraud. The approach leads to a simple decision rule that involves the computation of the posterior probability ...

Compound Optimal Design for Online Item Calibration Under the Two-Parameter Logistic Model

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Lihong Song， Wenyi Wang1School of Education， 12642Jiangxi Normal University， Nanchang， China2School of Computer and Information Engineering， 12642Jiangxi Normal University， Nanchang， China

29 January 2025 at 05:24

Applied Psychological Measurement, Ahead of Print.
Under the theory of sequential design, compound optimal design with two optimality criteria can be used to solve the problem of efficient calibration of item parameters of item response theory model. In order to efficiently calibrate item parameters in ...

Few and Different: Detecting Examinees With Preknowledge Using Extended Isolation Forests

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Nate R. Smith， Lisa A. Keller， Richard A. Feinberg， Chunyan Liu114707University of Massachusetts Amherst， Amherst， MA， USA244207National Board of Medical Examiners， Philadelphia， PA， USA

21 February 2025 at 04:14

Applied Psychological Measurement, Ahead of Print.
Item preknowledge refers to the case where examinees have advanced knowledge of test material prior to taking the examination. When examinees have item preknowledge, the scores that result from those item responses are not true reflections of the examinee’...

Impact of Parameter Predictability and Joint Modeling of Response Accuracy and Response Time on Ability Estimates

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Maryam Pezeshki， Susan Embretson11372School of Psychology， Georgia Institute of Technology， GA， USA

26 February 2025 at 12:47

Applied Psychological Measurement, Ahead of Print.
To maintain test quality, a large supply of items is typically desired. Automatic item generation can result in a reduction in cost and labor, especially if the generated items have predictable item parameters and thus possibly reducing or eliminating the ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Weighted Answer Similarity Analysis 1 March 2025 at 11:54

Weighted Answer Similarity Analysis

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Nicholas Trout， Kylie Gorney13078Michigan State University， East Lansing， MI， USA

1 March 2025 at 11:54

Applied Psychological Measurement, Ahead of Print.
Romero et al. (2015; see also Wollack, 1997) developed theωstatistic as a method for detecting unusually similar answers between pairs of examinees. For each pair, theωstatistic considers whether the observed number of similar answers is significantly ...

Test Security and the Pandemic: Comparison of Test Center and Online Proctor Delivery Modalities

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Kirk A. Becker， Jinghua Liu， Paul E. Jones1142883Pearson VUE， Chicago IL， USA

24 April 2024 at 02:04

Applied Psychological Measurement, Ahead of Print.
Published information is limited regarding the security of testing programs, and even less on the relative security of different testing modalities: in-person at test centers (TC) versus remote online proctored (OP) testing. This article begins by ...

A Generalized Multi-Detector Combination Approach for Differential Item Functioning Detection

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Shan Huang， Hidetoki Ishii112965Nagoya University， Japan

20 December 2024 at 04:34

Applied Psychological Measurement, Ahead of Print.
Many studies on differential item functioning (DIF) detection rely on single detection methods (SDMs), each of which necessitates specific assumptions that may not always be validated. Using an inappropriate SDM can lead to diminished accuracy in DIF ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
An Information Manifold Perspective for Analyzing Test Data 20 December 2024 at 05:39

An Information Manifold Perspective for Analyzing Test Data

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: James O. Ramsay， Juan Li， Joakim Wallmark， Marie Wiberg15620McGill University， Montreal， Canada210055Ottawa Hospital Research Institute， Ottawa， Canada38075Umeå University， Sweden

20 December 2024 at 05:39

Applied Psychological Measurement, Ahead of Print.
Modifications of current psychometric models for analyzing test data are proposed that produce an additive scale measure of information. This information measure is a one-dimensional space curve or curved surface manifold that is invariant across varying ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Inference of Correlations Among Testlet Effects: A Latent Variable Selection Method 26 December 2024 at 09:46

Inference of Correlations Among Testlet Effects: A Latent Variable Selection Method

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Xin Xu， Jinxin Guo， Tao Xin1College of Science， Minzu University of China， Beijing， China2Collaborative Innovation Center of Assessment Toward Basic Education Quality， 47836Beijing Normal University， Beijing， China3School of Educational Science， Anhui Normal University， Wuhu， China

26 December 2024 at 09:46

Applied Psychological Measurement, Ahead of Print.
In psychological and educational measurement, a testlet-based test is a common and popular format, especially in some large-scale assessments. In modeling testlet effects, a standard bifactor model, as a common strategy, assumes different testlet effects ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Adaptive Measurement of Change in the Context of Item Parameter Drift 30 December 2024 at 09:34

Adaptive Measurement of Change in the Context of Item Parameter Drift

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Allison W. Cooperman， Ming Him Tai， Joseph N. DeWeese， David J. Weiss15635University of Minnesota – Twin Cities， Minneapolis， MN， USA28082Pennsylvania State University， State College， PA， USA

30 December 2024 at 09:34

Applied Psychological Measurement, Ahead of Print.
Adaptive measurement of change (AMC) uses computerized adaptive testing (CAT) to measure and test the significance of intraindividual change on one or more latent traits. The extant AMC research has so far assumed that item parameter values are constant ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Comparing Approaches to Estimating Person Parameters for the MUPP Model 28 January 2025 at 07:30

Comparing Approaches to Estimating Person Parameters for the MUPP Model

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: David M. LaHuis， Caitlin E. Blackmore， Gage M. Ammons12816Wright State University， Dayton， OH， USA297158Aon Hewitt， Lincolnshire， IL， USA

28 January 2025 at 07:30

Applied Psychological Measurement, Ahead of Print.
This study compared maximum a posteriori (MAP), expected a posteriori (EAP), and Markov Chain Monte Carlo (MCMC) approaches to computing person scores from the Multi-Unidimensional Pairwise Preference Model. The MCMC approach used the No-U-Turn sampling (...

Semi-Parametric Item Response Theory With O’Sullivan Splines for Item Responses and Response Time

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Chen-Wei Liu134879National Taiwan Normal University， Taiwan

3 February 2025 at 01:08

Applied Psychological Measurement, Ahead of Print.
Response time (RT) has been an essential resource for supplementing the estimation accuracy of latent traits and item parameters in educational testing. Most item response theory (IRT) approaches are based on parametric RT models. However, since test ...

Modeling Within- and Between-Person Differences in the Use of the Middle Category in Likert Scales

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Jesper Tijmstra， Maria Bolsinova17899Tilburg University， The Netherlands

3 March 2025 at 04:34

Applied Psychological Measurement, Ahead of Print.
When using Likert scales, the inclusion of a middle-category response option poses a challenge for the valid measurement of the psychological attribute of interest. While this middle category is often included to provide respondents with a neutral ...

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Book Review: Generalized Kernel Equating with Applications in R 23 January 2025 at 03:55

Book Review: Generalized Kernel Equating with Applications in R

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Michela Battauz1University of Udine， Italy

23 January 2025 at 03:55

Applied Psychological Measurement, Ahead of Print.

Modeling Within- and Between-Person Differences in the Use of the Middle Category in Likert Scales

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Jesper Tijmstra Maria Bolsinova

3 March 2025 at 04:34

Applied Psychological Measurement, Ahead of Print.
When using Likert scales, the inclusion of a middle-category response option poses a challenge for the valid measurement of the psychological attribute of interest. While this middle category is often included to provide respondents with a neutral response option, respondents may in practice also select this category when they do not want to or cannot give an informative response. If one analyzes the response data without considering these two possible uses of the middle response category, measurement may be confounded. In this paper, we propose a response-mixture IRTree model for the analysis of Likert-scale data. This model acknowledges that the middle response category can either be selected as a non-response option (and hence be uninformative for the attribute of interest) or to communicate a neutral position (and hence be informative), and that this choice depends on both person- and item-characteristics. For each observed middle-category response, the probability that it was intended to be informative is modeled, and both the attribute of substantive interest and a non-response tendency are estimated. The performance of the model is evaluated in a simulation study, and the procedure is applied to empirical data from personality psychology.

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Weighted Answer Similarity Analysis 1 March 2025 at 11:54

Weighted Answer Similarity Analysis

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Nicholas Trout Kylie Gorney

1 March 2025 at 11:54

Applied Psychological Measurement, Ahead of Print.
Romero et al. (2015; see also Wollack, 1997) developed the ω statistic as a method for detecting unusually similar answers between pairs of examinees. For each pair, the ω statistic considers whether the observed number of similar answers is significantly larger than the expected number of similar answers. However, one limitation of ω is that it does not account for the particular items on which similar answers are observed. Therefore, in this study, we propose a weighted version of the ω statistic that takes this information into account. We compare the performance of the new and existing statistics using detailed simulations in which several factors are manipulated. Results show that while both the new and existing statistics are able to control the Type I error rate, the new statistic is more powerful, on average.

Impact of Parameter Predictability and Joint Modeling of Response Accuracy and Response Time on Ability Estimates

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Maryam Pezeshki Susan Embretson

26 February 2025 at 12:47

Applied Psychological Measurement, Ahead of Print.
To maintain test quality, a large supply of items is typically desired. Automatic item generation can result in a reduction in cost and labor, especially if the generated items have predictable item parameters and thus possibly reducing or eliminating the need for empirical tryout. However, the effect of different levels of item parameter predictability on the accuracy of trait estimation using item response theory models is unclear. If predictability is lower, adding response time as a collateral source of information may mitigate the effect on trait estimation accuracy. The present study investigates the impact of varying item parameter predictability on trait estimation accuracy, along with the impact of adding response time as a collateral source of information. Results indicated that trait estimation accuracy using item family model-based item parameters differed only slightly from using known item parameters. Somewhat larger trait estimation errors resulted from using cognitive complexity features to predict item parameters. Further, adding response times to the model resulted in more accurate trait estimation for tests with lower item difficulty levels (e.g., achievement tests). Implications for item generation and response processes aspect of validity are discussed.

Few and Different: Detecting Examinees With Preknowledge Using Extended Isolation Forests

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Nate R. Smith Lisa A. Keller Richard A. Feinberg Chunyan Liu

21 February 2025 at 04:14

Applied Psychological Measurement, Ahead of Print.
Item preknowledge refers to the case where examinees have advanced knowledge of test material prior to taking the examination. When examinees have item preknowledge, the scores that result from those item responses are not true reflections of the examinee’s proficiency. Further, this contamination in the data also has an impact on the item parameter estimates and therefore has an impact on scores for all examinees, regardless of whether they had prior knowledge. To ensure the validity of test scores, it is essential to identify both issues: compromised items (CIs) and examinees with preknowledge (EWPs). In some cases, the CIs are known, and the task is reduced to determining the EWPs. However, due to the potential threat to validity, it is critical for high-stakes testing programs to have a process for routinely monitoring for evidence of EWPs, often when CIs are unknown. Further, even knowing that specific items may have been compromised does not guarantee that any examinees had prior access to those items, or that those examinees that did have prior access know how to effectively use the preknowledge. Therefore, this paper attempts to use response behavior to identify item preknowledge without knowledge of which items may or may not have been compromised. While most research in this area has relied on traditional psychometric models, we investigate the utility of an unsupervised machine learning algorithm, extended isolation forest (EIF), to detect EWPs. Similar to previous research, the response behavior being analyzed is response time (RT) and response accuracy (RA).

Semi-Parametric Item Response Theory With O’Sullivan Splines for Item Responses and Response Time

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Chen-Wei Liu

3 February 2025 at 01:08

Applied Psychological Measurement, Ahead of Print.
Response time (RT) has been an essential resource for supplementing the estimation accuracy of latent traits and item parameters in educational testing. Most item response theory (IRT) approaches are based on parametric RT models. However, since test takers may alter their behaviors during a test due to motivation or strategy shifts, fatigue, or other causes, parametric IRT models are unlikely to capture such subtle and nonlinear information. In this work, we propose a novel semi-parametric IRT model with O’Sullivan splines to accommodate the flexible mean RT shapes and explore the underlying nonlinear relationships between latent traits and RT. A simulation study was conducted to demonstrate the substantial improvement in parameter estimation achieved by the new model, as well as the detriment of using parametric models in terms of biases and measurement errors. Using this model, a dataset of mathematics test scores and RT from the Programme for International Student Assessment was analyzed to demonstrate the evident nonlinearity and to compare the proposed model with existing models in terms of model fitting. The findings presented in this study indicate the promising nature of the new approach, suggesting its potential as an additional psychometric tool to enhance test reliability and reduce measurement errors.

Compound Optimal Design for Online Item Calibration Under the Two-Parameter Logistic Model

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Lihong Song Wenyi Wang

29 January 2025 at 05:24

Applied Psychological Measurement, Ahead of Print.
Under the theory of sequential design, compound optimal design with two optimality criteria can be used to solve the problem of efficient calibration of item parameters of item response theory model. In order to efficiently calibrate item parameters in computerized testing, a compound optimal design is proposed for the simultaneous estimation of item difficulty and discrimination parameters under the two-parameter logistic model, which adaptively focuses on optimizing the parameter which is difficult to estimate. The compound optimal design using the acceptance probability can provide ability design points to optimize the item difficulty and discrimination parameters, respectively. Simulation and real data analysis studies showed that the compound optimal design outperformed than the D-optimal and random design in terms of the recovery of both discrimination and difficulty parameters.

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Comparing Approaches to Estimating Person Parameters for the MUPP Model 28 January 2025 at 07:30

Comparing Approaches to Estimating Person Parameters for the MUPP Model

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: David M. LaHuis Caitlin E. Blackmore Gage M. Ammons

28 January 2025 at 07:30

Applied Psychological Measurement, Ahead of Print.
This study compared maximum a posteriori (MAP), expected a posteriori (EAP), and Markov Chain Monte Carlo (MCMC) approaches to computing person scores from the Multi-Unidimensional Pairwise Preference Model. The MCMC approach used the No-U-Turn sampling (NUTS). Results suggested the EAP with fully crossed quadrature and the NUTS outperformed the others when there were fewer dimensions. In addition, the NUTS produced the most accurate estimates in larger dimension conditions. The number of items per dimension had the largest effect on person parameter recovery.

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Application of Bayesian Decision Theory in Detecting Test Fraud 28 January 2025 at 12:07

Application of Bayesian Decision Theory in Detecting Test Fraud

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Sandip Sinharay Matthew S. Johnson

28 January 2025 at 12:07

Applied Psychological Measurement, Ahead of Print.
This article suggests a new approach based on Bayesian decision theory (e.g., Cronbach & Gleser, 1965; Ferguson, 1967) for detection of test fraud. The approach leads to a simple decision rule that involves the computation of the posterior probability that an examinee committed test fraud given the data. The suggested approach was applied to a real data set that involved actual test fraud.

R Package for Calculating Estimators of the Proportion of Explained Variance and Standardized Regression Coefficients in Multiply Imputed Datasets

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Joost R. van Ginkel Julian D. Karch

27 January 2025 at 02:12

Applied Psychological Measurement, Ahead of Print.

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
An Experimental Design to Investigate Item Parameter Drift 24 January 2025 at 08:41

An Experimental Design to Investigate Item Parameter Drift

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Peter Baldwin Irina Grabovsky Kimberly A. Swygert Thomas Fogle Pilar Reid Brian E. Clauser

24 January 2025 at 08:41

Applied Psychological Measurement, Ahead of Print.
Methods for detecting item parameter drift may be inadequate when every exposed item is at risk for drift. To address this scenario, a strategy for detecting item parameter drift is proposed that uses only unexposed items deployed in a stratified random method within an experimental design. The proposed method is illustrated by investigating unexpected score increases on a high-stakes licensure exam. Results for this example were suggestive of item parameter drift but not significant at the .05 level.

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Book Review: Generalized Kernel Equating with Applications in R 23 January 2025 at 03:55

Book Review: Generalized Kernel Equating with Applications in R

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Michela Battauz

23 January 2025 at 03:55

Applied Psychological Measurement, Ahead of Print.

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents
Adaptive Measurement of Change in the Context of Item Parameter Drift 30 December 2024 at 09:34

Adaptive Measurement of Change in the Context of Item Parameter Drift

SAGE Publications Inc: Applied Psychological Measurement: Table of Contents

By: Allison W. Cooperman Ming Him Tai Joseph N. DeWeese David J. Weiss

30 December 2024 at 09:34

Applied Psychological Measurement, Ahead of Print.
Adaptive measurement of change (AMC) uses computerized adaptive testing (CAT) to measure and test the significance of intraindividual change on one or more latent traits. The extant AMC research has so far assumed that item parameter values are constant across testing occasions. Yet item parameters might change over time, a phenomenon termed item parameter drift (IPD). The current study examined AMC’s performance in the context of IPD with unidimensional, dichotomous CATs across two testing occasions. A Monte Carlo simulation revealed that AMC false and true positive rates were primarily affected by changes in the difficulty parameter. False positive rates were related to the location of the drift items relative to the latent trait continuum, as the administration of more drift items spuriously increased the magnitude of estimated trait change. Moreover, true positive rates depended upon an interaction between the direction of difficulty parameter drift and the latent trait change trajectory. A follow-up simulation further showed that the number of items in the CAT with parameter drift impacted AMC false and true positive rates, with these relationships moderated by IPD characteristics and the latent trait change trajectory. It is recommended that test administrators confirm the absence of IPD prior to using AMC for measuring intraindividual change with educational and psychological tests.

Normal view