A new item response theory model for rater centrality using a hierarchical rater model approach

Journal article


Qiu, Xue-Lan, Chiu, Ming Ming, Wang, Wen-Chung and Chen, Po-Hsi. (2022). A new item response theory model for rater centrality using a hierarchical rater model approach. Behavior Research Methods. 54(4), pp. 1854-1868. https://doi.org/10.3758/s13428-021-01699-y
AuthorsQiu, Xue-Lan, Chiu, Ming Ming, Wang, Wen-Chung and Chen, Po-Hsi
Abstract

Rater centrality, in which raters overuse middle scores for rating, is a common rater error which can affect test scores and subsequent decisions. Past studies on rater errors have focused on rater severity and inconsistency, neglecting rater centrality. This study proposes a new model within the hierarchical rater model framework to explicitly specify and directly estimate rater centrality in addition to rater severity and inconsistency. Simulations were conducted using the freeware JAGS to evaluate the parameter recovery of the new model and the consequences of ignoring rater centrality. The results revealed that the model had good parameter recovery with small bias, low root mean square errors, and high test score reliability, especially when a fully crossed linking design was used. Ignoring centrality yielded poor item difficulty estimates, person ability estimates, rater errors estimates, and underestimated reliability. We also showcase how the new model can be used, using an empirical example involving English essays in the Advanced Placement exam.

Keywordsrater errors; centrality effect; hierarchical rater model; item response theory
Year2022
JournalBehavior Research Methods
Journal citation54 (4), pp. 1854-1868
PublisherSpringer
ISSN1554-351X
Digital Object Identifier (DOI)https://doi.org/10.3758/s13428-021-01699-y
PubMed ID34725802
Scopus EID2-s2.0-85118336119
Page range1854-1868
FunderResearch Grants Council of the Hong Kong Special Administrative Region, China
Publisher's version
License
All rights reserved
File Access Level
Controlled
Output statusPublished
Publication dates
Online01 Nov 2021
Publication process dates
Accepted29 Aug 2021
Deposited16 Nov 2023
Grant ID18613716
Permalink -

https://acuresearchbank.acu.edu.au/item/8zz25/a-new-item-response-theory-model-for-rater-centrality-using-a-hierarchical-rater-model-approach

Restricted files

Publisher's version

  • 31
    total views
  • 0
    total downloads
  • 2
    views this month
  • 0
    downloads this month
These values are for the period from 19th October 2020, when this repository was created.

Export as

Related outputs

Item Response Theory Models for Polytomous Multidimensional Forced-Choice Items to Measure Construct Differentiation
Qiu, Xuelan, de la Torre, Jimmy, Wang, You-Gan and Wu, Jinran. (2024). Item Response Theory Models for Polytomous Multidimensional Forced-Choice Items to Measure Construct Differentiation. Educational Measurement: Issues and Practice. pp. 1-12. https://doi.org/10.1111/emip.12621
An Iterative Scale Purification Procedure on lz for the Detection of Aberrant Responses
Qiu, Xuelan, Huang, Sheng-Yun, Wang, Wen-Chung and Wang, You-Gan. (2024). An Iterative Scale Purification Procedure on lz for the Detection of Aberrant Responses. Multivariate Behavioral Research. 59(1), pp. 62-77. https://doi.org/10.1080/00273171.2023.2211564
A dual process item response theory model for polytomous multidimensional forced-choice items
Qiu, Xuelan and de la Torre, Jimmy. (2023). A dual process item response theory model for polytomous multidimensional forced-choice items. British Journal of Mathematical and Statistical Psychology. 76(3), pp. 491-512. https://doi.org/10.1111/bmsp.12303
Computerized adaptive testing for ipsative tests with multidimensional pairwise-comparison items : Algorithm development and applications
Qiu, Xue-Lan, de la Torre, Jimmy, Ro, Sage and Wang, Wen-Chung. (2022). Computerized adaptive testing for ipsative tests with multidimensional pairwise-comparison items : Algorithm development and applications. Applied Psychological Measurement. 46(4), pp. 255-272. https://doi.org/10.1177/01466216221084209
An empirical Q-Matrix validation method for the polytomous G-DINA model
de la Torre, Jimmy, Qiu, Xue-Lan and Santos, Kevin Carl. (2022). An empirical Q-Matrix validation method for the polytomous G-DINA model. Psychometrika. 87(2), pp. 693-724. https://doi.org/10.1007/s11336-021-09821-x
Equity in mathematics education in Hong Kong : Evidence from TIMSS 2011 to 2019
Qiu, Xue-Lan and Leung, Frederick K. S.. (2022). Equity in mathematics education in Hong Kong : Evidence from TIMSS 2011 to 2019. Large-scale Assessments in Education. 10(1), p. Article 3. https://doi.org/10.1186/s40536-022-00121-z
Assessment of differential statement functioning in ipsative tests with multidimensional forced-choice items
Qiu, Xue-Lan and Wang, Wen-Chung. (2021). Assessment of differential statement functioning in ipsative tests with multidimensional forced-choice items. Applied Psychological Measurement. 45(2), pp. 79-94. https://doi.org/10.1177/0146621620965739
Student self-assessment : Why do they do it?
Yan, Zi, Brown, Gavin T. L., Lee, John Chi-Kin and Qiu, Xue-Lan. (2020). Student self-assessment : Why do they do it? Educational Psychology. 40(4), pp. 509-532. https://doi.org/10.1080/01443410.2019.1672038
Measuring Dynamic Goals for Marriage : Development and Validation of the Marital Goal Scale Using Rasch Modeling
Li, Tianyuan, Hiu-Ling Tsang, Vivian, Fung, H, Qiu, Xuelan and Wang, Wen-Chung. (2019). Measuring Dynamic Goals for Marriage : Development and Validation of the Marital Goal Scale Using Rasch Modeling. Psychological Assessment. 32(3), pp. 211-226. https://doi.org/10.1037/pas0000779
Multilevel modeling of cognitive diagnostic assessment : The multilevel DINA example
Wang, Wen-Chung and Qiu, Xue-Lan. (2019). Multilevel modeling of cognitive diagnostic assessment : The multilevel DINA example. Applied Psychological Measurement. 43(1), pp. 34-50. https://doi.org/10.1177/0146621618765713
Item response theory modeling for examinee-selected items with rater effect
Liu, Chen-Wei, Qiu, Xue-Lan and Wang, Wen-Chung. (2019). Item response theory modeling for examinee-selected items with rater effect. Applied Psychological Measurement. 43(6), pp. 435-448. https://doi.org/10.1177/0146621618798667