Validating a forced‑choice method for eliciting quality‑of‑reasoning judgments

Journal article

Marcoci, Alexandru, Stelmach, Margaret E., Rowe, Luke, Barnett, Ashley, Primoratz, Tamar, Kruger, Ariel, Karvetski, Christopher W., Stone, Benjamin, Diamond, Michael L., Saletta, Morgan, van Gelder, Tim, Tetlock, Philip E. and Dennis, Simon. (2023). Validating a forced‑choice method for eliciting quality‑of‑reasoning judgments. Behavior Research Methods. 56, pp. 4958-4973. https://doi.org/10.3758/s13428-023-02234-x

Publication dates
Authors	Marcoci, Alexandru, Stelmach, Margaret E., Rowe, Luke, Barnett, Ashley, Primoratz, Tamar, Kruger, Ariel, Karvetski, Christopher W., Stone, Benjamin, Diamond, Michael L., Saletta, Morgan, van Gelder, Tim, Tetlock, Philip E. and Dennis, Simon
Abstract	In this paper we investigate the criterion validity of forced-choice comparisons of the quality of written arguments with normative solutions. Across two studies, novices and experts assessing quality of reasoning through a forced-choice design were both able to choose arguments supporting more accurate solutions—62.2% (SE = 1%) of the time for novices and 74.4% (SE = 1%) for experts—and arguments produced by larger teams—up to 82% of the time for novices and 85% for experts—with high inter-rater reliability, namely 70.58% (95% CI = 1.18) agreement for novices and 80.98% (95% CI = 2.26) for experts. We also explored two methods for increasing efficiency. We found that the number of comparative judgments needed could be substantially reduced with little accuracy loss by leveraging transitivity and producing quality-of-reasoning assessments using an AVL tree method. Moreover, a regression model trained to predict scores based on automatically derived linguistic features of participants’ judgments achieved a high correlation with the objective accuracy scores of the arguments in our dataset. Despite the inherent subjectivity involved in evaluating differing quality of reasoning, the forced-choice paradigm allows even novice raters to perform beyond chance and can provide a valid, reliable, and efficient method for producing quality-of-reasoning assessments at scale.
Keywords	reasoning; quality of reasoning; comparative judgment; forced choice; automatic reasoning assessment
Year	2023
Journal	Behavior Research Methods
Journal citation	56, pp. 4958-4973
Publisher	Springer
ISSN	0743-3808
Digital Object Identifier (DOI)	https://doi.org/10.3758/s13428-023-02234-x
PubMed ID	37833511
Scopus EID	2-s2.0-85174061724
Web address (URL)	https://link.springer.com/article/10.3758/s13428-023-02234-x
Open access	Published as ‘gold’ (paid) open access
Research or scholarly	Research
Page range	4958-4973
Funder	Office of the Director of National Intelligence (ODNI), United States of America
	Intelligence Advanced Research Projects Activity (IARPA), United States of America
	The British Academy
	The Leverhulme Trust
Publisher's version	OA_Marcoci_2023_Validating_a_forced_choice_method_for.pdf License CC BY 4.0 File Access Level Open
Output status	Published
Online	13 Oct 2023
Publication process dates
Accepted	02 Sep 2023
Deposited	27 Nov 2023
Grant ID	16122000002
	SRG2223\231699
Additional information	© The Author(s) 2023.
	This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/

Permalink -

https://acuresearchbank.acu.edu.au/item/8zzq0/validating-a-forced-choice-method-for-eliciting-quality-of-reasoning-judgments

Download files

Publisher's version

	OA_Marcoci_2023_Validating_a_forced_choice_method_for.pdf
License: CC BY 4.0
File access level: Open

118
total views
55
total downloads
7
views this month
6
downloads this month

These values are for the period from 19th October 2020, when this repository was created.

Export as

Related outputs

Students can identify quality teachers, but can they distinguish between dimensions of quality teaching? A comparative analysis of the structure behind the Tripod Survey

Witter, Michael and Rowe, Luke. (2024). Students can identify quality teachers, but can they distinguish between dimensions of quality teaching? A comparative analysis of the structure behind the Tripod Survey. Educational Assessment. 29(4), pp. 251-273. https://doi.org/10.1080/10627197.2024.2414966

High-performing teams : Is collective intelligence the answer?

Rowe, Luke I., Hattie, John and Munro, John. (2024). High-performing teams : Is collective intelligence the answer? PLoS ONE. 19(8), p. Article e0307945. https://doi.org/10.1371/journal.pone.0307945

Coding and computational thinking across the curriculum : A review of educational outcomes

Mills, Kathy Ann, Cope, Jen, Scholes, Laura and Rowe, Luke. (2024). Coding and computational thinking across the curriculum : A review of educational outcomes. Review of Educational Research. pp. 1-38. https://doi.org/10.3102/00346543241241327

60% of Australian English teachers think video games are a ‘legitimate’ text to study. But only 15% have used one

Gutierrez, Amanda, Mills, Kathy Ann, Scholes, Laura and Rowe, Luke. (2024). 60% of Australian English teachers think video games are a ‘legitimate’ text to study. But only 15% have used one. The Conversation. pp. 1-4.

Research Through the Eyes of Teachers

Rowe, Luke and Hattie, John. (2023). Research Through the Eyes of Teachers. In In Their Own Words: What Scholars and Teachers WantYou to Know About Why and Howto Apply the Science of Learning inYour Academic Setting pp. 44-60 Society for the Teaching of Psychology.

What do secondary teachers think about digital games for learning : Stupid fixation or the future of education?

Gutierrez, Amanda, Mills, Kathy, Scholes, Laura, Rowe, Luke and Pink, Elizabeth. (2023). What do secondary teachers think about digital games for learning : Stupid fixation or the future of education? Teaching and Teacher Education. 133, p. Article 104278. https://doi.org/10.1016/j.tate.2023.104278

Spiritual and Pedagogical Accompaniment (SPA) program 2022

Gutierrez, Amanda and Rowe, Luke. (2023). Spiritual and Pedagogical Accompaniment (SPA) program 2022 Brisbane, Queensland: Australian Catholic University.

Spiritual and Pedagogical Accompaniment (SPA) program (2019-2021)

Gutierrez, Amanda and Rowe, Luke. (2022). Spiritual and Pedagogical Accompaniment (SPA) program (2019-2021) Brisbane, Queensland: Australian Catholic University.

Video gaming and digital competence among elementary school students

Scholes, Laura, Rowe, Luke, Mills, Kathy A., Gutierrez, Amanda and Pink, Elizabeth. (2022). Video gaming and digital competence among elementary school students. Learning, Media and Technology. pp. 1-16. https://doi.org/10.1080/17439884.2022.2156537

autopsych : An R Shiny tool for the reproducible Rasch analysis, differential item functioning, equating, and examination of group effects

Courtney, Matthew G.R., Chang, Kevin C.T., Mei, Bing, Meissel, Kane, Rowe, Luke and Issayeva, Laila B.. (2021). autopsych : An R Shiny tool for the reproducible Rasch analysis, differential item functioning, equating, and examination of group effects. PLoS ONE. 16(10), p. e0257682. https://doi.org/10.1371/journal.pone.0257682

g versus c : comparing individual and collective intelligence across two meta-analyses

Rowe, Luke I., Hattie, John and Hester, Robert. (2021). g versus c : comparing individual and collective intelligence across two meta-analyses. Cognitive Research: Principles and Implications. 6(1), p. Article 26. https://doi.org/10.1186/s41235-021-00285-2

Metacognition and self‑regulated learning

Rowe, Luke and Kang, Sean. (2019). Metacognition and self‑regulated learning Australia: Evidence for Learning.

Open dialogue peer review : A response to Claxton & Lucas

Hattie, John, Clinton, Janet and Rowe, Luke. (2016). Open dialogue peer review : A response to Claxton & Lucas. Psychology of Education Review. 40(1), pp. 30-37. https://doi.org/10.53841/bpsper.2016.40.1.30

Validating a forced‑choice method for eliciting quality‑of‑reasoning judgments

Download files

Publisher's version

118

55

7

6

Export as

Related outputs