四参数Logistic加权模型下被试能力稳健估计

设为首页

收藏本站

网站地图 | English | 公务邮箱

远程访问

NSTL服务站

四参数Logistic加权模型下被试能力稳健估计

详细信息查看全文 | 推荐本文 |

英文篇名：The Ability Overestimation and Ability Underestimation of the Examinee under the Weighted-Score Logistic Model
作者：梅云 ; 简小珠 ; 刘建平
英文作者：Mei Yun;Jian Xiaozhu;Liu Jianping;School of Psychology, Jiangxi Normal University;Jiangxi Key Laboratory of Psychology and Cognitive Science;School of Education, Jinggangshan University;
关键词：Logistic加权模型 ; 猜测现象 ; 失误现象 ; 能力高估 ; 能力低估
英文关键词：weighted Logistic model;;guessing phenomena;;randon error phenomena;;ability overestimated;;ability underestimated
中文刊名：XLKX
英文刊名：Journal of Psychological Science
机构：江西师范大学心理学院;江西省心理与认知科学重点实验室;井冈山大学教育学院;
出版日期：2019-01-20
出版单位：心理科学
年：2019
期：v.42;No.237
基金：国家社会科学基金项目(14BSH071);; 江西省高校人文社会科学项目(XL1515)的资助
语种：中文;
页：XLKX201901025
页数：7
CN：01
ISSN：31-1582/B
分类号：165-171

摘要

设计项目参数、被试得分已知的测验情境,在两、三、四参数Logistic加权模型下进行能力估计,发现被试得分等级之间的能力步长存在着均匀的步长间距,被试得分能较好的反映多级记分的分数加权作用。两参数Logistic加权模型下会出现被试能力参数估计扰动现象,猜测现象会导致能力高估现象,失误现象会导致能力低估现象;三参数Logistic加权模型c型下能力高估现象未出现或不明显;三参数Logistic加权模型γ型下能力低估现象未出现或不明显;四参数Logistic加权模型下被试能力高估现象和低估现象都未出现或不明显,四参数Logistic加权模型是被试能力稳健性估计较好的方法。
The weighted-score logistic model(WSLM) was proposed by Jian, Dai, & Dai(2016). Based on the item emphases of the polytomously scored item, the WSLM model adds the weighted-score parameters into the dichotomous logistic model. Because the dichotomous model has five forms at least. Similarly, the weighted-score logistic model also has four forms, including the one-parameter weighted-score logistic model, the twoparameter weighted-score logistic model, the three-parameter weighted-score logistic model including c parameter, the three-parameter weighted-score logistic model including γ parameter, and the four-parameter weighted-score logistic model.There are response disturbances such as random guessing, carelessness, transcription error in the educational tests. In the paper and pencil testes or computerized adaptive testing, the aberrant responses such as careless errors and lucky guesses would cause significant ability estimation biases in previous studies. Mislevy & Bock(1982) proposed the Biweight estimator and made comparison between the Biweight estimator and maximum likelihood estimator. Results showed that the Biweight estimator could typically reduce Biases, thereby dispel measurement disturbances. And threeparameter logistic IRT model, four-parameter logistic IRT model, Huber robust estimation, and the other methods have therefore been proposed to address the response disturbance, including random guessing, carelessness, etc.The paper compares the robustifying ability estimates of the four models in an example of a test. The four models compared include twoparameter WSLM, three-parameter WSLM with c parameter, three-parameter WSLM with γ parameter, and four-parameter WSLM. Second, three simulation studies in three test cases are presented respectively, with the aim of comparing the four approaches, including 2 PM-MLE, Biweight estimation, Huber estimation and 4 PM-Robust estimation. The hypothetical test instrument contains 34 items, with difficulty thresholds b～N(0,1), and log(a)～N(0,1). The 35 th item with difficulty thresholds ranges from-4 to 4. The ability of the middle-ability examinee is estimated by the responses on the 34 items of the basic test under two-parameter logistic model, and the ability estimation is seen as the reference value for the other three models.Based on the two-parameter WSLM, the ability of the examinees will be overestimated when there is guessing phenomenon on the difficult items; Meanwhile, the ability of the examinees will be underestimated when there is sleeping phenomenon on the easy items. Secondly, for the threeparameter WSLM which contains c parameter, the overestimation phenomenon would be rectified, that is, the response disturbances would disappear. However, the underestimation phenomenon still exists when the examinees miss the easy items. Thirdly, for the three-parameter WSLM which contains γ parameter, the underestimation phenomenon would be rectified well when the examinees miss the easy items, that is, the response disturbances would disappear. But the overestimation phenomenon still exists when the examinees get the difficult items. Fourthly, for the four-parameter WSLM which contains c, γ parameter, the underestimation phenomenon would be rectified well when the examinees miss the easy items, and the overestimation phenomenon would also be rectified well when the low-ability examinees get the difficult items luckily, that is, the response disturbances would disappear. So, the examinee can get the ability robust estimation under the four-parameter WSLM when there are response disturbances such as random guessing and carelessness error in the tests.

引文

简小珠,戴步云,戴海琦.(2016).Logistic加权模型的理论构建与模拟分析.心理学报,48(12),1625-1630.
    简小珠,戴海琦.(2016).4参数GRM对猜测现象和失误现象的纠正.江西师范大学学报(自然科学版),40(2),140-144.
    简小珠,戴海崎,彭春妹.(2007).IRT中Logistic模型的c、γ参数对能力估计的改善.心理学报,39(4),737-746.
    漆书青,戴海崎,丁树良.(2002).现代教育与心理测量学原理.北京:高等教育出版社.
    张华华,程莹.(2005).计算机化自适应测验(CAT)的发展和前景展望.考试研究,1(1),12-24.
    Baker,F.B.,&Kim,S.H.(2004).Item response theory:Parameter estimation techniques.New York:Marcel Dekker,Inc.
    Barton,M.A.,&Lord,F.M.(1981).An upper asymptote for the three-parameter Logistic item response model(Tech.Rep.RR-81-20).Princeton,NJ:Educational Testing Service.
    Carlson,S.(2000).ETS finds flaws in the way online GRE rates some students.The Chronicle of Higher Education,47(8),A47.
    Chang,H.H.,&Ying,Z.L.(2002).To weight or not to weight-Balancing influence of initial and later items in adaptive testing.Paper presented at the Annual Meeting of National Council on Measurement in Education,New Orleans,LA.
    Embretson,S.E.,&Reise,S.P.(2000).Item response theory for psychologists.Mahwah,NJ:Lawrence Erlbaum Associates.
    Mislevy,R.J.,&Bock,R.D.(1982).Biweight estimates of latent ability.Educational and Psychological Measurement,42(3),725-737.
    Rulison,K.L.,&Loken,E.(2009).I've fallen and I can't get up:Can highability students recover from early mistakes in CAT?Applied Psychological Measurement,33(2),83-101.
    Schuster,C.,&Yuan,K.H.(2011).Robust estimation of latent ability in item response models.Journal of Educational and Behavioral Statistics,36(6),720-735.
    Wainer,H.,&Wright,B.D.(1980).Robust estimation of ability in the Rasch model.Psychometrika,45(3),373-391.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700