ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	3

Descriptor

Design	3
Test Items	2
Test Reliability	2
Adaptive Testing	1
Computation	1
Computer Assisted Testing	1
Correlation	1
Data Use	1
Interrater Reliability	1
Intervals	1
Pretests Posttests	1
Programming	1
Scores	1
Simulation	1
Statistical Data	1
Statistical Inference	1
Test Bias	1
Test Construction	1
Test Validity	1
More ▼

Source

Journal of Educational and…

Author

Bonett, Douglas G.	1
Emons, Wilco H. M.	1
Gu, Zhengguo	1
Hsiu-Yi Chao	1
Jyun-Hong Chen	1
Sijtsma, Klaas	1

Publication Type

Journal Articles	3
Reports - Research	2
Reports - Evaluative	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 3 results Save | Export

Statistical Inference for G-Indices of Agreement

Peer reviewed

Direct link

Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2022

The limitations of Cohen's ? are reviewed and an alternative G-index is recommended for assessing nominal-scale agreement. Maximum likelihood estimates, standard errors, and confidence intervals for a two-rater G-index are derived for one-group and two-group designs. A new G-index of agreement for multirater designs is proposed. Statistical…

Descriptors: Statistical Inference, Statistical Data, Interrater Reliability, Design

Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design

Peer reviewed

Direct link

Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024

To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…

Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Direct link

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest--posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation