ERIC - Search Results

Showing all 5 results

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest-posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation
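
For orientation, the classical textbook formula for the reliability of a difference score D = Y - X can be sketched as below. This is a minimal illustration under classical test theory assumptions and is not necessarily the estimator studied in the article above; the function name, parameter names, and example values are hypothetical.

```python
def difference_score_reliability(sd_pre, sd_post, rel_pre, rel_post, corr_pre_post):
    """Classical reliability of the difference score D = post - pre.

    sd_pre, sd_post     : observed-score standard deviations
    rel_pre, rel_post   : reliabilities of the pretest and posttest scores
    corr_pre_post       : observed correlation between pretest and posttest
    """
    covariance = corr_pre_post * sd_pre * sd_post
    # True-score variance of the difference (errors assumed uncorrelated).
    numerator = sd_pre**2 * rel_pre + sd_post**2 * rel_post - 2 * covariance
    # Observed-score variance of the difference.
    denominator = sd_pre**2 + sd_post**2 - 2 * covariance
    if denominator <= 0:
        raise ValueError("difference scores have no observed variance")
    return numerator / denominator


# Hypothetical values: equal SDs, reliability 0.8 on both occasions,
# pretest-posttest correlation 0.5.
example = difference_score_reliability(1.0, 1.0, 0.8, 0.8, 0.5)
```

With these illustrative inputs the difference score is less reliable than either test alone, which is the usual motivation for studying difference-score reliability.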

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores
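
The abstract above describes test reliability as the ratio of true-score variance to observed-score variance. A minimal simulation sketch of that definition follows; it assumes normally distributed true scores and independent normal errors, and it is not the simulation design used in the article. All names and parameter values are hypothetical.

```python
import random
import statistics


def simulate_reliability(n=20000, true_sd=1.0, error_sd=0.5, seed=0):
    """Estimate reliability as var(true score) / var(observed score)."""
    rng = random.Random(seed)
    true_scores = [rng.gauss(0.0, true_sd) for _ in range(n)]
    # Observed score = true score + independent measurement error.
    observed = [t + rng.gauss(0.0, error_sd) for t in true_scores]
    return statistics.pvariance(true_scores) / statistics.pvariance(observed)


# Value implied by the assumed variances: 1.0 / (1.0 + 0.25) = 0.8.
theoretical = 1.0**2 / (1.0**2 + 0.5**2)
estimate = simulate_reliability()
```

The simulated estimate should land close to the theoretical value, with the gap shrinking as `n` grows.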

Test Review: Reynolds, C. R., Voress, J. V., Kamphaus, R. W. (2015), "Mathematics Fluency and Calculation Tests (MFaCTs) review." PRO-ED

Peer reviewed

Marbach, Joshua – Journal of Psychoeducational Assessment, 2017

The Mathematics Fluency and Calculation Tests (MFaCTs) are a series of measures designed to assess arithmetic calculation skills and calculation fluency in children ages 6 through 18. There are five main purposes of the MFaCTs: (1) identifying students who are behind in basic math fact automaticity; (2) evaluating possible delays in arithmetic…

Descriptors: Mathematics Tests, Computation, Mathematics Skills, Arithmetic

The Effects of Rater Severity and Rater Distribution on Examinees' Ability Estimation for Constructed-Response Items. Research Report. ETS RR-13-23

Peer reviewed

Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013

The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…

Descriptors: Test Format, Test Items, Responses, Computation

Higher Order Testlet Response Models for Hierarchical Latent Traits and Testlet-Based Items

Peer reviewed

Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2013

Both testlet design and hierarchical latent traits are fairly common in educational and psychological measurements. This study aimed to develop a new class of higher order testlet response models that consider both local item dependence within testlets and a hierarchy of latent traits. Due to high dimensionality, the authors adopted the Bayesian…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation