Showing all 9 results
Peer reviewed
Direct link
Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates various scoring methods, including number-correct scoring, IRT theta scoring, and hybrid scoring, in terms of scale-score stability over time. A simulation study examined the relative performance of five scoring methods in preserving the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation
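The contrast between number-correct and IRT theta scoring that this abstract draws can be made concrete with a small sketch. This is not the study's code: the 2PL item parameters, the single simulated examinee, and the grid-search MLE are all illustrative assumptions.

```python
# A minimal sketch (not the study's code) contrasting number-correct
# scoring with IRT theta scoring under an assumed 2PL model.
import numpy as np

rng = np.random.default_rng(0)
a = rng.uniform(0.8, 2.0, 20)      # discriminations (assumed)
b = rng.normal(0.0, 1.0, 20)       # difficulties (assumed)

def p_correct(theta, a, b):
    """2PL probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Simulate one examinee's responses at true theta = 0.5
responses = (rng.random(20) < p_correct(0.5, a, b)).astype(int)

number_correct = responses.sum()   # number-correct score

# Theta score: maximum-likelihood estimate over a coarse grid
grid = np.linspace(-4, 4, 801)
p = p_correct(grid[:, None], a, b)
loglik = (responses * np.log(p) + (1 - responses) * np.log(1 - p)).sum(axis=1)
theta_hat = grid[np.argmax(loglik)]

print(f"number-correct = {number_correct}, theta estimate = {theta_hat:.2f}")
```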
Peer reviewed
PDF on ERIC
Manna, Venessa F.; Gu, Lixiong – ETS Research Report Series, 2019
When using the Rasch model, equating with a nonequivalent groups anchor test design is commonly achieved by adjusting new-form item difficulties with an additive equating constant. Using simulated 5-year data, this report compares 4 approaches to calculating the equating constants and the subsequent impact on equating results. The 4 approaches…
Descriptors: Item Response Theory, Test Items, Test Construction, Sample Size
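The additive equating constant the abstract mentions can be illustrated with a minimal sketch. The mean/mean calculation below is only one possible approach of the four the report compares, and the anchor-item difficulty values are invented for illustration.

```python
# A minimal sketch, not the report's method: one common way to obtain an
# additive Rasch equating constant is the mean difference of anchor-item
# difficulties between the reference and new forms.
import numpy as np

b_anchor_ref = np.array([-1.2, -0.4, 0.1, 0.8, 1.5])   # reference-form estimates
b_anchor_new = np.array([-1.0, -0.2, 0.3, 1.0, 1.6])   # new-form estimates

# Mean/mean constant: shift that places new-form items on the reference scale
constant = (b_anchor_ref - b_anchor_new).mean()

b_new_form = np.array([-0.5, 0.2, 0.9])                 # non-anchor new-form items
b_new_on_ref_scale = b_new_form + constant              # additive adjustment

print(f"equating constant = {constant:.3f}")
```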
Peer reviewed
PDF on ERIC
Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019
Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high a items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items
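A minimal sketch of the "a"-stratified selection strategy (STR) the abstract describes, under assumed conditions: a small simulated bank, four equal-size strata, and a closest-difficulty selection rule within the current stratum. The USTR refinement is not shown.

```python
# A minimal sketch of "a"-stratified CAT item selection (STR); the bank,
# stratum boundaries, and selection rule are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)
a = rng.uniform(0.5, 2.5, 120)            # discriminations
b = rng.normal(0.0, 1.0, 120)             # difficulties

# Sort the bank by "a" and cut it into 4 strata of equal size
order = np.argsort(a)
strata = np.array_split(order, 4)         # low-a strata used first

def select_item(theta, stage, used):
    """Within the current stratum, pick the unused item whose b is closest to theta."""
    pool = [i for i in strata[stage] if i not in used]
    return min(pool, key=lambda i: abs(b[i] - theta))

used = set()
theta = 0.0                               # (a real CAT would re-estimate theta
for stage in range(4):                    #  after each response; omitted here)
    item = select_item(theta, stage, used)
    used.add(item)
    print(f"stage {stage}: item {item}, a = {a[item]:.2f}, b = {b[item]:.2f}")
```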
Peer reviewed
Direct link
Lee, Woo-yeol; Cho, Sun-Joo – Journal of Educational Measurement, 2017
Cross-level invariance in a multilevel item response model can be investigated by testing whether the within-level item discriminations are equal to the between-level item discriminations. Testing the cross-level invariance assumption is important for understanding constructs in multilevel data. However, in most multilevel item response model…
Descriptors: Test Items, Item Response Theory, Item Analysis, Simulation
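One rough way to picture the comparison the abstract describes is to contrast an item's within-level and between-level discrimination estimates. The Wald-type z test below is a stand-in, not the article's method, and the estimates, standard errors, and independence assumption are all hypothetical.

```python
# A minimal sketch (illustrative numbers, not the article's analysis): a
# Wald-type check of cross-level invariance for a single item.
import numpy as np
from scipy import stats

a_within, se_within = 1.20, 0.10    # hypothetical within-level estimate
a_between, se_between = 1.55, 0.18  # hypothetical between-level estimate

# Under invariance the two discriminations are equal; the estimates are
# treated as independent here purely for simplicity.
z = (a_within - a_between) / np.sqrt(se_within**2 + se_between**2)
p_value = 2 * stats.norm.sf(abs(z))

print(f"z = {z:.2f}, p = {p_value:.3f}")
```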
Soysal, Sümeyra; Arikan, Çigdem Akin; Inal, Hatice – Online Submission, 2016
This study investigates the effect of methods for handling missing data on item difficulty estimation under different test lengths and sample sizes. To this end, data sets with 10, 20, and 40 items and sample sizes of 100 and 5,000 were prepared. Deletion was applied at rates of 5%, 10%, and 20% under conditions…
Descriptors: Research Problems, Data Analysis, Item Response Theory, Test Items
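The design described in the abstract can be sketched under simplifying assumptions: responses are deleted completely at random at a 10% rate, classical difficulty (proportion correct) stands in for the IRT estimates, and listwise deletion is the only handling method shown.

```python
# A minimal sketch of the kind of design described; all data are simulated.
import numpy as np

rng = np.random.default_rng(2)
n_persons, n_items = 5000, 20
data = (rng.random((n_persons, n_items)) < 0.6).astype(float)

p_full = data.mean(axis=0)                 # difficulty with complete data

rate = 0.10                                # 10% missingness, completely at random
mask = rng.random(data.shape) < rate
incomplete = data.copy()
incomplete[mask] = np.nan

# Listwise deletion: drop every examinee with any missing response
complete_rows = incomplete[~np.isnan(incomplete).any(axis=1)]
p_listwise = complete_rows.mean(axis=0)

print(f"max |shift| in item difficulty: {np.abs(p_full - p_listwise).max():.3f}")
```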
Peer reviewed
PDF on ERIC
Guo, Hongwen; Puhan, Gautam; Walker, Michael – ETS Research Report Series, 2013
In this study we investigated when an equating conversion line is problematic in terms of gaps and clumps. We suggest using the conditional standard error of measurement (CSEM) to identify scale scores that are problematic in the overall raw-to-scale transformation.
Descriptors: Equated Scores, Test Items, Evaluation Criteria, Error of Measurement
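A minimal sketch of the idea, assuming an invented conversion table and a constant CSEM: flag places where adjacent raw scores map to scale scores further apart than the local CSEM (a "gap"); clumps could be flagged symmetrically where many raw scores map to nearly the same scale score.

```python
# A minimal sketch of CSEM-based flagging of gaps in a raw-to-scale
# conversion; the table and CSEM values are invented for illustration.
import numpy as np

raw = np.arange(0, 11)
scale = np.array([100, 102, 104, 110, 111, 112, 113, 120, 121, 122, 123])
csem = np.full(raw.shape, 2.5)             # assumed constant CSEM

gaps = np.diff(scale)                      # scale-score jump per raw-score step
flagged = np.where(gaps > csem[:-1])[0]    # jumps larger than the local CSEM

for i in flagged:
    print(f"raw {raw[i]} -> {raw[i+1]}: scale jumps {scale[i]} -> {scale[i+1]}")
```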
Peer reviewed
Direct link
van der Linden, Wim J. – Applied Psychological Measurement, 2006
Traditionally, error in equating observed scores on two versions of a test is defined as the difference between the transformations that equate the quantiles of their distributions in the sample and population of test takers. But it is argued that if the goal of equating is to adjust the scores of test takers on one version of the test to make…
Descriptors: Equated Scores, Evaluation Criteria, Models, Error of Measurement
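The quantile-matching transformation the abstract starts from can be sketched briefly; the samples and the simple percentile-rank implementation below are illustrative assumptions, not van der Linden's proposal.

```python
# A minimal sketch of the equipercentile (quantile) transformation: map a
# form X score to the form Y score with the same percentile rank.
import numpy as np

rng = np.random.default_rng(3)
x_scores = rng.normal(50, 10, 2000)        # form X sample (simulated)
y_scores = rng.normal(53, 9, 2000)         # form Y sample (simulated)

def equate(x):
    """Form X score -> form Y score with the same percentile rank."""
    pct = (x_scores <= x).mean() * 100.0
    return np.percentile(y_scores, pct)

print(f"X = 50 equates to Y = {equate(50.0):.1f}")
```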
Karkee, Thakur B.; Wright, Karen R. – Online Submission, 2004
Different item response theory (IRT) models may be employed for item calibration. Change of testing vendors, for example, may result in the adoption of a different model than that previously used with a testing program. To provide scale continuity and preserve cut score integrity, item parameter estimates from the new model must be linked to the…
Descriptors: Measures (Individuals), Evaluation Criteria, Testing, Integrity
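Linking new item parameter estimates to an established scale, as the abstract describes, is often done with a linear transformation. The mean/sigma sketch below is a generic stand-in, not necessarily the method the authors used, and all values are invented.

```python
# A minimal sketch of mean/sigma linking: place new difficulty estimates
# for a set of common items on the old calibration's scale.
import numpy as np

b_old = np.array([-1.5, -0.6, 0.0, 0.7, 1.4])   # common items, old calibration
b_new = np.array([-1.1, -0.3, 0.2, 0.9, 1.5])   # same items, new calibration

A = b_old.std(ddof=1) / b_new.std(ddof=1)        # slope of the linking line
B = b_old.mean() - A * b_new.mean()              # intercept

b_new_linked = A * b_new + B                     # new estimates on the old scale
print(f"A = {A:.3f}, B = {B:.3f}")
```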
Haladyna, Tom; Roid, Gale – 1976
Three approaches to the construction of achievement tests are compared: construct, operational, and empirical. The construct approach is based upon classical test theory and measures an abstract representation of the instructional objectives. The operational approach specifies instructional intent through instructional objectives, facet design,…
Descriptors: Academic Achievement, Achievement Tests, Career Development, Comparative Analysis