ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	4

Descriptor

Educational Testing	7
Test Length	7
Item Response Theory	4
Testing Problems	3
Computer Assisted Testing	2
Goodness of Fit	2
Measurement	2
Measurement Techniques	2
Psychological Testing	2
Research Methodology	2
Simulation	2
Test Bias	2
Test Items	2
Test Theory	2
Accuracy	1
Benchmarking	1
Classification	1
Cognitive Tests	1
Comparative Analysis	1
Correlation	1
Cutting Scores	1
Diagnostic Tests	1
Difficulty Level	1
Educational Assessment	1
Equated Scores	1
More ▼

Source

Journal of Educational…	2
ETS Research Report Series	1
Education Sciences	1
Educational and Psychological…	1
Evaluation in Education:…	1
Popular Measurement	1

Author

Cui, Ying	2
Bergstrom, Betty	1
Deville, Craig	1
Dorans, Neil J.	1
Gershon, Richard C.	1
Guo, Hongwen	1
Leighton, Jacqueline P.	1
Mousavi, Amin	1
Munoz-Sandoval, Ana	1
O'Neill, Thomas	1
Van Der Linden, Wim J.	1
Veldkamp, Bernard P.	1
Woodcock, Richard W.	1
Wright, Benjamin D.	1
van der Linden, Wim J.	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	4
Reports - Evaluative	3
Collected Works - General	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

A Note on Using Weighted Sum Scores in the P-DIF Statistic. Research Report. ETS RR-19-32

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019

The Mantel-Haenszel delta difference (MH D-DIF) and the standardized proportion difference (STD P-DIF) are two observed-score methods that have been used to assess differential item functioning (DIF) at Educational Testing Service since the early 1990s. Latentvariable approaches to assessing measurement invariance at the item level have been…

Descriptors: Test Bias, Educational Testing, Statistical Analysis, Item Response Theory

The Effect of Person Misfit on Item Parameter Estimation and Classification Accuracy: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Mousavi, Amin; Cui, Ying – Education Sciences, 2020

Often, important decisions regarding accountability and placement of students in performance categories are made on the basis of test scores generated from tests, therefore, it is important to evaluate the validity of the inferences derived from test results. One of the threats to the validity of such inferences is aberrant responding. Several…

Descriptors: Student Evaluation, Educational Testing, Psychological Testing, Item Response Theory

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Direct link

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

The Use of Moment Estimators for Mixtures of Two Binomials with One Known Success Parameter.

Peer reviewed

Van Der Linden, Wim J. – Educational and Psychological Measurement, 1983

This paper focuses on mixtures of two binomials with one known success parameter. It is shown how moment estimators can be obtained for the remaining unknown parameters of such mixtures, and results are presented from a Monte Carlo study carried out to explore the statistical properties of these estimators. (PN)

Descriptors: Educational Testing, Error of Measurement, Estimation (Mathematics), Guessing (Tests)

Passing Score and Length of a Mastery Test.

van der Linden, Wim J. – Evaluation in Education: International Progress, 1982

In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)

Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models

Testing Testing Testing.

Peer reviewed

Deville, Craig; O'Neill, Thomas; Wright, Benjamin D.; Woodcock, Richard W.; Munoz-Sandoval, Ana; Gershon, Richard C.; Bergstrom, Betty – Popular Measurement, 1998

Articles in this special section consider (1) flow in test taking (Craig Deville); (2) testwiseness (Thomas O'Neill); (3) test length (Benjamin Wright); (4) cross-language test equating (Richard W. Woodcock and Ana Munoz-Sandoval); (5) computer-assisted testing and testwiseness (Richard Gershon and Betty Bergstrom); and (6) Web-enhanced testing…

Descriptors: Computer Assisted Testing, Educational Testing, Equated Scores, Measurement Techniques