ERIC - Search Results

Showing 1 to 15 of 35 results

Assessing the Impact of Equating Error on Group Means and Group Mean Differences

Peer reviewed

Li, Dongmei – Journal of Educational Measurement, 2022

Equating error is usually small relative to the magnitude of measurement error, but it could be one of the major sources of error contributing to mean scores of large groups in educational measurement, such as the year-to-year state mean score fluctuations. Though testing programs may routinely calculate the standard error of equating (SEE), the…

Descriptors: Error Patterns, Educational Testing, Group Testing, Statistical Analysis

Measuring the Uncertainty of Imputed Scores

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2023

Technical difficulties and other unforeseen events occasionally lead to incomplete data on educational tests, which necessitates the reporting of imputed scores to some examinees. While there exist several approaches for reporting imputed scores, there is a lack of any guidance on the reporting of the uncertainty of imputed scores. In this paper,…

Descriptors: Evaluation Methods, Scores, Standardized Tests, Simulation

Performance of Person-Fit Statistics under Model Misspecification

Peer reviewed

Hong, Seong Eun; Monroe, Scott; Falk, Carl F. – Journal of Educational Measurement, 2020

In educational and psychological measurement, a person-fit statistic (PFS) is designed to identify aberrant response patterns. For parametric PFSs, valid inference depends on several assumptions, one of which is that the item response theory (IRT) model is correctly specified. Previous studies have used empirical data sets to explore the effects…

Descriptors: Educational Testing, Psychological Testing, Goodness of Fit, Error of Measurement

A New Person-Fit Statistic for the Lognormal Model for Response Times

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2018

Response-time models are of increasing interest in educational and psychological testing. This article focuses on the lognormal model for response times, which is one of the most popular response-time models, and suggests a simple person-fit statistic for the model. The distribution of the statistic under the null hypothesis of no misfit is proved…

Descriptors: Reaction Time, Educational Testing, Psychological Testing, Models

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

Comparisons among Designs for Equating Mixed-Format Tests in Large-Scale Assessments

Peer reviewed

Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010

In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…

Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias

Performance of the Generalized S-X[Superscript 2] Item Fit Index for Polytomous IRT Models

Peer reviewed

Kang, Taehoon; Chen, Troy T. – Journal of Educational Measurement, 2008

Orlando and Thissen's S-X[superscript 2] item fit index has performed better than traditional item fit statistics such as Yen's Q[subscript 1] and McKinley and Mills' G[superscript 2] for dichotomous item response theory (IRT) models. This study extends the utility of S-X[superscript 2] to polytomous IRT models, including the generalized partial…

Descriptors: Item Response Theory, Models, Rating Scales, Generalization

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

The Effect of Distractions on Sixth-Grade Students in a Testing Situation

Peer reviewed

Trentham, Landa L. – Journal of Educational Measurement, 1975

Descriptors: Comparative Testing, Educational Testing, Elementary Education, Grade 6

Model-Free CUSUM Methods for Person Fit

Peer reviewed

Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009

This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…

Descriptors: Probability, Simulation, Models, Psychometrics

Modeling Diagnostic Assessments with Bayesian Networks

Peer reviewed

Almond, Russell G.; DiBello, Louis V.; Moulder, Brad; Zapata-Rivera, Juan-Diego – Journal of Educational Measurement, 2007

This paper defines Bayesian network models and examines their applications to IRT-based cognitive diagnostic modeling. These models are especially suited to building inference engines designed to be synchronous with the finer grained student models that arise in skills diagnostic assessment. Aspects of the theory and use of Bayesian network models…

Descriptors: Inferences, Models, Item Response Theory, Cognitive Measurement

Standards for Educational & Psychological Tests

Peer reviewed

Lennon, Roger T. – Journal of Educational Measurement, 1975

Reviews the 1974 Standards, an updating serving as a guide to test making and publishing, and training of persons for these endeavors. (DEP)

Descriptors: Educational Testing, Psychological Testing, Scoring, Standards

Evaluating Comparability in Computerized Adaptive Testing: Issues, Criteria and an Example

Peer reviewed

Wang, Tianyou; Kolen, Michael J. – Journal of Educational Measurement, 2001

Reviews research literature on comparability issues in computerized adaptive testing (CAT) and synthesizes issues specific to comparability and test security. Develops a framework for evaluating comparability that contains three categories of criteria: (1) validity; (2) psychometric property/reliability; and (3) statistical assumption/test…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Criteria
