ERIC - Search Results

Showing 1 to 15 of 35 results

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness

Effectiveness of Equating at the Passing Score for Exams with Small Sample Sizes

Peer reviewed

Wolkowitz, Amanda A.; Wright, Keith D. – Journal of Educational Measurement, 2019

This article explores the amount of equating error at a passing score when equating scores from exams with small sample sizes. This article focuses on equating using classical test theory methods of Tucker linear, Levine linear, frequency estimation, and chained equipercentile equating. Both simulation and real data studies were used in the…

Descriptors: Error Patterns, Sample Size, Test Theory, Test Bias

Examining Psychometric Properties and Level Classification of the van Hiele Geometry Test Using CTT and CDM Frameworks

Peer reviewed

Chen, Yi-Hsin; Senk, Sharon L.; Thompson, Denisse R.; Voogt, Kevin – Journal of Educational Measurement, 2019

The van Hiele theory and van Hiele Geometry Test have been extensively used in mathematics assessments across countries. The purpose of this study is to use classical test theory (CTT) and cognitive diagnostic modeling (CDM) frameworks to examine psychometric properties of the van Hiele Geometry Test and to compare how various classification…

Descriptors: Geometry, Mathematics Tests, Test Theory, Psychometrics

Analysis of Added Value of Subscores with Respect to Classification

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2014

Brennan noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. One way to interpret the method is that a subscore has added value…

Descriptors: Scores, Test Theory, Classification, Cutting Scores

More Issues in Observed-Score Equating

Peer reviewed

van der Linden, Wim J. – Journal of Educational Measurement, 2013

This article is a response to the commentaries on the position paper on observed-score equating by van der Linden (this issue). The response focuses on the more general issues in these commentaries, such as the nature of the observed scores that are equated, the importance of test-theory assumptions in equating, the necessity to use multiple…

Descriptors: Equated Scores, Test Theory, Transformations (Mathematics)

Comments on van der Linden's Critique and Proposal for Equating

Peer reviewed

Holland, Paul W. – Journal of Educational Measurement, 2013

While agreeing with van der Linden (this issue) that test equating needs better theoretical underpinnings, my comments criticize several aspects of his article. His examples are, for the most part, worthless; he does not use well-established terminology correctly; his view of 100 years of attempts to give a theoretical basis for equating is…

Descriptors: Equated Scores, Test Theory, Transformations (Mathematics), Computation

Some Conceptual Issues in Observed-Score Equating

Peer reviewed

van der Linden, Wim J. – Journal of Educational Measurement, 2013

In spite of all of the technical progress in observed-score equating, several of the more conceptual aspects of the process still are not well understood. As a result, the equating literature struggles with rather complex criteria of equating, lack of a test-theoretic foundation, confusing terminology, and ad hoc analyses. A return to Lord's…

Descriptors: Equated Scores, Statistical Analysis, Computation, Data Collection

How Often Do Subscores Have Added Value? Results from Operational and Simulated Data

Peer reviewed

Sinharay, Sandip – Journal of Educational Measurement, 2010

Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman suggested a method based on classical test theory to determine whether subscores have added value over total scores. In this article I first provide a rich collection of results regarding when subscores were found to have added…

Descriptors: Scores, Test Theory, Simulation, Reliability

Conceptual Issues in Response-Time Modeling

Peer reviewed

van der Linden, Wim J. – Journal of Educational Measurement, 2009

Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…

Descriptors: Test Items, Models, Reaction Time, Measurement

Detecting Differential Speededness in Multistage Testing

Peer reviewed

van der Linden, Wim J.; Breithaupt, Krista; Chuah, Siang Chee; Zhang, Yanwei – Journal of Educational Measurement, 2007

A potential undesirable effect of multistage testing is differential speededness, which happens if some of the test takers run out of time because they receive subtests with items that are more time intensive than others. This article shows how a probabilistic response-time model can be used for estimating differences in time intensities and speed…

Descriptors: Adaptive Testing, Evaluation Methods, Test Items, Reaction Time

A Comparative Study of Indices for Internal Consistency

Peer reviewed

Cudeck, Robert – Journal of Educational Measurement, 1980

Methods for evaluating the consistency of responses to test items were compared. When a researcher is unwilling to make the assumptions of classical test theory, has only a small number of items, or is in a tailored testing context, Cliff's dominance indices may be useful. (Author/CTM)

Descriptors: Error Patterns, Item Analysis, Test Items, Test Reliability

Demonstrating the Utility of the Standardization Approach to Assessing Unexpected Differential Item Performance on the Scholastic Aptitude Test

Peer reviewed

Dorans, Neil J.; Kulick, Edward – Journal of Educational Measurement, 1986

The standardization method for assessing unexpected differential item performance or differential item functioning is introduced. Findings of five studies are summarized, in which the statistical method of standardization is used to look for unexpected differences in item performance across different subpopulations of the Scholastic Aptitude Test.…

Descriptors: Groups, Item Analysis, Sociometric Techniques, Standardized Tests

On the Reliability of Categorically Scored Examinations

Peer reviewed

Kupermintz, Haggai – Journal of Educational Measurement, 2004

A decision-theoretic approach to the question of reliability in categorically scored examinations is explored. The concepts of true scores and errors are discussed as they deviate from conventional psychometric definitions and measurement error in categorical scores is cast in terms of misclassifications. A reliability measure based on…

Descriptors: Test Reliability, Error of Measurement, Psychometrics, Test Theory

On the Direct Measurement of Face Validity: A Comment on Nevo

Peer reviewed

Secolsky, Charles – Journal of Educational Measurement, 1987

For measuring the face validity of a test, Nevo suggested that test takers and nonprofessional users rate items on a five point scale. This article questions the ability of those raters and the credibility of the aggregated judgment as evidence of the validity of the test. (JAZ)

Descriptors: Content Validity, Measurement Techniques, Rating Scales, Test Items

On Interpreting Test Scores as Social Indicators: Statistical Considerations

Peer reviewed

Spencer, Bruce D. – Journal of Educational Measurement, 1983

Because test scores are ordinal, not cardinal, attributes, the average test score often is a misleading way to summarize the scores of a group of individuals. Similarly, correlation coefficients may be misleading summary measures of association between test scores. Proper, readily interpretable, summary statistics are developed from a theory of…

Descriptors: Correlation, Measurement Techniques, Scores, Statistical Analysis
