ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	9

Descriptor

Comparative Analysis	15
Error of Measurement	15
Test Theory	15
Item Response Theory	9
Achievement Tests	4
Models	4
Test Items	4
Item Analysis	3
Mathematical Models	3
Measurement Techniques	3
Scores	3
Test Interpretation	3
Test Reliability	3
True Scores	3
Career Development	2
College Entrance Examinations	2
Computation	2
Correlation	2
Criterion Referenced Tests	2
Equated Scores	2
Equations (Mathematics)	2
Evaluation Criteria	2
Foreign Countries	2
Generalizability Theory	2
Norm Referenced Tests	2
More ▼

Source

Applied Psychological…	2
ACT, Inc.	1
Applied Measurement in…	1
Educational and Psychological…	1
Health Education Research	1
International Journal of…	1
International Online Journal…	1
Journal of Educational…	1
Research in the Schools	1

Publication Type

Reports - Research	10
Journal Articles	9
Reports - Evaluative	4
Reports - Descriptive	2
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Postsecondary Education	2
Elementary Secondary Education	1

Audience

Location

Turkey

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	2
Gates MacGinitie Reading Tests	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Comparison of Performance Measures Obtained from Foreign Language Tests According to Item Response Theory vs Classical Test Theory

Peer reviewed
PDF on ERIC

Download full text

Polat, Murat – International Online Journal of Education and Teaching, 2022

Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…

Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests

Comparison of IRT and CTT Using Secondary School Reading Comprehension Assessments

Peer reviewed

Direct link

Coggins, Joanne V.; Kim, Jwa K.; Briggs, Laura C. – Research in the Schools, 2017

The Gates-MacGinitie Reading Comprehension Test, fourth edition (GMRT-4) and the ACT Reading Tests (ACT-R) were administered to 423 high school students in order to explore the similarities and dissimilarities of data produced through classical test theory (CTT) and item response theory (IRT) analysis. Despite the many advantages of IRT…

Descriptors: Item Response Theory, Test Theory, Reading Comprehension, Reading Tests

The Comparison of Item Parameters Estimated from Parametric and Nonparametric Item Response Theory Models in Case of the Violance of Local Independence Assumption

Peer reviewed
PDF on ERIC

Download full text

Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019

Item response theory (IRT) has so many advantages than its precedent Classical Test Theory (CTT) such as non-changing item parameters, ability parameter estimations free from the items. However, in order to get these advantages, some assumptions should be met and they are; unidimensionality, normality and local independence. However, it is not…

Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models

On the Relationship between Classical Test Theory and Item Response Theory: From One to the Other and Back

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2016

The frequently neglected and often misunderstood relationship between classical test theory and item response theory is discussed for the unidimensional case with binary measures and no guessing. It is pointed out that popular item response models can be directly obtained from classical test theory-based models by accounting for the discrete…

Descriptors: Test Theory, Item Response Theory, Models, Correlation

The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

Peer reviewed

Direct link

Culpepper, Steven Andrew – Applied Psychological Measurement, 2013

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…

Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Download full text

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

Generalizability Theory and Classical Test Theory

Peer reviewed

Direct link

Brennan, Robert L. – Applied Measurement in Education, 2011

Broadly conceived, reliability involves quantifying the consistencies and inconsistencies in observed scores. Generalizability theory, or G theory, is particularly well suited to addressing such matters in that it enables an investigator to quantify and distinguish the sources of inconsistencies in observed scores that arise, or could arise, over…

Descriptors: Generalizability Theory, Test Theory, Test Reliability, Item Response Theory

Tolerance Intervals for True Scores.

Peer reviewed

Jarjoura, David – Journal of Educational Statistics, 1985

Issues regarding tolerance and confidence intervals are discussed within the context of educational measurement, and conceptual distinctions are drawn between these two types of intervals. Points are raised about the advantages of tolerance intervals when the focus is on a particular observed score rather than a particular examinee. (Author/BW)

Descriptors: Comparative Analysis, Error of Measurement, Mathematical Models, Test Interpretation

Equating Error in Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Applied Psychological Measurement, 2006

Traditionally, error in equating observed scores on two versions of a test is defined as the difference between the transformations that equate the quantiles of their distributions in the sample and population of test takers. But it is argued that if the goal of equating is to adjust the scores of test takers on one version of the test to make…

Descriptors: Equated Scores, Evaluation Criteria, Models, Error of Measurement

Improving Measurement in Health Education and Health Behavior Research Using Item Response Modeling: Comparison with the Classical Test Theory Approach

Peer reviewed

Direct link

Wilson, Mark; Allen, Diane D.; Li, Jun Corser – Health Education Research, 2006

This paper compares the approach and resultant outcomes of item response models (IRMs) and classical test theory (CTT). First, it reviews basic ideas of CTT, and compares them to the ideas about using IRMs introduced in an earlier paper. It then applies a comparison scheme based on the AERA/APA/NCME "Standards for Educational and…

Descriptors: Health Education, Self Efficacy, Health Behavior, Measures (Individuals)

Comparing Measurement Theories.

Download full text

Schumacker, Randall E. – 1998

In comparing measurement theories, it is evident that the awareness of the concept of measurement error during the time of Galileo has lead to the formulation of observed scores comprising a true score and error (classical theory), universe score and various random error components (generalizability theory), or individual latent ability and error…

Descriptors: Comparative Analysis, Computer Software, Error of Measurement, Generalizability Theory

Direct and Indirect Equating: A Comparison of Four Methods Using the Rasch Model.

Download full text

Morrison, Carol A.; Fitzpatrick, Steven J. – 1992

An attempt was made to determine which item response theory (IRT) equating method results in the least amount of equating error or "scale drift" when equating scores across one or more test forms. An internal anchor test design was employed with five different test forms, each consisting of 30 items, 10 in common with the base test and 5…

Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Error of Measurement

An Alternative Interpretation of Three Stability Models. Measurement and Methodology, Work Unit 2: Technical Adequacy of Tests.

Wilcox, Rand R. – 1978

Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…

Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models

The Paradox of Criterion-Referenced Measurement.

Download full text

Haladyna, Tom – 1976

The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between two alternative forms of measurement, CR and Norm Referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveal that the common…

Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis

A Theoretical and Empirical Comparison of Three Approaches to Achievement Testing.

Haladyna, Tom; Roid, Gale – 1976

Three approaches to the construction of achievement tests are compared: construct, operational, and empirical. The construct approach is based upon classical test theory and measures an abstract representation of the instructional objectives. The operational approach specifies instructional intent through instructional objectives, facet design,…

Descriptors: Academic Achievement, Achievement Tests, Career Development, Comparative Analysis

Haladyna, Tom	2
Allen, Diane D.	1
Brennan, Robert L.	1
Briggs, Laura C.	1
Coggins, Joanne V.	1
Cui, Zhongmin	1
Culpepper, Steven Andrew	1
Dirlik, Ezgi Mor	1
Fang, Yu	1
Fitzpatrick, Steven J.	1
Jarjoura, David	1
Kim, Jwa K.	1
Li, Jun Corser	1
Marcoulides, George A.	1
Morrison, Carol A.	1
Polat, Murat	1
Raykov, Tenko	1
Roid, Gale	1
Schumacker, Randall E.	1
Traynor, Anne	1
Wilcox, Rand R.	1
Wilson, Mark	1
Woodruff, David	1
van der Linden, Wim J.	1
More ▼