ERIC - Search Results

Showing all 15 results

On the Relationship between Classical Test Theory and Item Response Theory: From One to the Other and Back

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2016

The frequently neglected and often misunderstood relationship between classical test theory and item response theory is discussed for the unidimensional case with binary measures and no guessing. It is pointed out that popular item response models can be directly obtained from classical test theory-based models by accounting for the discrete…

Descriptors: Test Theory, Item Response Theory, Models, Correlation

Coefficient Alpha and Reliability of Scale Scores

Peer reviewed

Almehrizi, Rashid S. – Applied Psychological Measurement, 2013

The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…

Descriptors: Raw Scores, Scaling, Reliability, Computation

Measuring Student Involvement: A Comparison of Classical Test Theory and Item Response Theory in the Construction of Scales from Student Surveys

Peer reviewed

Sharkness, Jessica; DeAngelo, Linda – Research in Higher Education, 2011

This study compares the psychometric utility of Classical Test Theory (CTT) and Item Response Theory (IRT) for scale construction with data from higher education student surveys. Using 2008 Your First College Year (YFCY) survey data from the Cooperative Institutional Research Program at the Higher Education Research Institute at UCLA, two scales…

Descriptors: Student Surveys, Measures (Individuals), Psychometrics, Item Response Theory

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

On Applications of Rasch Models in International Comparative Large-Scale Assessments: A Historical Review

Peer reviewed

Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011

Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models" to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…

Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing

What Constitutes Legitimate Causal Linking?

Peer reviewed

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Classical Test Theory and Item Response Theory: Analytical and Empirical Comparisons.

Hwang, Dae-Yeop – 2002

This study compared classical test theory (CTT) and item response theory (IRT). The behavior of the item and person statistics derived from these two measurement frameworks was examined analytically and empirically using a data set obtained from BILOG (R. Mislevy and R. D. Bock, 1997). The example was a 15-item test with a sample size of 600…

Descriptors: Comparative Analysis, Measurement Techniques, Scores, Statistical Distributions

Comparing Measurement Theories.

Schumacker, Randall E. – 1998

In comparing measurement theories, it is evident that the awareness of the concept of measurement error during the time of Galileo has led to the formulation of observed scores comprising a true score and error (classical theory), universe score and various random error components (generalizability theory), or individual latent ability and error…

Descriptors: Comparative Analysis, Computer Software, Error of Measurement, Generalizability Theory

Binomial Test Models for Domain-Referenced Testing.

van den Brink, Wulfert – Evaluation in Education: International Progress, 1982

Binomial models for domain-referenced testing are compared, emphasizing the assumptions underlying the beta-binomial model. Advantages and disadvantages are discussed. A proposed item sampling model is presented which takes the effect of guessing into account. (Author/CM)

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Sampling, Measurement Techniques

Confirmatory Measurement Model Comparisons Using Latent Means.

Peer reviewed

Millsap, Roger E.; Everson, Howard – Multivariate Behavioral Research, 1991

Use of confirmatory factor analysis (CFA) with nonzero latent means in testing six different measurement models from classical test theory is discussed. Implications of the six models for observed mean and covariance structures are described, and three examples of the use of CFA in testing the models are presented. (SLD)

Descriptors: Comparative Analysis, Equations (Mathematics), Goodness of Fit, Mathematical Models

The Unnumbered Graphic Scale as a Data-Collection Method: An Investigation Comparing Three Measurement Strategies in the Context of Q-Technique Factor Analysis.

Thompson, Bruce; Dennings, Bruce – 1993

Q-technique factor analysis identifies clusters or factors of people, rather than of variables, and has proven very popular, especially with regard to testing typology theories. The present study investigated the utility of three different protocols for obtaining data for Q-technique studies. These three protocols were: (1) a conventional ipsative…

Descriptors: Classification, Comparative Analysis, Data Collection, Factor Analysis

Practice and Problems in Language Testing 5. Non-Classical Test Theory; Final Examinations in Secondary Schools. Papers Presented at the International Language Testing Symposium (5th, Arnhem, Netherlands, March 25-26, 1982).

van Weeren, J., Ed. – 1983

Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…

Descriptors: Adolescents, Comparative Analysis, Comparative Education, Difficulty Level

A Theoretical and Empirical Comparison of Three Approaches to Achievement Testing.

Haladyna, Tom; Roid, Gale – 1976

Three approaches to the construction of achievement tests are compared: construct, operational, and empirical. The construct approach is based upon classical test theory and measures an abstract representation of the instructional objectives. The operational approach specifies instructional intent through instructional objectives, facet design,…

Descriptors: Academic Achievement, Achievement Tests, Career Development, Comparative Analysis
