ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	13
Since 2006 (last 20 years)	30

Descriptor

Computation	30
Test Theory	30
Item Response Theory	13
Scores	10
Statistical Analysis	10
Test Items	10
Comparative Analysis	8
Reliability	8
Error of Measurement	7
Models	6
Correlation	5
Difficulty Level	5
Equations (Mathematics)	5
Test Reliability	5
Foreign Countries	4
Multiple Choice Tests	4
Simulation	4
College Entrance Examinations	3
Elementary School Students	3
Factor Analysis	3
Grade 2	3
Maximum Likelihood Statistics	3
Psychometrics	3
Scaling	3
Test Construction	3

Source

Applied Psychological…	5
Educational and Psychological…	3
Journal of Educational…	3
Applied Measurement in…	2
Behavioral Research and…	2
Educational Measurement:…	2
Journal of Educational and…	2
ProQuest LLC	2
ACT, Inc.	1
Advances in Physiology…	1
Annenberg Institute for…	1
International Journal of…	1
Journal of Science Education…	1
Journal of Special Education	1
Measurement:…	1
Practical Assessment,…	1
Research Papers in Education	1

Publication Type

Journal Articles	24
Reports - Research	14
Reports - Evaluative	7
Reports - Descriptive	6
Dissertations/Theses -…	2
Numerical/Quantitative Data	2
Opinion Papers	1

Education Level

Elementary Education	7
Higher Education	5
Postsecondary Education	4
Early Childhood Education	3
Grade 2	3
Grade 4	3
Grade 5	3
Grade 6	3
Grade 7	3
Grade 8	3
Middle Schools	3
Primary Education	3
Secondary Education	3
Grade 3	2
Junior High Schools	2
Grade 1	1
Intermediate Grades	1

Location

Australia	1
Colorado	1
Florida	1
Germany	1
New York	1
North Carolina	1
Sweden	1
Tennessee	1
Texas	1
United Kingdom (England)	1
United States	1

Assessments and Surveys

ACT Assessment	1
Eysenck Personality Inventory	1
Law School Admission Test	1
National Assessment of…	1

Showing 1 to 15 of 30 results

Rasch Measurement v. Item Response Theory: Knowing When to Cross the Line

Peer reviewed
PDF on ERIC

Download full text

Stemler, Steven E.; Naples, Adam – Practical Assessment, Research & Evaluation, 2021

When students receive the same score on a test, does that mean they know the same amount about the topic? The answer to this question is more complex than it may first appear. This paper compares classical and modern test theories in terms of how they estimate student ability. Crucial distinctions between the aims of Rasch Measurement and IRT are…

Descriptors: Item Response Theory, Test Theory, Ability, Computation

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

Direct link

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness

A Closed-Form Alternative for Estimating [omega] Reliability under Unidimensionality

Peer reviewed

Direct link

Hancock, Gregory R.; An, Ji – Measurement: Interdisciplinary Research and Perspectives, 2020

As an alternative to Cronbach's [alpha] for estimating scale reliability, McDonald's [omega] has attracted increased attention within the methodological community for its less stringent measurement assumptions. Notwithstanding, [omega] is still seldom used by practitioners, likely due to its unavailability in popular software packages (e.g., SPSS)…

Descriptors: Evaluation, Alternative Assessment, Reliability, Test Reliability

Cluster-Robust Variance Estimators for Binary Observations in Heterogeneous Groups and Their Application to Psychometric Analyses of Repeated Measures

Direct link

Sarah Marie Marquis – ProQuest LLC, 2020

This dissertation is composed of a study of estimation methods in classical and modern test theories and the elaboration and application of a cluster-robust variance estimator. Variance estimators derived from generalized estimating equations are known to be robust to most covariance structures and are therefore well suited for psychometric analysis of…

Descriptors: Multivariate Analysis, Robustness (Statistics), Computation, Test Theory

Estimating Treatment Effects with the Explanatory Item Response Model. EdWorkingPaper No. 22-677

Download full text

Joshua B. Gilbert – Annenberg Institute for School Reform at Brown University, 2022

This simulation study examines the characteristics of the Explanatory Item Response Model (EIRM) when estimating treatment effects when compared to classical test theory (CTT) sum and mean scores and item response theory (IRT)-based theta scores. Results show that the EIRM and IRT theta scores provide generally equivalent bias and false positive…

Descriptors: Item Response Theory, Models, Test Theory, Computation

Digital ITEMS Module 1: Reliability in Classical Test Theory

Peer reviewed

Direct link

Lewis, Charlie; Chajewski, Michael; Rupp, André A. – Educational Measurement: Issues and Practice, 2018

In this ITEMS module, we provide a two-part introduction to the topic of reliability from the perspective of "classical test theory" (CTT). In the first part, which is directed primarily at beginning learners, we review and build on the content presented in the original didactic ITEMS article by Traub and Rowley (1991). Specifically, we…

Descriptors: Test Reliability, Test Theory, Computation, Data Collection

A Cognitive Diagnosis Model for Continuous Response

Peer reviewed

Direct link

Minchen, Nathan D.; de la Torre, Jimmy; Liu, Ying – Journal of Educational and Behavioral Statistics, 2017

Nondichotomous response models have been of greater interest in recent years due to the increasing use of different scoring methods and various performance measures. As an important alternative to dichotomous scoring, the use of continuous response formats has been found in the literature. To assess finer-grained skills or attributes and to…

Descriptors: Models, Psychometrics, Test Theory, Maximum Likelihood Statistics

On True Score Evaluation Using Item Response Theory Modeling

Peer reviewed

Direct link

Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Harrison, Michael – Educational and Psychological Measurement, 2019

Building on prior research on the relationships between key concepts in item response theory and classical test theory, this note contributes to highlighting their important and useful links. A readily and widely applicable latent variable modeling procedure is discussed that can be used for point and interval estimation of the individual person…

Descriptors: True Scores, Item Response Theory, Test Items, Test Theory

Modifying Spearman's Attenuation Equation to Yield Partial Corrections for Measurement Error--With Application to Sample Size Calculations

Peer reviewed

Direct link

Nicewander, W. Alan – Educational and Psychological Measurement, 2018

Spearman's correction for attenuation (measurement error) corrects a correlation coefficient for measurement errors in either-or-both of two variables, and follows from the assumptions of classical test theory. Spearman's equation removes all measurement error from a correlation coefficient which translates into "increasing the reliability of…

Descriptors: Error of Measurement, Correlation, Sample Size, Computation

Components of Variance of Scales with a Bifactor Subscale Structure from Two Calculations of Alpha

Peer reviewed

Direct link

Andrich, David – Educational Measurement: Issues and Practice, 2016

Since Cronbach's (1951) elaboration of [alpha] from its introduction by Guttman (1945), this coefficient has become ubiquitous in characterizing assessment instruments in education, psychology, and other social sciences. Also ubiquitous are caveats on the calculation and interpretation of this coefficient. This article summarizes a recent contribution…

Descriptors: Computation, Correlation, Test Theory, Measures (Individuals)

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study was to determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

"TechCheck": Development and Validation of an Unplugged Assessment of Computational Thinking in Early Childhood Education

Peer reviewed

Direct link

Relkin, Emily; de Ruiter, Laura; Bers, Marina Umaschi – Journal of Science Education and Technology, 2020

There is a need for developmentally appropriate Computational Thinking (CT) assessments that can be implemented in early childhood classrooms. We developed a new instrument called "TechCheck" for assessing CT skills in young children that does not require prior knowledge of computer programming. "TechCheck" is based on…

Descriptors: Developmentally Appropriate Practices, Computation, Thinking Skills, Early Childhood Education

A Strategy for Replacing Sum Scoring

Peer reviewed

Direct link

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Comments on van der Linden's Critique and Proposal for Equating

Peer reviewed

Direct link

Holland, Paul W. – Journal of Educational Measurement, 2013

While agreeing with van der Linden (this issue) that test equating needs better theoretical underpinnings, my comments criticize several aspects of his article. His examples are, for the most part, worthless; he does not use well-established terminology correctly; his view of 100 years of attempts to give a theoretical basis for equating is…

Descriptors: Equated Scores, Test Theory, Transformations (Mathematics), Computation

Some Conceptual Issues in Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational Measurement, 2013

In spite of all of the technical progress in observed-score equating, several of the more conceptual aspects of the process still are not well understood. As a result, the equating literature struggles with rather complex criteria of equating, lack of a test-theoretic foundation, confusing terminology, and ad hoc analyses. A return to Lord's…

Descriptors: Equated Scores, Statistical Analysis, Computation, Data Collection

Author

Andrich, David	3
Ketterlin-Geller, Leanne R.	2
Liu, Kimy	2
Tindal, Gerald	2
Almehrizi, Rashid S.	1
An, Ji	1
Beauducel, Andre	1
Beretvas, S. Natasha	1
Bers, Marina Umaschi	1
Bramley, Tom	1
Calmettes, Guillaume	1
Chajewski, Michael	1
Clemens, Nathan H.	1
Cui, Zhongmin	1
Culpepper, Steven Andrew	1
Davis, John L.	1
DeCarlo, Lawrence T.	1
Deng, Nina	1
Dhawan, Vikas	1
Dimitrov, Dimiter M.	1
Drummond, Gordon B.	1
Fang, Yu	1
Haberman, Shelby	1
Hancock, Gregory R.	1
Harrison, Michael	1