ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	8

Descriptor

Error of Measurement	10
Test Theory	10
Item Response Theory	5
Reliability	4
Test Items	4
Correlation	3
Comparative Analysis	2
Computation	2
Decision Making	2
Generalizability Theory	2
Measurement	2
Measurement Techniques	2
Raw Scores	2
Regression (Statistics)	2
Scores	2
Test Construction	2
Adolescents	1
Behavior Theories	1
Comparative Education	1
Computer Software	1
Criterion Referenced Tests	1
Curriculum Based Assessment	1
Disabilities	1
Educational Policy	1
Educational Quality	1
More ▼

Source

Applied Psychological…	2
Alberta Journal of…	1
Educational and Psychological…	1
International Journal of…	1
International Journal of…	1
Journal of Early Adolescence	1
Journal of Special Education	1
Measurement:…	1

Publication Type

Reports - Descriptive	10
Journal Articles	9
Speeches/Meeting Papers	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Wechsler Preschool and…

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed
PDF on ERIC

Download full text

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

Peer reviewed

Direct link

Culpepper, Steven Andrew – Applied Psychological Measurement, 2013

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…

Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Direct link

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

Defensible Progress Monitoring Data for Medium- and High-Stakes Decisions

Peer reviewed

Direct link

Parker, Richard I.; Vannest, Kimberly J.; Davis, John L.; Clemens, Nathan H. – Journal of Special Education, 2012

Within a response to intervention model, educators increasingly use progress monitoring (PM) to support medium- to high-stakes decisions for individual students. For PM to serve these more demanding decisions requires more careful consideration of measurement error. That error should be calculated within a fixed linear regression model rather than…

Descriptors: Measurement, Computation, Response to Intervention, Regression (Statistics)

On Bias in Linear Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010

The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…

Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends

Polytomous Differential Item Functioning and Violations of Ordering of the Expected Latent Trait by the Raw Score

Peer reviewed

Direct link

DeMars, Christine E. – Educational and Psychological Measurement, 2008

The graded response (GR) and generalized partial credit (GPC) models do not imply that examinees ordered by raw observed score will necessarily be ordered on the expected value of the latent trait (OEL). Factors were manipulated to assess whether increased violations of OEL also produced increased Type I error rates in differential item…

Descriptors: Test Items, Raw Scores, Test Theory, Error of Measurement

Theory of Test Translation Error

Peer reviewed

Direct link

Solano-Flores, Guillermo; Backhoff, Eduardo; Contreras-Nino, Luis Angel – International Journal of Testing, 2009

In this article, we present a theory of test translation whose intent is to provide the conceptual foundation for effective, systematic work in the process of test translation and test translation review. According to the theory, translation error is multidimensional; it is not simply the consequence of defective translation but an inevitable fact…

Descriptors: Test Items, Investigations, Semantics, Translation

Standardized Conditional "SEM": A Case for Conditional Reliability

Peer reviewed

Direct link

Raju, Nambury S.; Price, Larry R.; Oshima, T. C.; Nering, Michael L. – Applied Psychological Measurement, 2007

An examinee-level (or conditional) reliability is proposed for use in both classical test theory (CTT) and item response theory (IRT). The well-known group-level reliability is shown to be the average of conditional reliabilities of examinees in a group or a population. This relationship is similar to the known relationship between the square of…

Descriptors: Item Response Theory, Error of Measurement, Reliability, Test Theory

Comparing Measurement Theories.

Download full text

Schumacker, Randall E. – 1998

In comparing measurement theories, it is evident that the awareness of the concept of measurement error during the time of Galileo has lead to the formulation of observed scores comprising a true score and error (classical theory), universe score and various random error components (generalizability theory), or individual latent ability and error…

Descriptors: Comparative Analysis, Computer Software, Error of Measurement, Generalizability Theory

Behavior Domains in Theory and in Practice

Peer reviewed

Direct link

McDonald, Roderick P. – Alberta Journal of Educational Research, 2003

The concept of a behavior domain is a reasonable and essential foundation for psychometric work based on true score theory, the linear model of common factor analysis, and the nonlinear models of item response theory. Investigators applying these models to test data generally treat the true scores or factors or traits as abstractive psychological…

Descriptors: Factor Analysis, Error of Measurement, True Scores, Psychometrics

Backhoff, Eduardo	1
Bichi, Ado Abdu	1
Clemens, Nathan H.	1
Contreras-Nino, Luis Angel	1
Culpepper, Steven Andrew	1
Davis, John L.	1
DeMars, Christine E.	1
Fan, Xitao	1
McDonald, Roderick P.	1
Nering, Michael L.	1
Oshima, T. C.	1
Parker, Richard I.	1
Price, Larry R.	1
Raju, Nambury S.	1
Schumacker, Randall E.	1
Solano-Flores, Guillermo	1
Sun, Shaojing	1
Talib, Rohaya	1
Vannest, Kimberly J.	1
van der Linden, Wim J.	1
More ▼