ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Error of Measurement	13
Test Construction	13
Test Theory	13
Test Reliability	7
Test Validity	7
Item Analysis	5
Test Items	5
Item Response Theory	4
Career Development	3
Criterion Referenced Tests	3
Latent Trait Theory	3
Measurement Techniques	3
Psychometrics	3
Test Interpretation	3
Academic Standards	2
Achievement Tests	2
Comparative Analysis	2
Decision Making	2
Difficulty Level	2
Educational Quality	2
Factor Analysis	2
Foreign Countries	2
Generalizability Theory	2
Higher Education	2
Item Sampling	2
More ▼

Source

Alberta Journal of…	1
ETS Research Report Series	1
Educational Measurement:…	1
IEEE Transactions on Education	1
International Journal of…	1
International Journal of…	1
Psychometrika	1
Social Behavior and…	1

Publication Type

Journal Articles	8
Reports - Research	8
Reports - Descriptive	2
Book/Product Reviews	1
Opinion Papers	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Elementary Secondary Education	1
Grade 3	1
Postsecondary Education	1

Audience

Location

Canada

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed
PDF on ERIC

Download full text

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Download full text

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

A Control Systems Concept Inventory Test Design and Assessment

Peer reviewed

Direct link

Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012

Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…

Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Direct link

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Measurement Error and Changes in Personal Constructs.

Peer reviewed

Chambers, William V. – Social Behavior and Personality, 1985

Personal construct psychologists have suggested various psychological functions explain differences in the stability of constructs. Among these functions are constellatory and loose construction. This paper argues that measurement error is a more parsimonious explanation of the differences in construct stability reported in these studies. (Author)

Descriptors: Error of Measurement, Test Construction, Test Format, Test Reliability

TESTAT--A Supplementary Module to SYSTAT. Software Review.

Peer reviewed

Stone, Clement A. – Educational Measurement: Issues and Practice, 1992

TESTAT is a supplementary module for the popular SYSTAT statistical package for the personal computer. The program performs test analyses based on classical test theory and item response theory. Limitations and advantages are discussed. (SLD)

Descriptors: Computer Assisted Testing, Computer Software Evaluation, Error of Measurement, Item Response Theory

Estimating the Imputed Social Cost of Errors of Measurement.

Peer reviewed

Lord, Frederic M. – Psychometrika, 1985

Given a loss function, an asymptotic method for optimal test design for a specified target population of examinees is presented. Also, of more practical use, given an existing unidimensional test and target population, a way is presented to find the loss function for which the test is optimal. (NSF)

Descriptors: Error of Measurement, Higher Education, Item Sampling, Latent Trait Theory

A Comparison of Two Item Selection Procedures for Building Criterion-Referenced Tests.

Download full text

Haladyna, Tom; Roid, Gale – 1981

Two approaches to criterion-referenced test construction are compared. Classical test theory is based on the practice of random sampling from a well-defined domain of test items; latent trait theory suggests that the difficulty of the items should be matched to the achievement level of the student. In addition to these two methods of test…

Descriptors: Criterion Referenced Tests, Error of Measurement, Latent Trait Theory, Test Construction

Behavior Domains in Theory and in Practice

Peer reviewed

Direct link

McDonald, Roderick P. – Alberta Journal of Educational Research, 2003

The concept of a behavior domain is a reasonable and essential foundation for psychometric work based on true score theory, the linear model of common factor analysis, and the nonlinear models of item response theory. Investigators applying these models to test data generally treat the true scores or factors or traits as abstractive psychological…

Descriptors: Factor Analysis, Error of Measurement, True Scores, Psychometrics

An Application of Generalizability Theory to the Validation of a Behaviorally Anchored Role-Play Measure.

Espelage, Dorothy L.; Quittner, Alexandra L.; Kamps, Jodi – 1998

Generalizability theory (g-theory) was used, as an alternative to classical test theory, to evaluate measurement error in a behaviorally anchored role-play measure, highlighting the usefulness of this theory in instrument development. G-theory partitions an observed score into the universe score and error scores associated with separate sources of…

Descriptors: Behavior Patterns, Eating Disorders, Error of Measurement, Females

Invariance of Rasch Model Ability Parameter Estimates Over Different Collections of Items.

Curry, Allen R.; And Others – 1978

The efficacy of employing subsets of items from a calibrated item pool to estimate the Rasch model person parameters was investigated. Specifically, the degree of invariance of Rasch model ability-parameter estimates was examined across differing collections of simulated items. The ability-parameter estimates were obtained from a simulation of…

Descriptors: Career Development, Difficulty Level, Equated Scores, Error of Measurement

The Paradox of Criterion-Referenced Measurement.

Download full text

Haladyna, Tom – 1976

The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between two alternative forms of measurement, CR and Norm Referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveal that the common…

Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis

A Theoretical and Empirical Comparison of Three Approaches to Achievement Testing.

Haladyna, Tom; Roid, Gale – 1976

Three approaches to the construction of achievement tests are compared: construct, operational, and empirical. The construct approach is based upon classical test theory and measures an abstract representation of the instructional objectives. The operational approach specifies instructional intent through instructional objectives, facet design,…

Descriptors: Academic Achievement, Achievement Tests, Career Development, Comparative Analysis

Haladyna, Tom	3
Roid, Gale	2
Bichi, Ado Abdu	1
Bristow, M.	1
Chambers, William V.	1
Curry, Allen R.	1
Elosua, Paula	1
Erkorkmaz, K.	1
Espelage, Dorothy L.	1
Huissoon, J. P.	1
Iliescu, Dragos	1
Jeon, Soo	1
Kamps, Jodi	1
Li, Feifei	1
Lord, Frederic M.	1
McDonald, Roderick P.	1
Owen, W. S.	1
Quittner, Alexandra L.	1
Stone, Clement A.	1
Stubley, G. D.	1
Talib, Rohaya	1
Waslander, S. L.	1
More ▼