ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	4

Descriptor

Error of Measurement	11
Models	11
Test Validity	11
Test Reliability	4
Correlation	3
Evaluation Methods	3
Item Response Theory	3
Educational Assessment	2
Elementary Secondary Education	2
Item Analysis	2
Standardized Tests	2
Student Evaluation	2
True Scores	2
Academic Achievement	1
Achievement Need	1
Adults	1
Affective Behavior	1
Attitude Measures	1
Bias	1
Causal Models	1
Cognitive Processes	1
Computer Assisted Instruction	1
Concept Formation	1
Constructivism (Learning)	1
Criterion Referenced Tests	1
More ▼

Source

Journal of Educational…	2
CALICO Journal	1
Educational Assessment,…	1
Journal of Education and…	1
Sociological Methods &…	1

Publication Type

Reports - Research	7
Journal Articles	5
Speeches/Meeting Papers	3
Reports - Evaluative	2

Education Level

Higher Education	2
Elementary Secondary Education	1
Postsecondary Education	1

Audience

Researchers

Location

Texas	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Test of Standard Written…

What Works Clearinghouse Rating

Showing all 11 results Save | Export

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Direct link

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Linear Probability Model Revisited: Why It Works and How It Should Be Specified

Peer reviewed

Direct link

Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025

A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…

Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models

Validation and Invariance of the Conceptions of Assessment-III Abridged (COA-IIIA) among Pre-Service and In-Service Teachers in the United States

Peer reviewed

Direct link

Reeves, Todd D.; Onder, Yasemin; Kraner, Chris – Educational Assessment, Evaluation and Accountability, 2023

As beliefs are well-known antecedents of teachers' practices, including assessment practices, sound measurement of teacher beliefs is critical for scholarly research as well as practical purposes. The present study examined the validity of inferences derived from the Conceptions of Assessment III--Abridged (COA-IIIA) instrument with US PK-12…

Descriptors: Attitude Measures, Teacher Attitudes, Preservice Teachers, Experienced Teachers

Examination of Different Item Response Theory Models on Tests Composed of Testlets

Peer reviewed
PDF on ERIC

Download full text

Kogar, Esin Yilmaz; Kelecioglu, Hülya – Journal of Education and Learning, 2017

The purpose of this research is to first estimate the item and ability parameters and the standard error values related to those parameters obtained from Unidimensional Item Response Theory (UIRT), bifactor (BIF) and Testlet Response Theory models (TRT) in the tests including testlets, when the number of testlets, number of independent items, and…

Descriptors: Item Response Theory, Models, Mathematics Tests, Test Items

Measurement Models for Thematic Apperceptive Measures of the Achievement Motive.

Download full text

Reuman, David A.; And Others – 1982

According to classical test theory, the presence of random measurement error in a psychological test has important implications for validation studies. The more comprehensive application of classical test theory in construct validation is distinguished from that in criterion-oriented validation. Critics of thematic apperceptive measurement of the…

Descriptors: Academic Achievement, Achievement Need, Adults, Error of Measurement

An Evaluation Model for Mastery Testing

Peer reviewed

Emrick, John A. – Journal of Educational Measurement, 1971

Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, Item Analysis

The Test of Standard Written English: A Revalidation with Writing Samples and Implications of Placement Decisions.

Suddick, David E.; And Others – 1985

The Test of Standard Written English (TSWE) is a 50-item multiple choice instrument designed to assess the ability of college students to use English. In this study, based upon a sample of 45 students, the TSWE was revalidated with writing samples. The coefficient of 0.54 was most impressive given that the TSWE scores were restricted to those…

Descriptors: Correlation, Error of Measurement, Essay Tests, Higher Education

Simplex Analyses of the Test Data: What Can They Tell Us?

Frayer, Dorothy A. – 1971

A Paradigm for testing concept attainment, comprised of twelve tasks, was formulated. These tasks were hypothesized to form a cumulative hierarchy. Tests were constructed in mathematics and social studies using the paradigm. Data for these tests was analyzed by Kaiser's method for fitting a perfect simplex and Schonemann's method for fitting a…

Descriptors: Concept Formation, Correlation, Data Analysis, Error of Measurement

Test Validity and National Educational Assessment: A Conception, a Method, and an Example.

Download full text

Wiley, David E.; And Others – 1981

This paper brings to first fruition an analytic schema based on four elements which involve a conception of skills independent or particular testing devices: (1) the development and application of a class of statistical models incorporating qualitative definitions of skill, distorted in item response by errors conceived as misclassifications; (2)…

Descriptors: Educational Assessment, Elementary Education, Error of Measurement, Latent Trait Theory

Test Takers' Experiences with Computer-Administered Listening Comprehension Tests: Interviewing for Qualitative Explorations of Test Validity

Peer reviewed

Direct link

Gorsuch, Greta – CALICO Journal, 2004

In this study, retrospective interviews were used to investigate reliability (and thus validity) threats to a computerized ESL listening comprehension test administered at a university in the US. The participants in the investigation, six international graduate students, were asked to respond to semi- and open-ended questions during individual…

Descriptors: Graduate Students, Listening Comprehension, Investigations, Listening Comprehension Tests

On Meaningful Measurement: Issues of Reliability and Validity from a Humanistic Constructivist Information-Processing Perspective.

Download full text

Cheung, K. C. – 1993

In the past decade, there have been ample interests in the assessment of cognitive and affective processes and products for the purposes of meaningful learning. Meaningful measurement (MM) has been proposed which is in accordance with a humanistic constructivist information-processing perspective. Students' responses to the assessment tasks are…

Descriptors: Affective Behavior, Cognitive Processes, Constructivism (Learning), Educational Assessment

Carl Westine	1
Cheung, K. C.	1
Emrick, John A.	1
Frayer, Dorothy A.	1
Goeun Lee	1
Gorsuch, Greta	1
Jin-young Choi	1
Kelecioglu, Hülya	1
Kogar, Esin Yilmaz	1
Kraner, Chris	1
Michelle Boyer	1
Myoung-jae Lee	1
Onder, Yasemin	1
Reeves, Todd D.	1
Reuman, David A.	1
Stella Y. Kim	1
Suddick, David E.	1
Tong Wu	1
Wiley, David E.	1
More ▼