Publication Date
In 2025 | 2 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 4 |
Descriptor
Source
Journal of Educational… | 2 |
CALICO Journal | 1 |
Educational Assessment,… | 1 |
Journal of Education and… | 1 |
Sociological Methods &… | 1 |
Author
Carl Westine | 1 |
Cheung, K. C. | 1 |
Emrick, John A. | 1 |
Frayer, Dorothy A. | 1 |
Goeun Lee | 1 |
Gorsuch, Greta | 1 |
Jin-young Choi | 1 |
Kelecioglu, Hülya | 1 |
Kogar, Esin Yilmaz | 1 |
Kraner, Chris | 1 |
Michelle Boyer | 1 |
More ▼ |
Publication Type
Reports - Research | 7 |
Journal Articles | 5 |
Speeches/Meeting Papers | 3 |
Reports - Evaluative | 2 |
Education Level
Higher Education | 2 |
Elementary Secondary Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Location
Texas | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of Standard Written… | 1 |
What Works Clearinghouse Rating
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025
A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…
Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models
Reeves, Todd D.; Onder, Yasemin; Kraner, Chris – Educational Assessment, Evaluation and Accountability, 2023
As beliefs are well-known antecedents of teachers' practices, including assessment practices, sound measurement of teacher beliefs is critical for scholarly research as well as practical purposes. The present study examined the validity of inferences derived from the Conceptions of Assessment III--Abridged (COA-IIIA) instrument with US PK-12…
Descriptors: Attitude Measures, Teacher Attitudes, Preservice Teachers, Experienced Teachers
Kogar, Esin Yilmaz; Kelecioglu, Hülya – Journal of Education and Learning, 2017
The purpose of this research is to first estimate the item and ability parameters and the standard error values related to those parameters obtained from Unidimensional Item Response Theory (UIRT), bifactor (BIF) and Testlet Response Theory models (TRT) in the tests including testlets, when the number of testlets, number of independent items, and…
Descriptors: Item Response Theory, Models, Mathematics Tests, Test Items
Reuman, David A.; And Others – 1982
According to classical test theory, the presence of random measurement error in a psychological test has important implications for validation studies. The more comprehensive application of classical test theory in construct validation is distinguished from that in criterion-oriented validation. Critics of thematic apperceptive measurement of the…
Descriptors: Academic Achievement, Achievement Need, Adults, Error of Measurement

Emrick, John A. – Journal of Educational Measurement, 1971
Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, Item Analysis
Suddick, David E.; And Others – 1985
The Test of Standard Written English (TSWE) is a 50-item multiple choice instrument designed to assess the ability of college students to use English. In this study, based upon a sample of 45 students, the TSWE was revalidated with writing samples. The coefficient of 0.54 was most impressive given that the TSWE scores were restricted to those…
Descriptors: Correlation, Error of Measurement, Essay Tests, Higher Education
Frayer, Dorothy A. – 1971
A Paradigm for testing concept attainment, comprised of twelve tasks, was formulated. These tasks were hypothesized to form a cumulative hierarchy. Tests were constructed in mathematics and social studies using the paradigm. Data for these tests was analyzed by Kaiser's method for fitting a perfect simplex and Schonemann's method for fitting a…
Descriptors: Concept Formation, Correlation, Data Analysis, Error of Measurement
Wiley, David E.; And Others – 1981
This paper brings to first fruition an analytic schema based on four elements which involve a conception of skills independent or particular testing devices: (1) the development and application of a class of statistical models incorporating qualitative definitions of skill, distorted in item response by errors conceived as misclassifications; (2)…
Descriptors: Educational Assessment, Elementary Education, Error of Measurement, Latent Trait Theory
Gorsuch, Greta – CALICO Journal, 2004
In this study, retrospective interviews were used to investigate reliability (and thus validity) threats to a computerized ESL listening comprehension test administered at a university in the US. The participants in the investigation, six international graduate students, were asked to respond to semi- and open-ended questions during individual…
Descriptors: Graduate Students, Listening Comprehension, Investigations, Listening Comprehension Tests
Cheung, K. C. – 1993
In the past decade, there have been ample interests in the assessment of cognitive and affective processes and products for the purposes of meaningful learning. Meaningful measurement (MM) has been proposed which is in accordance with a humanistic constructivist information-processing perspective. Students' responses to the assessment tasks are…
Descriptors: Affective Behavior, Cognitive Processes, Constructivism (Learning), Educational Assessment