Publication Date
In 2025 | 2 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 11 |
Descriptor
Error of Measurement | 39 |
Test Reliability | 39 |
Testing Problems | 39 |
Test Validity | 14 |
Test Interpretation | 11 |
Scores | 10 |
Item Analysis | 7 |
Mathematical Models | 7 |
Achievement Tests | 5 |
Evaluation Methods | 5 |
Higher Education | 5 |
More ▼ |
Source
Author
Zimmerman, Donald W. | 2 |
Adrian Adams | 1 |
Altepeter, Tom | 1 |
Barford, Sean W. | 1 |
Barker, Pierce | 1 |
Belcher, Marcia | 1 |
Bell, Robert M. | 1 |
Berger, Dale E. | 1 |
Brendan A. Schuetze | 1 |
Bridgeman, Brent | 1 |
Brown, Jonathan R. | 1 |
More ▼ |
Publication Type
Journal Articles | 23 |
Reports - Research | 23 |
Speeches/Meeting Papers | 6 |
Opinion Papers | 5 |
Reports - Evaluative | 4 |
Reports - Descriptive | 3 |
Collected Works - Serials | 1 |
Information Analyses | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 3 |
Elementary Secondary Education | 2 |
Postsecondary Education | 2 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Secondary Education | 1 |
Audience
Researchers | 3 |
Practitioners | 1 |
Location
Ethiopia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
Danielle R. Blazek; Jason T. Siegel – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025
Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…
Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
Adrian Adams; Lauren Barth-Cohen – CBE - Life Sciences Education, 2024
In undergraduate research settings, students are likely to encounter anomalous data, that is, data that do not meet their expectations. Most of the research that directly or indirectly captures the role of anomalous data in research settings uses post-hoc reflective interviews or surveys. These data collection approaches focus on recall of past…
Descriptors: Undergraduate Students, Physics, Science Instruction, Laboratory Experiments
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Mrazik, Martin; Janzen, Troy M.; Dombrowski, Stefan C.; Barford, Sean W.; Krawchuk, Lindsey L. – Canadian Journal of School Psychology, 2012
A total of 19 graduate students enrolled in a graduate course conducted 6 consecutive administrations of the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV, Canadian version). Test protocols were examined to obtain data describing the frequency of examiner errors, including administration and scoring errors. Results identified 511…
Descriptors: Intelligence Tests, Intelligence, Statistical Analysis, Scoring
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

Luftig, Jeffrey T.; Norton, Willis P. – Journal of Studies in Technical Careers, 1981
The purpose of this article is to review applications of reliability formulas and to recommend more appropriate methods of determining the reliability of affective instruments. (SK)
Descriptors: Affective Measures, Error of Measurement, Measurement Techniques, Test Reliability

Yarnold, Paul R. – Educational and Psychological Measurement, 1984
Unreliable profiles impose the difficulty that ordinal and interval relations among the individual's scores become uncertain or unstable. A profile reliability coefficient is derived to estimate the relative expected extent of this ordinal and interval "inversion" for any profile of K measures. (Author/DWH)
Descriptors: Error of Measurement, Mathematical Models, Profiles, Test Reliability

Cureton, Edward E. – Educational and Psychological Measurement, 1971
A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)
Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1980
It is suggested that error of measurement cannot be routinely incorporated into the "error term" in statistical tests, and that the reliability of test scores does not have the simple relationship to statistical inference that one might expect. (Author/GK)
Descriptors: Error of Measurement, Hypothesis Testing, Mathematical Formulas, Test Reliability

Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1981
Reliability coefficients of linear combinations of observed scores have anomalous properties which have led to difficulties in the investigation of difference scores and gain scores in test theory. Discrepancies between classical results and correct results obtained from more general formulas, which allow for correlated errors, are examined…
Descriptors: Error of Measurement, Mathematical Formulas, Mathematical Models, Scores