Publication Date
In 2025 | 3 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 34 |
Since 2016 (last 10 years) | 79 |
Since 2006 (last 20 years) | 336 |
Descriptor
Error of Measurement | 702 |
Estimation (Mathematics) | 124 |
Scores | 115 |
Statistical Analysis | 115 |
Comparative Analysis | 95 |
Item Response Theory | 95 |
Correlation | 94 |
Evaluation Methods | 89 |
Computation | 87 |
Research Methodology | 87 |
Reliability | 84 |
More ▼ |
Source
Author
Kolen, Michael J. | 8 |
Thompson, Bruce | 8 |
Brennan, Robert L. | 7 |
Raykov, Tenko | 6 |
Zwick, Rebecca | 6 |
Hanson, Bradley A. | 5 |
Loeb, Susanna | 5 |
Marcoulides, George A. | 5 |
van der Linden, Wim J. | 5 |
Algina, James | 4 |
Alonzo, Julie | 4 |
More ▼ |
Publication Type
Education Level
Location
United States | 11 |
Australia | 9 |
Germany | 7 |
United Kingdom (England) | 7 |
California | 6 |
New York | 5 |
North Carolina | 5 |
Michigan | 4 |
Texas | 4 |
Canada | 3 |
Illinois | 3 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 5 |
Race to the Top | 3 |
Aid to Families with… | 1 |
Elementary and Secondary… | 1 |
Job Training Partnership Act… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Guo, Jinxin; Xu, Xin; Xin, Tao – Journal of Educational Measurement, 2023
Missingness due to not-reached items and omitted items has received much attention in the recent psychometric literature. Such missingness, if not handled properly, would lead to biased parameter estimation, as well as inaccurate inference of examinees, and further erode the validity of the test. This paper reviews some commonly used IRT based…
Descriptors: Psychometrics, Bias, Error of Measurement, Test Validity
Julian Schuessler; Peter Selb – Sociological Methods & Research, 2025
Directed acyclic graphs (DAGs) are now a popular tool to inform causal inferences. We discuss how DAGs can also be used to encode theoretical assumptions about nonprobability samples and survey nonresponse and to determine whether population quantities including conditional distributions and regressions can be identified. We describe sources of…
Descriptors: Data Collection, Graphs, Error of Measurement, Statistical Bias
Jeroen D. Mulder; Kim Luijken; Bas B. L. Penning de Vries; Ellen L. Hamaker – Structural Equation Modeling: A Multidisciplinary Journal, 2024
The use of structural equation models for causal inference from panel data is critiqued in the causal inference literature for unnecessarily relying on a large number of parametric assumptions, and alternative methods originating from the potential outcomes framework have been recommended, such as inverse probability weighting (IPW) estimation of…
Descriptors: Structural Equation Models, Time on Task, Time Management, Causal Models
van Aert, Robbie C. M. – Research Synthesis Methods, 2023
The partial correlation coefficient (PCC) is used to quantify the linear relationship between two variables while taking into account/controlling for other variables. Researchers frequently synthesize PCCs in a meta-analysis, but two of the assumptions of the common equal-effect and random-effects meta-analysis model are by definition violated.…
Descriptors: Correlation, Meta Analysis, Sampling, Simulation
Xin Qiao; Akihito Kamata; Cornelis Potgieter – Grantee Submission, 2023
Oral reading fluency (ORF) assessments are commonly used to screen at-risk readers and to evaluate the effectiveness of interventions as curriculum-based measurements. As with other assessments, equating ORF scores becomes necessary when we want to compare ORF scores from different test forms. Recently, Kara et al. (2023) proposed a model-based…
Descriptors: Error of Measurement, Oral Reading, Reading Fluency, Equated Scores
Moretti, Angelo; Whitworth, Adam – Sociological Methods & Research, 2023
Spatial microsimulation encompasses a range of alternative methodological approaches for the small area estimation (SAE) of target population parameters from sample survey data down to target small areas in contexts where such data are desired but not otherwise available. Although widely used, an enduring limitation of spatial microsimulation SAE…
Descriptors: Simulation, Geometric Concepts, Computation, Measurement
Raggi, Martina; Stanghellini, Elena; Doretti, Marco – Sociological Methods & Research, 2023
The decomposition of the overall effect of a treatment into direct and indirect effects is here investigated with reference to a recursive system of binary random variables. We show how, for the single mediator context, the marginal effect measured on the log odds scale can be written as the sum of the indirect and direct effects plus a residual…
Descriptors: Path Analysis, Student Attitudes, Museums, Error of Measurement
Carpentras, Dino; Quayle, Michael – International Journal of Social Research Methodology, 2023
Agent-based models (ABMs) often rely on psychometric constructs such as 'opinions', 'stubbornness', 'happiness', etc. The measurement process for these constructs is quite different from the one used in physics as there is no standardized unit of measurement for opinion or happiness. Consequently, measurements are usually affected by 'psychometric…
Descriptors: Psychometrics, Error of Measurement, Models, Prediction
Tenko Raykov – Educational and Psychological Measurement, 2024
This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…
Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Kulinskaya, Elena; Hoaglin, David C. – Research Synthesis Methods, 2023
For estimation of heterogeneity variance T[superscript 2] in meta-analysis of log-odds-ratio, we derive new mean- and median-unbiased point estimators and new interval estimators based on a generalized Q statistic, Q[subscript F], in which the weights depend on only the studies' effective sample sizes. We compare them with familiar estimators…
Descriptors: Q Methodology, Statistical Analysis, Meta Analysis, Intervals
Do the Numbers Add Up? Questioning Measurement That Places Australian ECEC Teaching as 'Low Quality'
Thorpe, Karen; Houen, Sandy; Rankin, Peter; Pattinson, Cassandra; Staton, Sally – Australian Educational Researcher, 2023
Internationally, standard observational measures of Early Childhood Education and Care (ECEC) are used to assess the quality of provision. They are applied as research tools but, significantly, also guide policy decisions, distribution of resources and public opinion. Considerable faith is placed in such measures, yet their validity, reliability…
Descriptors: Foreign Countries, Educational Quality, Classroom Environment, Measures (Individuals)
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023
This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…
Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores
Anders Holm; Anders Hjorth-Trolle; Robert Andersen – Sociological Methods & Research, 2025
Lagged dependent variables (LDVs) are often used as predictors in ordinary least squares (OLS) models in the social sciences. Although several estimators are commonly employed, little is known about their relative merits in the presence of classical measurement error and different longitudinal processes. We assess the performance of four commonly…
Descriptors: Elementary Education, Scores, Error of Measurement, Predictor Variables
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards