Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 17 |
Since 2016 (last 10 years) | 44 |
Since 2006 (last 20 years) | 101 |
Descriptor
Error of Measurement | 101 |
Probability | 101 |
Statistical Analysis | 36 |
Computation | 28 |
Models | 25 |
Simulation | 22 |
Sample Size | 18 |
Item Response Theory | 17 |
Regression (Statistics) | 16 |
Comparative Analysis | 15 |
Scores | 15 |
More ▼ |
Source
Author
Raykov, Tenko | 3 |
Blackwell, Matthew | 2 |
Honaker, James | 2 |
King, Gary | 2 |
Lee, Won-Chan | 2 |
Leite, Walter L. | 2 |
Marcoulides, George A. | 2 |
Monroe, Scott | 2 |
Phillips, Gary W. | 2 |
Qian, Jiahe | 2 |
Sijtsma, Klaas | 2 |
More ▼ |
Publication Type
Journal Articles | 85 |
Reports - Research | 59 |
Reports - Evaluative | 23 |
Reports - Descriptive | 12 |
Dissertations/Theses -… | 6 |
Guides - Non-Classroom | 1 |
Information Analyses | 1 |
Numerical/Quantitative Data | 1 |
Opinion Papers | 1 |
Education Level
Higher Education | 11 |
Postsecondary Education | 9 |
Secondary Education | 3 |
Elementary Education | 2 |
Junior High Schools | 2 |
Middle Schools | 2 |
Elementary Secondary Education | 1 |
Grade 1 | 1 |
Grade 2 | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
More ▼ |
Audience
Researchers | 3 |
Teachers | 2 |
Location
Germany | 2 |
Ohio | 2 |
United Kingdom | 2 |
Brazil | 1 |
Europe | 1 |
Hong Kong | 1 |
Israel | 1 |
Italy | 1 |
Netherlands | 1 |
Pennsylvania | 1 |
United Kingdom (England) | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Bixi Zhang; Spyros Konstantopoulos – Society for Research on Educational Effectiveness, 2022
Background: Meta-analysis refers to the statistical methods employed to combine results of several empirical studies in a topic of interest (Hedges & Olkin, 1985). Meta-analysis is often included in literature review studies to quantitatively analyze data from a collection of studies (Valentine et al., 2010). The statistical power of a…
Descriptors: Meta Analysis, Probability, Effect Size, Research Methodology
Myoung-jae Lee; Goeun Lee; Jin-young Choi – Sociological Methods & Research, 2025
A linear model is often used to find the effect of a binary treatment D on a noncontinuous outcome Y with covariates X. Particularly, a binary Y gives the popular "linear probability model (LPM)," but the linear model is untenable if X contains a continuous regressor. This raises the question: what kind of treatment effect does the…
Descriptors: Probability, Least Squares Statistics, Regression (Statistics), Causal Models
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
Kulinskaya, Elena; Hoaglin, David C. – Research Synthesis Methods, 2023
For estimation of heterogeneity variance T[superscript 2] in meta-analysis of log-odds-ratio, we derive new mean- and median-unbiased point estimators and new interval estimators based on a generalized Q statistic, Q[subscript F], in which the weights depend on only the studies' effective sample sizes. We compare them with familiar estimators…
Descriptors: Q Methodology, Statistical Analysis, Meta Analysis, Intervals
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023
This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…
Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores
Raykov, Tenko; Marcoulides, George A.; Pusic, Martin – Measurement: Interdisciplinary Research and Perspectives, 2021
An interval estimation procedure is discussed that can be used to evaluate the probability of a particular response for a binary or binary scored item at a pre-specified point along an underlying latent continuum. The item is assumed to: (a) be part of a unidimensional multi-component measuring instrument that may contain also polytomous items,…
Descriptors: Item Response Theory, Computation, Probability, Test Items
Alexandru Cernat; Vera Toepoel – International Journal of Social Research Methodology, 2024
Most of the social science research is based on the implied assumption that measurement error is the same across key socio-demographic groups and all differences in key statistics of interest are real. Nevertheless, there is evidence that this is not the case. In this paper, the authors tackle this important topic by investigating if data quality…
Descriptors: Error of Measurement, Low Income Groups, Probability, Foreign Countries
Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation
Breen, Richard; Bernt Karlson, Kristian; Holm, Anders – Sociological Methods & Research, 2021
The Karlson-Holm-Breen (KHB) method has rapidly become popular as a way of separating the impact of confounding from rescaling when comparing conditional and unconditional parameter estimates in nonlinear probability models such as the logit and probit. In this note, we show that the same estimates can be obtained in a somewhat different way to…
Descriptors: Probability, Models, Computation, Comparative Analysis
Adam Sales; Ethan Prhiar; Thanaporn March Patikorn – Society for Research on Educational Effectiveness, 2021
In a randomized controlled trial (RCT), some subjects assigned to the treatment condition may not fully comply. Often there is interest in the effect of the treatment within the "principal stratum" of subjects who would comply if assigned to treatment. However, it is unknown which control subjects would have complied if treated and which…
Descriptors: Randomized Controlled Trials, Scores, Probability, Statistical Analysis
Sharpe, J. P. – Physics Teacher, 2022
The Poisson distribution describes the probability of a certain number of events occurring in an interval of time when the occurrence of the individual events is independent of one another and the events occur with a fixed mean rate. Probably the best-known example of the Poisson distribution in the physics curriculum is the temporal distribution…
Descriptors: Physics, Science Instruction, Probability, Mathematics Skills
Ellis, Jules L. – Educational and Psychological Measurement, 2021
This study develops a theoretical model for the costs of an exam as a function of its duration. Two kind of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…
Descriptors: Test Length, Models, Error of Measurement, Measurement
Altintas, Ozge; Wallin, Gabriel – International Journal of Assessment Tools in Education, 2021
Educational assessment tests are designed to measure the same psychological constructs over extended periods. This feature is important considering that test results are often used for admittance to university programs. To ensure fair assessments, especially for those whose results weigh heavily in selection decisions, it is necessary to collect…
Descriptors: College Admission, College Entrance Examinations, Test Bias, Equated Scores
Yongyun Shin; Stephen W. Raudenbush – Grantee Submission, 2023
We consider two-level models where a continuous response R and continuous covariates C are assumed missing at random. Inferences based on maximum likelihood or Bayes are routinely made by estimating their joint normal distribution from observed data R[subscript obs] and C[subscript obs]. However, if the model for R given C includes random…
Descriptors: Maximum Likelihood Statistics, Hierarchical Linear Modeling, Error of Measurement, Statistical Distributions
Raykov, Tenko; Marcoulides, George A.; Li, Tenglong – Educational and Psychological Measurement, 2018
This note extends the results in the 2016 article by Raykov, Marcoulides, and Li to the case of correlated errors in a set of observed measures subjected to principal component analysis. It is shown that when at least two measures are fallible, the probability is zero for any principal component--and in particular for the first principal…
Descriptors: Factor Analysis, Error of Measurement, Correlation, Reliability