Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 19 |
Descriptor
Scores | 34 |
Sampling | 33 |
Error of Measurement | 9 |
Foreign Countries | 8 |
Statistical Analysis | 8 |
Reliability | 7 |
Comparative Analysis | 6 |
Academic Achievement | 5 |
Correlation | 5 |
Research Design | 5 |
Achievement Tests | 4 |
More ▼ |
Source
Author
Zimmerman, Donald W. | 2 |
Abduljabbar, Adel S. | 1 |
Asukai, Nozomu | 1 |
Blais, Jean-Guy | 1 |
Brennan, Robert L. | 1 |
Carol Eckerly | 1 |
Carroll, Regina A. | 1 |
Chan, Wendy | 1 |
Chen, Michael | 1 |
Cook, Linda L. | 1 |
Dalton, Ben | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 34 |
Journal Articles | 23 |
Speeches/Meeting Papers | 7 |
Numerical/Quantitative Data | 2 |
Book/Product Reviews | 1 |
Opinion Papers | 1 |
Education Level
Grade 4 | 2 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 5 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
Higher Education | 1 |
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
College Board Achievement… | 1 |
Comprehensive Tests of Basic… | 1 |
Progress in International… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
Chan, Wendy – American Journal of Evaluation, 2022
Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…
Descriptors: Probability, Scores, Scoring, Generalization
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Wang, Jianjun; Ma, Xin – Athens Journal of Education, 2019
This rejoinder keeps the original focus on statistical computing pertaining to the correlation of student achievement between mathematics and science from the Trend in Mathematics and Science Study (TIMSS). Albeit the availability of student performance data in TIMSS and the emphasis of the inter-subject connection in the Next Generation Science…
Descriptors: Scores, Correlation, Achievement Tests, Elementary Secondary Education
Warner-Griffin, Catharine; Liu, Huili; Tadler, Chrystine; Herget, Debbie; Dalton, Ben – National Center for Education Statistics, 2017
The Progress in International Reading Literacy Study (PIRLS) is an international assessment of student performance in reading literacy at the fourth grade. PIRLS measures students in the fourth year of formal schooling because this is typically when students' learning transitions from a focus on "learning to read" to a focus on…
Descriptors: Foreign Countries, Achievement Tests, Grade 4, International Assessment
Rapp, John T.; Carroll, Regina A.; Stangeland, Lindsay; Swanson, Greg; Higgins, William J. – Behavior Modification, 2011
The authors evaluated the extent to which interobserver agreement (IOA) scores, using the block-by-block method for events scored with continuous duration recording (CDR), were higher when the data from the same sessions were converted to discontinuous methods. Sessions with IOA scores of 89% or less with CDR were rescored using 10-s partial…
Descriptors: Intervals, Sampling, Comparative Analysis, Measures (Individuals)
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Mizuno, Yasunao; Kishimoto, Junji; Asukai, Nozomu – Death Studies, 2012
To investigate the prevalence of significant loss, potential complicated grief (CG), and its contributing factors, we conducted a nationwide random sampling survey of Japanese adults aged 18 or older (N = 1,343) using a self-rating Japanese-language version of the Complicated Grief Brief Screen. Among them, 37.0% experienced their most significant…
Descriptors: Grief, Well Being, Foreign Countries, Sampling
Tomlinson, Jon C.; Winston, Bruce E. – Christian Higher Education, 2011
This study builds on earlier work by DellaVecchio and Winston (2004) and McPherson (2008). They addressed the seven motivational gifts Paul wrote about in Romans 12:3-8 as a means for addressing job satisfaction and person-job fit among college professors. Using a snowball sampling method, 89 college professors completed the online survey…
Descriptors: Tenure, Job Satisfaction, Multivariate Analysis, Biblical Literature
Marsh, Herbert W.; Ludtke, Oliver; Nagengast, Benjamin; Trautwein, Ulrich; Morin, Alexandre J. S.; Abduljabbar, Adel S.; Koller, Olaf – Educational Psychologist, 2012
Classroom context and climate are inherently classroom-level (L2) constructs, but applied researchers sometimes--inappropriately--represent them by student-level (L1) responses in single-level models rather than more appropriate multilevel models. Here we focus on important conceptual issues (distinctions between climate and contextual variables;…
Descriptors: Foreign Countries, Classroom Environment, Educational Research, Research Design
Wiberg, Marie; Sundstrom, Anna – Practical Assessment, Research & Evaluation, 2009
A common problem in predictive validity studies in the educational and psychological fields, e.g. in educational and employment selection, is restriction in range of the predictor variables. There are several methods for correcting correlations for restriction of range. The aim of this paper was to examine the usefulness of two approaches to…
Descriptors: Predictive Validity, Predictor Variables, Correlation, Mathematics
O'Toole, John Mitchell; King, Robert A. R. – Language Assessment Quarterly, 2010
This quantitative study intends to better understand the impact of the location of the first deleted word upon the estimation of text difficulty yielded by successive cloze tests based on random deletion from a single passage. The variation in sampling of language features across five cloze tests based on the same passage is random and thus not…
Descriptors: Cloze Procedure, Readability, Nouns, Figurative Language
Waller, Niels G. – Applied Psychological Measurement, 2008
Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…
Descriptors: Test Items, Reliability, Scores, Psychometrics
Limbrick, Lisa; Wheldall, Kevin; Madelaine, Alison – Australian Journal of Learning Difficulties, 2008
Extensive research over the past decade has indicated that there are more boys than girls who are struggling readers, but the degree to which there are more boys remains a point of contention. The focus of this article is to review the various definitions of reading disability, to examine how these different definitions translate into different…
Descriptors: Reading Difficulties, Gender Differences, Definitions, Low Achievement
Zimmerman, Donald W. – Educational and Psychological Measurement, 2007
Properties of the Spearman correction for attenuation were investigated using Monte Carlo methods, under conditions where correlations between error scores exist as a population parameter and also where correlated errors arise by chance in random sampling. Equations allowing for all possible dependence among true and error scores on two tests at…
Descriptors: Monte Carlo Methods, Correlation, Sampling, Data Analysis