ERIC - Search Results

Showing 1 to 15 of 34 results

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

The Role of Distributional Overlap on the Precision Gain of Bounds for Generalization

Peer reviewed

Chan, Wendy – American Journal of Evaluation, 2022

Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…

Descriptors: Probability, Scores, Scoring, Generalization

New Tests of Rater Drift in Trend Scoring

Peer reviewed

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Rejoinder: Response To--"An Examination of Plausible Score Correlation from the Trend in Mathematics and Science Study"

Peer reviewed

Wang, Jianjun; Ma, Xin – Athens Journal of Education, 2019

This rejoinder keeps the original focus on statistical computing pertaining to the correlation of student achievement between mathematics and science from the Trend in Mathematics and Science Study (TIMSS). Albeit the availability of student performance data in TIMSS and the emphasis of the inter-subject connection in the Next Generation Science…

Descriptors: Scores, Correlation, Achievement Tests, Elementary Secondary Education

Reading Achievement of U.S. Fourth-Grade Students in an International Context: First Look at the Progress in International Reading Literacy Study (PIRLS) 2016 and ePIRLS 2016. NCES 2018-017

Peer reviewed

Warner-Griffin, Catharine; Liu, Huili; Tadler, Chrystine; Herget, Debbie; Dalton, Ben – National Center for Education Statistics, 2017

The Progress in International Reading Literacy Study (PIRLS) is an international assessment of student performance in reading literacy at the fourth grade. PIRLS measures students in the fourth year of formal schooling because this is typically when students' learning transitions from a focus on "learning to read" to a focus on…

Descriptors: Foreign Countries, Achievement Tests, Grade 4, International Assessment

A Comparison of Reliability Measures for Continuous and Discontinuous Recording Methods: Inflated Agreement Scores with Partial Interval Recording and Momentary Time Sampling for Duration Events

Peer reviewed

Rapp, John T.; Carroll, Regina A.; Stangeland, Lindsay; Swanson, Greg; Higgins, William J. – Behavior Modification, 2011

The authors evaluated the extent to which interobserver agreement (IOA) scores, using the block-by-block method for events scored with continuous duration recording (CDR), were higher when the data from the same sessions were converted to discontinuous methods. Sessions with IOA scores of 89% or less with CDR were rescored using 10-s partial…

Descriptors: Intervals, Sampling, Comparative Analysis, Measures (Individuals)

Validating the Interpretations and Uses of Test Scores

Peer reviewed

Kane, Michael T. – Journal of Educational Measurement, 2013

To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

Descriptors: Test Interpretation, Validity, Scores, Test Use

A Nationwide Random Sampling Survey of Potential Complicated Grief in Japan

Peer reviewed

Mizuno, Yasunao; Kishimoto, Junji; Asukai, Nozomu – Death Studies, 2012

To investigate the prevalence of significant loss, potential complicated grief (CG), and its contributing factors, we conducted a nationwide random sampling survey of Japanese adults aged 18 or older (N = 1,343) using a self-rating Japanese-language version of the Complicated Grief Brief Screen. Among them, 37.0% experienced their most significant…

Descriptors: Grief, Well Being, Foreign Countries, Sampling

Romans 12 Motivational Gifts and College Professors: Implications for Job Satisfaction and Person-Job Fit

Peer reviewed

Tomlinson, Jon C.; Winston, Bruce E. – Christian Higher Education, 2011

This study builds on earlier work by DellaVecchio and Winston (2004) and McPherson (2008). They addressed the seven motivational gifts Paul wrote about in Romans 12:3-8 as a means for addressing job satisfaction and person-job fit among college professors. Using a snowball sampling method, 89 college professors completed the online survey…

Descriptors: Tenure, Job Satisfaction, Multivariate Analysis, Biblical Literature

Classroom Climate and Contextual Effects: Conceptual and Methodological Issues in the Evaluation of Group-Level Effects

Peer reviewed

Marsh, Herbert W.; Ludtke, Oliver; Nagengast, Benjamin; Trautwein, Ulrich; Morin, Alexandre J. S.; Abduljabbar, Adel S.; Koller, Olaf – Educational Psychologist, 2012

Classroom context and climate are inherently classroom-level (L2) constructs, but applied researchers sometimes--inappropriately--represent them by student-level (L1) responses in single-level models rather than more appropriate multilevel models. Here we focus on important conceptual issues (distinctions between climate and contextual variables;…

Descriptors: Foreign Countries, Classroom Environment, Educational Research, Research Design

A Comparison of Two Approaches to Correction of Restriction of Range in Correlation Analysis

Peer reviewed

Wiberg, Marie; Sundstrom, Anna – Practical Assessment, Research & Evaluation, 2009

A common problem in predictive validity studies in the educational and psychological fields, e.g. in educational and employment selection, is restriction in range of the predictor variables. There are several methods for correcting correlations for restriction of range. The aim of this paper was to examine the usefulness of two approaches to…

Descriptors: Predictive Validity, Predictor Variables, Correlation, Mathematics

A Matter of Significance: Can Sampling Error Invalidate Cloze Estimates of Text Readability?

Peer reviewed

O'Toole, John Mitchell; King, Robert A. R. – Language Assessment Quarterly, 2010

This quantitative study intends to better understand the impact of the location of the first deleted word upon the estimation of text difficulty yielded by successive cloze tests based on random deletion from a single passage. The variation in sampling of language features across five cloze tests based on the same passage is random and thus not…

Descriptors: Cloze Procedure, Readability, Nouns, Figurative Language

Commingled Samples: A Neglected Source of Bias in Reliability Analysis

Peer reviewed

Waller, Niels G. – Applied Psychological Measurement, 2008

Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient alpha and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…

Descriptors: Test Items, Reliability, Scores, Psychometrics

Gender Ratios for Reading Disability: Are There Really More Boys than Girls Who Are Low-Progress Readers?

Peer reviewed

Limbrick, Lisa; Wheldall, Kevin; Madelaine, Alison – Australian Journal of Learning Difficulties, 2008

Extensive research over the past decade has indicated that there are more boys than girls who are struggling readers, but the degree to which there are more boys remains a point of contention. The focus of this article is to review the various definitions of reading disability, to examine how these different definitions translate into different…

Descriptors: Reading Difficulties, Gender Differences, Definitions, Low Achievement

Correction for Attenuation with Biased Reliability Estimates and Correlated Errors in Populations and Samples

Peer reviewed

Zimmerman, Donald W. – Educational and Psychological Measurement, 2007

Properties of the Spearman correction for attenuation were investigated using Monte Carlo methods, under conditions where correlations between error scores exist as a population parameter and also where correlated errors arise by chance in random sampling. Equations allowing for all possible dependence among true and error scores on two tests at…

Descriptors: Monte Carlo Methods, Correlation, Sampling, Data Analysis
