ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	11

Descriptor

Error of Measurement	32
Generalizability Theory	32
Scores	8
Test Items	8
Academic Achievement	7
Measurement Techniques	7
Test Reliability	7
Interrater Reliability	6
Reliability	6
Educational Assessment	5
Estimation (Mathematics)	5
Performance Based Assessment	5
Scoring	5
Test Theory	5
Cutting Scores	4
Sampling	4
Test Interpretation	4
Analysis of Variance	3
Correlation	3
Equations (Mathematics)	3
Foreign Countries	3
Mathematical Models	3
Multivariate Analysis	3
Research Design	3
Second Language Learning	3
More ▼

Source

Educational and Psychological…	6
Educational Measurement:…	2
International Journal of…	2
Journal of Educational…	2
Journal of Educational…	2
Applied Measurement in…	1
Applied Psychological…	1
Educational Researcher	1
Educational Testing Service	1
Evaluation Review	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Language Testing	1
National Center for Analysis…	1
Psychometrika	1
More ▼

Publication Type

Reports - Evaluative	32
Journal Articles	22
Speeches/Meeting Papers	7
Information Analyses	1

Education Level

Elementary Education	2
Elementary Secondary Education	2
Grade 3	2
Grade 5	2
Grade 1	1
Grade 4	1
Grade 7	1
Higher Education	1

Audience

Researchers

Location

Haiti	1
Iowa	1
New York	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Trends in International…	1
Work Keys (ACT)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

Reliability and Validity of Inferences about Teachers Based on Student Scores. William H. Angoff Memorial Lecture Series

Download full text

Haertel, Edward H. – Educational Testing Service, 2013

Policymakers and school administrators have embraced value-added models of teacher effectiveness as tools for educational improvement. Teacher value-added estimates may be viewed as complicated scores of a certain kind. This suggests using a test validation model to examine their reliability and validity. Validation begins with an interpretive…

Descriptors: Reliability, Validity, Inferences, Teacher Effectiveness

Applying Rasch Model and Generalizability Theory to Study Modified-Angoff Cut Scores

Peer reviewed

Direct link

Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012

The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…

Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling

Correcting Fallacies in Validity, Reliability, and Classification

Peer reviewed

Direct link

Sijtsma, Klaas – International Journal of Testing, 2009

This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…

Descriptors: Construct Validity, Reliability, Classification, Test Theory

Unbiased Estimates of Variance Components with Bootstrap Procedures

Peer reviewed

Direct link

Brennan, Robert L. – Educational and Psychological Measurement, 2007

This article provides general procedures for obtaining unbiased estimates of variance components for any random-model balanced design under any bootstrap sampling plan, with the focus on designs of the type typically used in generalizability theory. The results reported here are particularly helpful when the bootstrap is used to estimate standard…

Descriptors: Generalizability Theory, Error of Measurement, Statistical Analysis

Estimating Standard Errors of Cut Scores for Item Rating and Mapmark Procedures: A Generalizability Theory Approach

Peer reviewed

Direct link

Yin, Ping; Sconing, James – Educational and Psychological Measurement, 2008

Standard-setting methods are widely used to determine cut scores on a test that examinees must meet for a certain performance standard. Because standard setting is a measurement procedure, it is important to evaluate variability of cut scores resulting from the standard-setting process. Generalizability theory is used in this study to estimate…

Descriptors: Generalizability Theory, Standard Setting, Cutting Scores, Test Items

Bootstrap Estimates of Standard Errors in Generalizability Theory

Peer reviewed

Direct link

Tong, Ye; Brennan, Robert L. – Educational and Psychological Measurement, 2007

Estimating standard errors of estimated variance components has long been a challenging task in generalizability theory. Researchers have speculated about the potential applicability of the bootstrap for obtaining such estimates, but they have identified problems (especially bias) in using the bootstrap. Using Brennan's bias-correcting procedures…

Descriptors: Error of Measurement, Generalizability Theory, Computation, Simulation

Who Is Given Tests in What Language by Whom, When, and Where? The Need for Probabilistic Views of Language in the Testing of English Language Learners

Peer reviewed

Direct link

Solano-Flores, Guillermo – Educational Researcher, 2008

The testing of English language learners (ELLs) is, to a large extent, a random process because of poor implementation and factors that are uncertain or beyond control. Yet current testing practices and policies appear to be based on deterministic views of language and linguistic groups and erroneous assumptions about the capacity of assessment…

Descriptors: Generalizability Theory, Testing, Second Language Learning, Error of Measurement

Why Generalizability Theory Is Essential and Classical Test Theory Is Often Inadequate.

Download full text

Kieffer, Kevin M. – 1998

This paper discusses the benefits of using generalizabilty theory in lieu of classical test theory. Generalizability theory subsumes and extends the precepts of classical test theory by estimating the magnitude of multiple sources of measurement error and their interactions simultaneously in a single analysis. Since classical test theory examines…

Descriptors: Error of Measurement, Generalizability Theory, Heuristics, Interaction

The Information in Multiple Ratings

Peer reviewed

Direct link

Bock, R. Darrell; Brennan, Robert L.; Muraki, Eiji – Applied Psychological Measurement, 2002

In assessment programs where scores are reported for individual examinees, it is desirable to have responses to performance exercises graded by more than one rater. If more than one item on each test form is so graded, it is also desirable that different raters grade the responses of any one examinee. This gives rise to sampling designs in which…

Descriptors: Generalizability Theory, Test Items, Item Response Theory, Error of Measurement

A Comparison of Methods of Estimating Conditional Standard Errors of Measurement for Testlet-based Test Scores.

Peer reviewed

Lee, Guemin – Journal of Educational Measurement, 2000

Studied the appropriateness and implications of incorporating a testlet definition into the estimation of procedures of the conditional standard error of measurement (SEM) for tests composed of testlets. Simulation results for several methods show that an item-based method using a generalizability theory model provided good estimates of the…

Descriptors: Comparative Analysis, Error of Measurement, Estimation (Mathematics), Generalizability Theory

The Precision of Measurements.

Peer reviewed

Kane, Michael – Applied Measurement in Education, 1996

This overview of the role of error and tolerance for error in measurement asserts that the generic precision associated with a measurement procedure is defined as the root mean square error, or standard error, in some relevant population. This view of precision is explored in several applications of measurement. (SLD)

Descriptors: Error of Measurement, Error Patterns, Generalizability Theory, Measurement Techniques

A Multivariate Generalizability Analysis of the Multistate Bar Examination

Peer reviewed

Direct link

Yin, Ping – Educational and Psychological Measurement, 2005

The main purpose of this study is to examine the content structure of the Multistate Bar Examination (MBE) using the "table of specifications" model from the perspective of multivariate generalizability theory. Specifically, using MBE data collected over different years (six administrations: three from the February test and three from July test),…

Descriptors: Correlation, Generalizability Theory, Statistical Analysis, Multivariate Analysis

Error Sources Influencing Performance Assessment Reliability or Generalizability: A Meta Analysis.

Download full text

Jiang, Ying Hong; And Others – 1997

As performance-based assessments have gained wider use, there are increasing concerns about their dependability. This study is a synthesis of existing studies regarding the reliability or generalizability of performance assessments. The meta-analysis involves summarizing, examining, and evaluating research findings. Articles on the dependability…

Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Judges

Simultaneous Confidence Intervals for the Linear Functions of Expected Means Squares used in Generalizability Theory.

Peer reviewed

Bell, John F. – Journal of Educational Statistics, 1986

Khuri's and Satterthwaite's methods of obtaining confidence intervals of variance components are compared. The article discusses that Khuri's method may be applied to obtain confidence intervals for the variance components and other linear functions of the expected mean squares used in generalizability theory. (Author/JAZ)

Descriptors: Analysis of Variance, Elementary Education, Equations (Mathematics), Error of Measurement

Alternative Solutions for Optimization Problems in Generalizability Theory.

Peer reviewed

Sanders, Piet F. – Psychometrika, 1992

Presents solutions for the problem of maximizing the generalizability coefficient under a budget constraint. Shows that the Cauchy-Schwarz inequality can be applied to derive optimal continuous solutions for the number of conditions of each facet. Illustrates the formal similarity between optimization problems in survey sampling and…

Descriptors: Budgeting, Cost Effectiveness, Equations (Mathematics), Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2 | 3

Brennan, Robert L.	5
Shavelson, Richard J.	2
Solano-Flores, Guillermo	2
Yin, Ping	2
Arce, Alvaro J.	1
Axtell, Philip K.	1
Bell, John F.	1
Bock, R. Darrell	1
Boyd, Donald	1
Colton, Dean A.	1
Cronbach, Lee J.	1
Crowley, Susan	1
Fawson, Parker C.	1
Grossman, Pamela	1
Haertel, Edward H.	1
Jiang, Ying Hong	1
Johnson, Eugene G.	1
Kane, Michael	1
Kieffer, Kevin M.	1
Lankford, Hamilton	1
Lee, Guemin	1
Li, Min	1
Loeb, Susanna	1
Ludlow, Brian C.	1
More ▼