Showing 1 to 15 of 16 results
Peer reviewed
Gerhard Tutz; Pascal Jordan – Journal of Educational and Behavioral Statistics, 2024
A general framework of latent trait item response models for continuous responses is given. In contrast to classical test theory (CTT) models, which traditionally distinguish between true scores and error scores, the responses are clearly linked to latent traits. It is shown that CTT models can be derived as special cases, but the model class is…
Descriptors: Item Response Theory, Responses, Scores, Models
Joshua B. Gilbert – Annenberg Institute for School Reform at Brown University, 2022
This simulation study examines the characteristics of the Explanatory Item Response Model (EIRM) when estimating treatment effects when compared to classical test theory (CTT) sum and mean scores and item response theory (IRT)-based theta scores. Results show that the EIRM and IRT theta scores provide generally equivalent bias and false positive…
Descriptors: Item Response Theory, Models, Test Theory, Computation
Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013
The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…
Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores
Peer reviewed
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010
Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as is the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Descriptors: Educational Testing, Scores, Reports, Psychometrics
Peer reviewed
Wicherts, Jelte M.; Scholten, Annemarie Zand – Intelligence, 2010
The validity of cognitive ability tests is often interpreted solely as a function of the cognitive abilities that these tests are supposed to measure, but other factors may be at play. The effects of test anxiety on the criterion-related validity (CRV) of tests were the topic of a recent study by Reeve, Heggestad, and Lievens (2009) (Reeve, C. L.,…
Descriptors: Familiarity, Test Validity, Cognitive Tests, Factor Analysis
Peer reviewed
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Peer reviewed
Buchmann, Claudia; Condron, Dennis J.; Roscigno, Vincent J. – Social Forces, 2010
The authors welcome and appreciate the comments of Eric Grodsky and Sigal Alon on their article "Shadow Education, American Style: Test Preparation, the SAT and College Enrollment." In their comments, Grodsky takes issue with several important theoretical and methodological aspects of their article and Alon highlights key processes…
Descriptors: Race, Educational Mobility, Test Preparation, College Entrance Examinations
Peer reviewed
Grodsky, Eric – Social Forces, 2010
Buchmann, Condron and Roscigno argue in their article, "Shadow Education, American Style: Test Preparation, the SAT and College Enrollment," that the activities in which students engage to prepare for college entrance exams are forms of shadow education, a means by which more advantaged parents seek to pass their privileged status along…
Descriptors: Enrollment, Criticism, Research Problems, Test Preparation
Woolley, Kristin K. – 1996
The theory of score validity has undergone several revisions within the measurement community. The current consensus among professionals is a rejection of the trinitarian doctrine (J. P. Guion, 1980) of score validity and the recognition of a unified view that includes social consequences of test interpretation and use. While some aspects of the…
Descriptors: Models, Scores, Standards, Test Interpretation
Peer reviewed
Zimmerman, Donald W.; Zumbo, Bruno D. – International Journal of Testing, 2001
Presents a model of tests and measurement that identifies test scores with Hilbert space vectors and true and error components of scores with linear operators. This geometric point of view brings to light relations among elementary concepts in test theory, including reliability, validity, and parallel tests. (Author/SLD)
Descriptors: Models, Probability, Reliability, Scores
Peer reviewed
Embretson, Susan E. – Applied Psychological Measurement, 1996
Conditions under which interaction effects estimated from classical total scores, rather than item response theory trait scores, can be misleading are discussed with reference to analysis of variance (ANOVA). When no interaction effects exist on the true latent variable, spurious interaction effects can be observed from the total score scale. (SLD)
Descriptors: Analysis of Variance, Interaction, Item Response Theory, Models
Peer reviewed
Graham, James M. – Educational and Psychological Measurement, 2006
Coefficient alpha, the most commonly used estimate of internal consistency, is often considered a lower bound estimate of reliability, though the extent of its underestimation is not typically known. Many researchers are unaware that coefficient alpha is based on the essentially tau-equivalent measurement model. It is the violation of the…
Descriptors: Models, Test Theory, Reliability, Structural Equation Models
Vos, Hans J. – 1994
Some applications of Bayesian decision theory to intelligent tutoring systems are considered. How the problem of adapting the appropriate amount of instruction to the changing nature of a student's capabilities during the learning process can be situated in the general framework of Bayesian decision theory is discussed in the context of the…
Descriptors: Bayesian Statistics, Decision Making, Foreign Countries, Intelligent Tutoring Systems
Peer reviewed
Hamilton, Lawrence C. – Journal of Educational Measurement, 1981
Errors in self-reports of three academic performance measures are analyzed. Empirical errors are shown to depart radically from both no-error and random-error assumptions. Self-reports by females depart farther from the no-error and random-error models for all three performance measures. (Author/BW)
Descriptors: Academic Achievement, Error Patterns, Grade Point Average, Models
Boyd, Donald; Grossman, Pamela; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – National Center for Analysis of Longitudinal Data in Education Research, 2008
Value-added models in education research allow researchers to explore how a wide variety of policies and measured school inputs affect the academic performance of students. Researchers typically quantify the impacts of such interventions in terms of "effect sizes", i.e., the estimated effect of a one standard deviation change in the…
Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications