NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)2
Since 2006 (last 20 years)15
Publication Type
Reports - Evaluative29
Journal Articles20
Speeches/Meeting Papers6
Audience
Practitioners1
Laws, Policies, & Programs
Assessments and Surveys
Armed Services Vocational…1
What Works Clearinghouse Rating
Showing 1 to 15 of 29 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2016
The frequently neglected and often misunderstood relationship between classical test theory and item response theory is discussed for the unidimensional case with binary measures and no guessing. It is pointed out that popular item response models can be directly obtained from classical test theory-based models by accounting for the discrete…
Descriptors: Test Theory, Item Response Theory, Models, Correlation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Möller, Jens; Müller-Kalthoff, Hanno; Helm, Friederike; Nagy, Nicole; Marsh, Herb W. – Frontline Learning Research, 2016
The dimensional comparison theory (DCT) focuses on the effects of internal, dimensional comparisons (e.g., "How good am I in math compared to English?") on academic self-concepts with widespread consequences for students' self-evaluation, motivation, and behavioral choices. DCT is based on the internal/external frame of reference model…
Descriptors: Comparative Analysis, Comparative Testing, Self Concept, Self Concept Measures
Peer reviewed Peer reviewed
Direct linkDirect link
Mislevy, Robert J. – Teachers College Record, 2014
Background/Context: This article explains the idea of a neopragmatic postmodernist test theory and offers some thoughts about what changing notions concerning the nature of and meanings assigned to knowledge imply for educational assessment, present and future. Purpose: Advances in the learning sciences--particularly situative and sociocognitive…
Descriptors: Test Theory, Postmodernism, Educational Assessment, Educational Trends
Peer reviewed Peer reviewed
Direct linkDirect link
Wicherts, Jelte M.; Scholten, Annemarie Zand – Intelligence, 2010
The validity of cognitive ability tests is often interpreted solely as a function of the cognitive abilities that these tests are supposed to measure, but other factors may be at play. The effects of test anxiety on the criterion related validity (CRV) of tests was the topic of a recent study by Reeve, Heggestad, and Lievens (2009) (Reeve, C. L.,…
Descriptors: Familiarity, Test Validity, Cognitive Tests, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
van der Linden, Wim J. – Journal of Educational Measurement, 2009
Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…
Descriptors: Test Items, Models, Reaction Time, Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Poliandri, Donatella; Cardone, Michele; Muzzioli, Paola; Romiti, Sara – Online Submission, 2011
The purpose of this study is to validate a test anxiety scale for Italian students. The scale is part of a questionnaire administered after the students' annual competence test by the National Institute for the Educational Evaluation of Instruction and Training (INVALSI). The aim of the scale is to explore the anxiety levels of Italian students…
Descriptors: Reading Comprehension, Standardized Tests, Rating Scales, Questionnaires
Peer reviewed Peer reviewed
Direct linkDirect link
van der Linden, Wim J. – Applied Psychological Measurement, 2009
An adaptive testing method is presented that controls the speededness of a test using predictions of the test takers' response times on the candidate items in the pool. Two different types of predictions are investigated: posterior predictions given the actual response times on the items already administered and posterior predictions that use the…
Descriptors: Simulation, Adaptive Testing, Vocational Aptitude, Bayesian Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009
Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…
Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability
Peer reviewed Peer reviewed
Direct linkDirect link
Howell, Roy D.; Breivik, Einar; Wilcox, James B. – Psychological Methods, 2007
The relationship between observable responses and the latent constructs they are purported to measure has received considerable attention recently, with particular focus on what has become known as formative measurement. This alternative to reflective measurement in the area of theory-testing research is examined in the context of the potential…
Descriptors: Researchers, Item Response Theory, Formative Evaluation, Test Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011
Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models", to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…
Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Chen, Yi-Hsin; Gorin, Joanna S.; Thompson, Marilyn S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2008
As with any test administered across linguistically and culturally diverse groups, evidence suggesting the equivalence of score meaning across countries is needed for valid comparisons. The current study examines the cross-cultural equivalence of score interpretations from the Trends in International Mathematics and Science Study (TIMSS)-1999 from…
Descriptors: Construct Validity, Mathematics Tests, Foreign Countries, Equated Scores
Woolley, Kristin K. – 1996
The theory of score validity has undergone several revisions within the measurement community. The current consensus among professionals is a rejection of the trinitarian doctrine (J. P. Guion, 1980) of score validity and the recognition of a unified view that includes social consequences of test interpretation and use. While some aspects of the…
Descriptors: Models, Scores, Standards, Test Interpretation
Mislevy, Robert J.; And Others – 1990
The models of standard test theory, having evolved under a trait-oriented psychology, do not reflect the knowledge structures and the problem-solving strategies now seen as central to understanding performance and learning. In some applications, however, key qualitative distinctions among persons as to structures and strategies can be expressed…
Descriptors: Learning Strategies, Models, Problem Solving, Spatial Ability
Peer reviewed Peer reviewed
Embretson, Susan E. – Applied Psychological Measurement, 1996
Conditions under which interaction effects estimated from classical total scores, rather than item response theory trait scores, can be misleading are discussed with reference to analysis of variance (ANOVA). When no interaction effects exist on the true latent variable, spurious interaction effects can be observed from the total score scale. (SLD)
Descriptors: Analysis of Variance, Interaction, Item Response Theory, Models
Previous Page | Next Page »
Pages: 1  |  2