Raykov, Tenko; Calvocoressi, Lisa; Schumacker, Randall E. – Measurement: Interdisciplinary Research and Perspectives, 2024
This paper is concerned with the process of selecting between the increasingly popular bi-factor model and the second-order factor model in measurement research. It is indicated that in certain settings widely used in empirical studies, the second-order model is nested in the bi-factor model and obtained from the latter after imposing appropriate…
Descriptors: Factor Analysis, Decision Making, Computer Software, Measurement Techniques
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
Wandersman, Abraham – American Journal of Evaluation, 2014
The Labin et al. logic model describes the why, how, what, and potential outcomes of evaluation capacity building (ECB). Getting To Outcomes offers a frame and empirical results for operationalizing the ECB logic model of Labin et al. and for deepening the science and practice of ECB.
Descriptors: Evaluation, Capacity Building, Methods, Accountability
Guarino, Cassandra M. – Education Policy Center at Michigan State University, 2013
The push for accountability in public schooling has extended to the measurement of teacher performance, accelerated by federal efforts through Race to the Top. Currently, a large number of states and districts across the country are computing measures of teacher performance based on the standardized test scores of their students and using them in…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Models, Program Descriptions
Raykov, Tenko; Patelis, Thanos; Marcoulides, George A. – Educational and Psychological Measurement, 2011
A latent variable modeling approach that can be used to examine whether several psychometric tests are parallel is discussed. The method consists of sequentially testing the properties of parallel measures via a corresponding relaxation of parameter constraints in a saturated model or an appropriately constructed latent variable model. The…
Descriptors: Models, Psychometrics, Evaluation Methods, Evaluation Research
Heene, Moritz – Measurement: Interdisciplinary Research and Perspectives, 2011
Humphry (this issue) deserves credit for drawing attention to the long-neglected fact that differences in item discrimination parameters are often due to empirical factors and not the product of random error components. In doing so, Humphry offers a psychometrically elegant, coherent, and practically important new model that is more flexible while…
Descriptors: Measurement, Item Response Theory, Data, Psychometrics
Braysher, Ben – National Centre for Vocational Education Research (NCVER), 2012
The annual Student Outcomes Survey collects information on the outcomes of two groups of students--those that have completed a qualification (graduates) and those that have completed only part of a course and then left the vocational education and training (VET) system (module completers). At the time of selecting the survey sample, insufficient…
Descriptors: Qualifications, Eligibility, Vocational Education, Graduates
Kyngdon, Andrew – Measurement: Interdisciplinary Research and Perspectives, 2011
Behavioral scientists have struggled with units of measurement for as long as they have struggled with measurement itself. Psychology's sole attempt at an explicit unit of measurement--the Lexile Framework for Reading (Stenner, Burdick, Sanford, & Burdick, 2006)--has been and continues to be ignored by the psychometric "cognoscenti."…
Descriptors: Measurement Techniques, Psychometrics, Behavioral Sciences, Scientists
Wang, Wen-Chung; Shih, Ching-Lin; Yang, Chih-Chien – Educational and Psychological Measurement, 2009
This study implements a scale purification procedure onto the standard MIMIC method for differential item functioning (DIF) detection and assesses its performance through a series of simulations. It is found that the MIMIC method with scale purification (denoted as M-SP) outperforms the standard MIMIC method (denoted as M-ST) in controlling…
Descriptors: Test Items, Measures (Individuals), Test Bias, Evaluation Research
Advantages of the Rasch Measurement Model in Analysing Educational Tests: An Applicator's Reflection
Tormakangas, Kari – Educational Research and Evaluation, 2011
Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…
Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory
Iverson, Geoffrey J.; Wagenmakers, Eric-Jan; Lee, Michael D. – Psychological Methods, 2010
The purpose of the recently proposed "p_rep" statistic is to estimate the probability of concurrence, that is, the probability that a replicate experiment yields an effect of the same sign (Killeen, 2005a). The influential journal "Psychological Science" endorses "p_rep" and recommends its use…
Descriptors: Effect Size, Evaluation Methods, Probability, Experiments
McKenzie, Robert G. – Learning Disability Quarterly, 2009
The assessment procedures within Response to Intervention (RTI) models have begun to supplant the use of traditional, discrepancy-based frameworks for identifying students with specific learning disabilities (SLD). Many RTI proponents applaud this shift because of perceived shortcomings in utilizing discrepancy as an indicator of SLD. However,…
Descriptors: Intervention, Learning Disabilities, Error of Measurement, Psychometrics
Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011
When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education
Wendt, Heike; Bos, Wilfried; Goy, Martin – Educational Research and Evaluation, 2011
Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models" to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…
Descriptors: Measures (Individuals), Test Theory, Group Testing, Educational Testing
Boyd, Don; Grossman, Pam; Lankford, Hamp; Loeb, Susanna; Wyckoff, Jim – National Center for Analysis of Longitudinal Data in Education Research, 2008
The use of value-added models in education research has expanded rapidly. These models allow researchers to explore how a wide variety of policies and measured school inputs affect the academic performance of students. An important question is whether such effects are sufficiently large to achieve various policy goals. Judging whether a change in…
Descriptors: Academic Achievement, Measures (Individuals), Measurement, Error of Measurement