Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 7 |
Descriptor
Models | 9 |
Scoring | 9 |
Scaling | 7 |
Goodness of Fit | 3 |
Item Response Theory | 3 |
Scores | 3 |
Test Items | 3 |
Academic Achievement | 2 |
Multidimensional Scaling | 2 |
Performance Based Assessment | 2 |
Psychometrics | 2 |
More ▼ |
Source
ProQuest LLC | 2 |
Applied Measurement in… | 1 |
Australian Journal of… | 1 |
Educational Assessment | 1 |
Educational and Psychological… | 1 |
Foreign Language Annals | 1 |
National Center for Research… | 1 |
Society for Research on… | 1 |
Author
Baker, Eva L. | 1 |
Batchelder, William H. | 1 |
Bonner, Sarah M. | 1 |
Cai, Li | 1 |
Cooksey, Ray W. | 1 |
Ercikan, Kadriye | 1 |
Everson, Howard T. | 1 |
France, Stephen L. | 1 |
Koepfler, James R. | 1 |
Luecht, Richard M. | 1 |
Niemi, David | 1 |
More ▼ |
Publication Type
Journal Articles | 5 |
Reports - Evaluative | 3 |
Dissertations/Theses -… | 2 |
Reports - Research | 2 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Secondary Education | 2 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
High Schools | 1 |
More ▼ |
Audience
Location
Australia | 1 |
California | 1 |
New York | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015
Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…
Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory
Shin, Hyo Jeong – ProQuest LLC, 2015
This dissertation is comprised of three papers that propose and apply psychometric models to deal with complexities and challenges in large-scale assessments, focusing on modeling rater effects and complex learning progressions. In particular, three papers investigate extensions and applications of multilevel and multidimensional item response…
Descriptors: Item Response Theory, Psychometrics, Models, Measurement
Cai, Li – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2013
Lord and Wingersky's (1984) recursive algorithm for creating summed score based likelihoods and posteriors has a proven track record in unidimensional item response theory (IRT) applications. Extending the recursive algorithm to handle multidimensionality is relatively simple, especially with fixed quadrature because the recursions can be defined…
Descriptors: Mathematics, Scores, Item Response Theory, Computation
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Thomas, Ally S.; Bonner, Sarah M.; Everson, Howard T. – Society for Research on Educational Effectiveness, 2014
Recently, the authors have been exploring the use of propensity score methods for developing evidence of program impact. Specifically, they have been developing evidence (after one year of implementation) of the effects of the Math Science Partnership in New York City ("MSPinNYC2") on high school students' achievement--both in terms of…
Descriptors: Program Evaluation, Probability, Scores, Scoring
Koepfler, James R. – ProQuest LLC, 2012
Over the past decade, educational policy trends have shifted to a focus on examining students' growth from kindergarten through twelfth grade (K-12). One way states can track students' growth is with a vertical scale. Presently, every state that uses a vertical scale bases the scale on a unidimensional IRT model. These models make a…
Descriptors: Item Response Theory, Models, Scaling, Elementary Secondary Education
Niemi, David; Baker, Eva L.; Sylvester, Roxanne M. – Educational Assessment, 2007
To provide an accurate reading of students' and schools' rates of progress, and to provide cues for instruction, assessment at every level should be connected to explicit learning goals and standards. To show how this requirement can be fulfilled, and how research-based assessment can effectively support learning and instruction, this article…
Descriptors: Student Evaluation, Performance Based Assessment, Scaling, Scoring
Luecht, Richard M. – Foreign Language Annals, 2003
This article contends that the necessary links between constructs and test scores/decisions in language assessment must be established through principled design procedures that align three models: (1) a theoretical construct model; (2) a test development model; and (3) a psychometric scoring model. The theoretical construct model articulates the…
Descriptors: Scoring, Psychometrics, Language Proficiency, Language Tests

Cooksey, Ray W. – Australian Journal of Education, 1993
An Australian study investigated multidimensionality in college entrance examination scores. Data from the American College Testing Program (ACT) and Australian Tertiary Entrance scores (which combines year 12 course grades and scores on the Australian Scholastic Aptitude Test) were analyzed using a 4-dimensional model. Results suggest a single…
Descriptors: College Entrance Examinations, Course Selection (Students), Foreign Countries, Higher Education