ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	7

Descriptor

Models	9
Scoring	9
Scaling	7
Goodness of Fit	3
Item Response Theory	3
Scores	3
Test Items	3
Academic Achievement	2
Multidimensional Scaling	2
Performance Based Assessment	2
Psychometrics	2
Test Construction	2
Algebra	1
Answer Keys	1
Automation	1
Biology	1
Classification	1
College Entrance Examinations	1
College Readiness	1
Computation	1
Construct Validity	1
Correlation	1
Course Selection (Students)	1
Critical Thinking	1
Culture Fair Tests	1
More ▼

Source

ProQuest LLC	2
Applied Measurement in…	1
Australian Journal of…	1
Educational Assessment	1
Educational and Psychological…	1
Foreign Language Annals	1
National Center for Research…	1
Society for Research on…	1

Author

Baker, Eva L.	1
Batchelder, William H.	1
Bonner, Sarah M.	1
Cai, Li	1
Cooksey, Ray W.	1
Ercikan, Kadriye	1
Everson, Howard T.	1
France, Stephen L.	1
Koepfler, James R.	1
Luecht, Richard M.	1
Niemi, David	1
Oliveri, María Elena	1
Shin, Hyo Jeong	1
Sylvester, Roxanne M.	1
Thomas, Ally S.	1
More ▼

Publication Type

Journal Articles	5
Reports - Evaluative	3
Dissertations/Theses -…	2
Reports - Research	2
Opinion Papers	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	2
Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Location

Australia	1
California	1
New York	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Maximum Likelihood Item Easiness Models for Test Theory without an Answer Key

Peer reviewed

Direct link

France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015

Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…

Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory

Modeling Rater Effects and Complex Learning Progressions Using Item Response Models

Direct link

Shin, Hyo Jeong – ProQuest LLC, 2015

This dissertation is comprised of three papers that propose and apply psychometric models to deal with complexities and challenges in large-scale assessments, focusing on modeling rater effects and complex learning progressions. In particular, three papers investigate extensions and applications of multilevel and multidimensional item response…

Descriptors: Item Response Theory, Psychometrics, Models, Measurement

Lord-Wingersky Algorithm Version 2.0 for Hierarchical Item Factor Models with Applications in Test Scoring, Scale Alignment, and Model Fit Testing. CRESST Report 830

Download full text

Cai, Li – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2013

Lord and Wingersky's (1984) recursive algorithm for creating summed score based likelihoods and posteriors has a proven track record in unidimensional item response theory (IRT) applications. Extending the recursive algorithm to handle multidimensionality is relatively simple, especially with fixed quadrature because the recursions can be defined…

Descriptors: Mathematics, Scores, Item Response Theory, Computation

In Search of Validity Evidence in Support of the Interpretation and Use of Assessments of Complex Constructs: Discussion of Research on Assessing 21st Century Skills

Peer reviewed

Direct link

Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016

Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…

Descriptors: Evaluation Methods, Test Construction, Design, Scaling

Evaluating Phase II of a New York City-Wide STEM Initiative Using Propensity Score Methods: A Replication Study

Peer reviewed
PDF on ERIC

Download full text

Thomas, Ally S.; Bonner, Sarah M.; Everson, Howard T. – Society for Research on Educational Effectiveness, 2014

Recently, the authors have been exploring the use of propensity score methods for developing evidence of program impact. Specifically, they have been developing evidence (after one year of implementation) of the effects of the Math Science Partnership in New York City ("MSPinNYC2") on high school students' achievement--both in terms of…

Descriptors: Program Evaluation, Probability, Scores, Scoring

Examining the Bifactor IRT Model for Vertical Scaling in K-12 Assessment

Direct link

Koepfler, James R. – ProQuest LLC, 2012

Over the past decade, educational policy trends have shifted to a focus on examining students' growth from kindergarten through twelfth grade (K-12). One way states can track students' growth is with a vertical scale. Presently, every state that uses a vertical scale bases the scale on a unidimensional IRT model. These models make a…

Descriptors: Item Response Theory, Models, Scaling, Elementary Secondary Education

Scaling Up, Scaling Down: Seven Years of Performance Assessment Development in the Nation's Second Largest School District

Peer reviewed

Direct link

Niemi, David; Baker, Eva L.; Sylvester, Roxanne M. – Educational Assessment, 2007

To provide an accurate reading of students' and schools' rates of progress, and to provide cues for instruction, assessment at every level should be connected to explicit learning goals and standards. To show how this requirement can be fulfilled, and how research-based assessment can effectively support learning and instruction, this article…

Descriptors: Student Evaluation, Performance Based Assessment, Scaling, Scoring

Multistage Complexity in Language Proficiency Assessment: A Framework for Aligning Theoretical Perspectives, Test Development, and Psychometrics

Peer reviewed

Direct link

Luecht, Richard M. – Foreign Language Annals, 2003

This article contends that the necessary links between constructs and test scores/decisions in language assessment must be established through principled design procedures that align three models: (1) a theoretical construct model; (2) a test development model; and (3) a psychometric scoring model. The theoretical construct model articulates the…

Descriptors: Scoring, Psychometrics, Language Proficiency, Language Tests

The Problem of Multidimensionality in Course Scores and Course Choices in the Production of a Single Year 12 Tertiary Entrance Score.

Peer reviewed

Cooksey, Ray W. – Australian Journal of Education, 1993

An Australian study investigated multidimensionality in college entrance examination scores. Data from the American College Testing Program (ACT) and Australian Tertiary Entrance scores (which combines year 12 course grades and scores on the Australian Scholastic Aptitude Test) were analyzed using a 4-dimensional model. Results suggest a single…

Descriptors: College Entrance Examinations, Course Selection (Students), Foreign Countries, Higher Education