Publication Date
  In 2025: 0
  Since 2024: 2
  Since 2021 (last 5 years): 13
  Since 2016 (last 10 years): 35
  Since 2006 (last 20 years): 80
Descriptor
  Validity: 63
  Test Validity: 59
  Test Construction: 35
  Scores: 33
  Test Items: 32
  Item Response Theory: 20
  Evaluation Methods: 19
  Achievement Tests: 18
  Test Interpretation: 18
  Educational Assessment: 17
  Reliability: 17
Source
  Applied Measurement in Education: 136
Education Level
  Elementary Secondary Education: 13
  High Schools: 13
  Secondary Education: 12
  Elementary Education: 10
  Higher Education: 10
  Grade 8: 9
  Middle Schools: 9
  Grade 4: 6
  Junior High Schools: 6
  Postsecondary Education: 5
  Grade 12: 4
Location
  Canada: 4
  Germany: 2
  Massachusetts: 2
  Arizona: 1
  Australia: 1
  California: 1
  California (Los Angeles): 1
  Finland: 1
  France: 1
  Israel: 1
  Italy: 1
Laws, Policies, & Programs
  Race to the Top: 1
Lederman, Josh – Applied Measurement in Education, 2023
Given validity's centrality to assessment, until the concept includes concern for racial justice, such matters will be seen as residing outside the "real" work of validation, rendering them powerless to count against the apparent scientific merit of the test. As the definition of validity has evolved, however, it holds great…
Descriptors: Educational Assessment, Validity, Social Justice, Race
Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024
Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…
Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction
Lee, Samuel David; Walmsley, Philip T.; Sackett, Paul R.; Kuncel, Nathan – Applied Measurement in Education, 2021
Providing assessment validity information to decision makers in a clear and useful format is an ongoing challenge for the educational and psychological measurement community. We identify issues with a previously proposed graphical presentation, noting that it is mislabeled as presenting incremental validity when in fact it displays the effects…
Descriptors: Test Validity, Predictor Variables, Charts
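For readers unfamiliar with the term, incremental validity is conventionally quantified as the gain in explained variance when the test is added to a baseline set of predictors. A minimal sketch of that computation, with invented data and predictor names (not the authors' analysis):

```python
import numpy as np

def r_squared(X, y):
    """R^2 from an ordinary least-squares fit with an intercept."""
    X1 = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ beta
    return 1.0 - resid.var() / y.var()

rng = np.random.default_rng(1)
n = 500
baseline = rng.normal(size=n)                         # e.g., prior GPA (hypothetical)
test = 0.5 * baseline + rng.normal(size=n)            # new test score, correlated with baseline
y = 0.4 * baseline + 0.3 * test + rng.normal(size=n)  # simulated outcome

r2_base = r_squared(baseline[:, None], y)
r2_full = r_squared(np.column_stack([baseline, test]), y)
print(r2_full - r2_base)   # incremental validity as a gain in R^2
```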
Marcelo Andrade da Silva; A. Corinne Huggins-Manley; Jorge Luis Bazán; Amber Benedict – Applied Measurement in Education, 2024
A Q-matrix is a binary matrix that defines the relationship between items and latent variables; it is widely used in diagnostic classification models (DCMs) and can also be adopted in multidimensional item response theory (MIRT) models. The construction process of the Q-matrix is typically carried out by experts in the subject area of the items…
Descriptors: Q Methodology, Matrices, Item Response Theory, Educational Assessment
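As a concrete illustration of the definition above, a Q-matrix for three hypothetical items and two latent attributes might look like the following (entries are invented to show the structure, not drawn from the article):

```python
import numpy as np

# A Q-matrix: rows index items, columns index latent attributes.
# Q[i, k] = 1 means item i measures attribute k; 0 means it does not.
Q = np.array([
    [1, 0],   # item 1 measures attribute 1 only
    [0, 1],   # item 2 measures attribute 2 only
    [1, 1],   # item 3 measures both attributes
])

# One basic check an expert-specified Q-matrix should pass:
# every item must measure at least one attribute.
assert Q.sum(axis=1).min() >= 1
```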
Wise, Steven; Kuhfeld, Megan – Applied Measurement in Education, 2021
Effort-moderated (E-M) scoring is intended to estimate how well a disengaged test taker would have performed had they been fully engaged. It accomplishes this adjustment by excluding disengaged responses from scoring and estimating performance from the remaining responses. The scoring method, however, assumes that the remaining responses are not…
Descriptors: Scoring, Achievement Tests, Identification, Validity
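A minimal sketch of the effort-moderated idea as summarized above, assuming rapid (disengaged) responses are flagged by per-item response-time thresholds; the simple proportion-correct rescoring here stands in for the IRT-based estimation used in practice, and all names and data are hypothetical:

```python
import numpy as np

def effort_moderated_score(responses, rt, thresholds):
    """Proportion correct computed from engaged responses only.

    responses  -- 0/1 item scores
    rt         -- response times in seconds
    thresholds -- per-item rapid-guessing thresholds in seconds
    Operational E-M scoring adjusts an IRT ability estimate rather
    than a proportion-correct score; this is only an illustration.
    """
    engaged = rt >= thresholds          # drop disengaged (rapid) responses
    if not engaged.any():
        return np.nan                   # nothing left to score
    return responses[engaged].mean()

responses = np.array([1, 0, 1, 1, 0])
rt = np.array([12.0, 1.1, 9.5, 0.8, 15.2])   # items 2 and 4 look rapid
thresholds = np.full(5, 3.0)                 # 3-second threshold per item
print(effort_moderated_score(responses, rt, thresholds))  # -> 0.666...
```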
Leighton, Jacqueline P. – Applied Measurement in Education, 2021
The objective of this paper is to comment on the think-aloud methods presented in the three papers included in this special issue. The commentary offered stems from the author's own psychological investigations of unobservable information processes and the conditions under which the most defensible claims can be advanced. The structure of this…
Descriptors: Protocol Analysis, Data Collection, Test Construction, Test Validity
Finch, Holmes – Applied Measurement in Education, 2022
Much research has been devoted to identification of differential item functioning (DIF), which occurs when the item responses for individuals from two groups differ after they are conditioned on the latent trait being measured by the scale. There has been less work examining differential step functioning (DSF), which is present for polytomous…
Descriptors: Comparative Analysis, Item Response Theory, Item Analysis, Simulation
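To make the DIF/DSF distinction concrete: one way to examine a single step of a polytomous item is to dichotomize the item at that step and compare groups within matched strata, for example with a Mantel-Haenszel common odds ratio. A rough sketch under those assumptions (simulated data; real DSF analyses typically match on total score and use purpose-built indices):

```python
import numpy as np

def mh_common_odds_ratio(y, group, strata):
    """Mantel-Haenszel common odds ratio of success (y == 1) for the
    focal group (group == 1) relative to the reference group,
    pooled over matching strata."""
    num = den = 0.0
    for s in np.unique(strata):
        m = strata == s
        a = np.sum((y == 1) & (group == 1) & m)   # focal, success
        b = np.sum((y == 0) & (group == 1) & m)   # focal, failure
        c = np.sum((y == 1) & (group == 0) & m)   # reference, success
        d = np.sum((y == 0) & (group == 0) & m)   # reference, failure
        n = m.sum()
        num += a * d / n
        den += b * c / n
    return num / den

# Simulated polytomous item scored 0-3; step k is the event score >= k.
rng = np.random.default_rng(2)
n = 1000
group = rng.integers(0, 2, size=n)
theta = rng.normal(size=n)                       # simulated proficiency
item = np.clip(np.round(theta + 0.4 * group +    # group effect => DSF
                        rng.normal(scale=0.7, size=n) + 1.5), 0, 3)
strata = np.digitize(theta, np.quantile(theta, [0.25, 0.5, 0.75]))
for k in (1, 2, 3):
    print(k, mh_common_odds_ratio((item >= k).astype(int), group, strata))
```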
Carney, Michele; Paulding, Katie; Champion, Joe – Applied Measurement in Education, 2022
Teachers need ways to efficiently assess students' cognitive understanding. One promising approach involves easily adapted and administered item types that yield quantitative scores interpretable in terms of whether students likely possess key understandings. This study illustrates an approach to analyzing response process…
Descriptors: Middle School Students, Logical Thinking, Mathematical Logic, Problem Solving
Bostic, Jonathan David – Applied Measurement in Education, 2021
Think alouds are valuable tools for academicians, test developers, and practitioners as they provide a unique window into a respondent's thinking during an assessment. The purpose of this special issue is to highlight novel ways to use think alouds as a means to gather evidence about respondents' thinking. An intended outcome from this special…
Descriptors: Protocol Analysis, Cognitive Processes, Data Collection, STEM Education
Wise, Steven L. – Applied Measurement in Education, 2019
Identifying rapid guessing is important for promoting the validity of achievement test scores, particularly on low-stakes tests. Effective identification requires reliable response-time thresholds that are aligned with test taker behavior. Although several common threshold methods are based on rapid guessing response…
Descriptors: Guessing (Tests), Identification, Reaction Time, Reliability
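One widely used family of threshold methods sets a per-item cutoff at a fixed fraction of the item's mean response time (a "normative threshold"); the sketch below assumes that approach, with simulated response times rather than data from the article:

```python
import numpy as np

def normative_thresholds(rt_matrix, fraction=0.10):
    """Per-item rapid-guessing thresholds as a fraction of mean item RT.

    rt_matrix -- examinees x items array of response times (seconds).
    A response faster than its item's threshold is flagged as a rapid
    guess; the 10% default mirrors the common normative heuristic.
    """
    return fraction * rt_matrix.mean(axis=0)

rng = np.random.default_rng(0)
rt = rng.gamma(shape=4.0, scale=5.0, size=(200, 10))  # simulated RTs
thresholds = normative_thresholds(rt)
rapid = rt < thresholds              # boolean flag per response
print(rapid.mean())                  # overall rapid-guessing rate
```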
Mo, Ya; Carney, Michele; Cavey, Laurie; Totorica, Tatia – Applied Measurement in Education, 2021
There is a need for assessment items that assess complex constructs but can also be efficiently scored for evaluation of teacher education programs. In an effort to measure the construct of teacher attentiveness in an efficient and scalable manner, we are using exemplar responses elicited by constructed-response item prompts to develop…
Descriptors: Protocol Analysis, Test Items, Responses, Mathematics Teachers
Karadavut, Tugba – Applied Measurement in Education, 2021
Mixture IRT models address the heterogeneity in a population by extracting latent classes and allowing item parameters to vary between latent classes. Once the latent classes are extracted, they need to be further examined to be characterized. Some approaches have been adopted in the literature for this purpose. These approaches examine either the…
Descriptors: Item Response Theory, Models, Test Items, Maximum Likelihood Statistics
Carney, Michele; Crawford, Angela; Siebert, Carl; Osguthorpe, Rich; Thiede, Keith – Applied Measurement in Education, 2019
The "Standards for Educational and Psychological Testing" recommend an argument-based approach to validation that involves a clear statement of the intended interpretation and use of test scores, the identification of the underlying assumptions and inferences in that statement--termed the interpretation/use argument, and gathering of…
Descriptors: Inquiry, Test Interpretation, Validity, Scores
Krupa, Erin Elizabeth; Carney, Michele; Bostic, Jonathan – Applied Measurement in Education, 2019
This article provides a brief introduction to the set of four articles in the special issue. To provide a foundation for the issue, key terms are defined, a brief historical overview of validity is provided, and a description of several different validation approaches used in the issue are explained. Finally, the contribution of the articles to…
Descriptors: Test Items, Program Validation, Test Validity, Mathematics Education
Myers, Aaron J.; Ames, Allison J.; Leventhal, Brian C.; Holzman, Madison A. – Applied Measurement in Education, 2020
When rating performance assessments, raters may ascribe different scores to the same performance when rubric application does not align with the intended application of the scoring criteria. Given that performance assessment score interpretation assumes raters apply rubrics as rubric developers intended, misalignment between raters' scoring processes…
Descriptors: Scoring Rubrics, Validity, Item Response Theory, Interrater Reliability