ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	16

Source

Applied Measurement in…

Publication Type

Journal Articles	25
Reports - Research	15
Reports - Evaluative	9
Information Analyses	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	6
Elementary Education	4
Secondary Education	4
Grade 3	2
Grade 5	2
Higher Education	2
Intermediate Grades	2
Middle Schools	2
Postsecondary Education	2
Grade 11	1
Grade 4	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
More ▼

Audience

Location

Germany	2
Ohio	2
California	1
Canada	1
Finland	1
France	1
Indiana	1
Iowa	1
Italy	1
Jordan	1
Kansas	1
Massachusetts	1
Michigan	1
Minnesota	1
Oregon	1
Romania	1
Russia	1
United Kingdom	1
United Kingdom (Northern…	1
Vermont	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	3
Program for International…	3
Trends in International…	3
Iowa Tests of Basic Skills	1
Iowa Tests of Educational…	1
Measures of Academic Progress	1
Progress in International…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Using Content Relevance and Representativeness Indices in Instrument Revision

Peer reviewed

Direct link

Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024

Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…

Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction

The Impact of Test-Taking Disengagement on Item Content Representation

Peer reviewed

Direct link

Wise, Steven L. – Applied Measurement in Education, 2020

In achievement testing there is typically a practical requirement that the set of items administered should be representative of some target content domain. This is accomplished by establishing test blueprints specifying the content constraints to be followed when selecting the items for a test. Sometimes, however, students give disengaged…

Descriptors: Test Items, Test Content, Achievement Tests, Guessing (Tests)

Not-Reached Items: An Issue of Time and of Test-Taking Disengagement? The Case of PISA 2015 Reading Data

Peer reviewed

Direct link

Pools, Elodie – Applied Measurement in Education, 2022

Many low-stakes assessments, such as international large-scale surveys, are administered during time-limited testing sessions and some test-takers are not able to endorse the last items of the test, resulting in not-reached (NR) items. However, because the test has no consequence for the respondents, these NR items can also stem from quitting the…

Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students

Gauging Uncertainty in Test-to-Curriculum Alignment Indices

Peer reviewed

Direct link

Traynor, Anne; Li, Tingxuan; Zhou, Shuqi – Applied Measurement in Education, 2020

During the development of large-scale school achievement tests, panels of independent subject-matter experts use systematic judgmental methods to rate the correspondence between a given test's items and performance objective statements. The individual experts' ratings may then be used to compute summary indices to quantify the match between a…

Descriptors: Alignment (Education), Achievement Tests, Curriculum, Error of Measurement

The Trade-Off between Model Fit, Invariance, and Validity: The Case of PISA Science Assessments

Peer reviewed

Direct link

El Masri, Yasmine H.; Andrich, David – Applied Measurement in Education, 2020

In large-scale educational assessments, it is generally required that tests are composed of items that function invariantly across the groups to be compared. Despite efforts to ensure invariance in the item construction phase, for a range of reasons (including the security of items) it is often necessary to account for differential item…

Descriptors: Models, Goodness of Fit, Test Validity, Achievement Tests

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Are Multiple-Choice Items Too Fat?

Peer reviewed

Direct link

Haladyna, Thomas M.; Rodriguez, Michael C.; Stevens, Craig – Applied Measurement in Education, 2019

The evidence is mounting regarding the guidance to employ more three-option multiple-choice items. From theoretical analyses, empirical results, and practical considerations, such items are of equal or higher quality than four- or five-option items, and more items can be administered to improve content coverage. This study looks at 58 tests,…

Descriptors: Multiple Choice Tests, Test Items, Testing Problems, Guessing (Tests)

Effort Analysis: Individual Score Validation of Achievement Test Data

Peer reviewed

Direct link

Wise, Steven L. – Applied Measurement in Education, 2015

Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…

Descriptors: Achievement Tests, Scores, Validity, Test Items

Effects of Item Modifications on Test Accessibility for Persistently Low-Performing Students with Disabilities

Peer reviewed

Direct link

Cohen, Dale J.; Zhang, Jin; Wothke, Werner – Applied Measurement in Education, 2019

Construct-irrelevant cognitive complexity of some items in the statewide grade-level assessments may impose performance barriers for students with disabilities who are ineligible for alternate assessments based on alternate achievement standards. This has spurred research into whether items can be modified to reduce complexity without affecting…

Descriptors: Test Items, Accessibility (for Disabled), Students with Disabilities, Low Achievement

Negative Keying Effects in the Factor Structure of TIMSS 2011 Motivation Scales and Associations with Reading Achievement

Peer reviewed

Direct link

Michaelides, Michalis P. – Applied Measurement in Education, 2019

The Student Background survey administered along with achievement tests in studies of the International Association for the Evaluation of Educational Achievement includes scales of student motivation, competence, and attitudes toward mathematics and science. The scales consist of positively- and negatively keyed items. The current research…

Descriptors: International Assessment, Achievement Tests, Mathematics Achievement, Mathematics Tests

Focusing on Interactions between Content and Cognition: A New Perspective on Gender Differences in Mathematical Sub-Competencies

Peer reviewed

Direct link

George, Ann Cathrice; Robitzsch, Alexander – Applied Measurement in Education, 2018

This article presents a new perspective on measuring gender differences in the large-scale assessment study Trends in International Science Study (TIMSS). The suggested empirical model is directly based on the theoretical competence model of the domain mathematics and thus includes the interaction between content and cognitive sub-competencies.…

Descriptors: Achievement Tests, Elementary Secondary Education, Mathematics Achievement, Mathematics Tests

Sensitivity of Cross-State Assessment Item Difficulty to Differences in State Curricular Content Standards

Peer reviewed

Direct link

Traynor, Anne – Applied Measurement in Education, 2017

It has long been argued that U.S. states' differential performance on nationwide assessments may reflect differences in students' opportunity to learn the tested content that is primarily due to variation in curricular content standards, rather than in instructional quality or educational investment. To quantify the effect of differences in…

Descriptors: Test Items, Difficulty Level, State Standards, Academic Standards

Measurement Properties of Two Innovative Item Formats in a Computer-Based Test

Peer reviewed

Direct link

Wan, Lei; Henly, George A. – Applied Measurement in Education, 2012

Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…

Descriptors: Test Items, Test Format, Computer Assisted Testing, Measurement

Correlates of Rapid-Guessing Behavior in Low-Stakes Testing: Implications for Test Development and Measurement Practice

Peer reviewed

Direct link

Wise, Steven L.; Pastor, Dena A.; Kong, Xiaojing J. – Applied Measurement in Education, 2009

Previous research has shown that rapid-guessing behavior can degrade the validity of test scores from low-stakes proficiency tests. This study examined, using hierarchical generalized linear modeling, examinee and item characteristics for predicting rapid-guessing behavior. Several item characteristics were found significant; items with more text…

Descriptors: Guessing (Tests), Achievement Tests, Correlation, Test Items

Validity of the Simultaneous Approach to the Development of Equivalent Achievement Tests in English and French

Peer reviewed

Direct link

Rogers, W. Todd; Lin, Jie; Rinaldi, Christia M. – Applied Measurement in Education, 2011

The evidence gathered in the present study supports the use of the simultaneous development of test items for different languages. The simultaneous approach used in the present study involved writing an item in one language (e.g., French) and, before moving to the development of a second item, translating the item into the second language (e.g.,…

Descriptors: Test Items, Item Analysis, Achievement Tests, French

Previous Page | Next Page »

Pages: 1 | 2

Achievement Tests	25
Test Items	25
Foreign Countries	7
International Assessment	6
Difficulty Level	5
Elementary Secondary Education	5
Mathematics Tests	5
Scores	5
Test Validity	5
Elementary School Students	4
Guessing (Tests)	4
Item Analysis	4
Mathematics Achievement	4
Measurement	4
Test Construction	4
Curriculum	3
Error of Measurement	3
Grade 4	3
Item Response Theory	3
Multiple Choice Tests	3
Reading Tests	3
Secondary School Students	3
Classification	2
Comparative Analysis	2
Computer Assisted Testing	2
More ▼

Wise, Steven L.	3
Traynor, Anne	2
Abulela, Mohammed A. A.	1
Andrich, David	1
Anne Traynor	1
Ansley, Timothy	1
Benson, Jeri	1
Bishop, N. Scott	1
Cohen, Dale J.	1
Crocker, Linda M.	1
El Masri, Yasmine H.	1
Ercikan, Kadriye	1
Frisbie, David A.	1
George, Ann Cathrice	1
Gierl, Mark J.	1
Haladyna, Thomas M.	1
Henly, George A.	1
Koh, Kim	1
Kong, Xiaojing J.	1
Li, Tingxuan	1
Lin, Jie	1
McCreith, Tanya	1
Meijer, Rob R.	1
Michaelides, Michalis P.	1
More ▼