ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	26
Since 2006 (last 20 years)	35

Descriptor

Achievement Tests	71
Test Items	25
Elementary Secondary Education	17
Foreign Countries	14
International Assessment	14
Mathematics Tests	12
Scores	11
Item Response Theory	10
Mathematics Achievement	10
Standardized Tests	10
Comparative Analysis	9
Test Construction	9
Test Validity	9
Academic Achievement	8
Elementary School Students	8
Guessing (Tests)	8
Secondary School Students	8
Difficulty Level	7
Grade 4	7
Grade 6	7
Models	7
Validity	7
Computer Assisted Testing	6
Error of Measurement	6
Measurement	6
More ▼

Source

Applied Measurement in…

Publication Type

Journal Articles	71
Reports - Research	43
Reports - Evaluative	21
Speeches/Meeting Papers	8
Reports - Descriptive	7
Information Analyses	3
Historical Materials	1

Education Level

Elementary Secondary Education	11
Secondary Education	11
Elementary Education	9
Grade 3	5
Grade 4	5
Grade 5	5
Intermediate Grades	5
Middle Schools	5
Grade 6	4
Grade 8	4
Grade 7	3
High Schools	3
Higher Education	3
Postsecondary Education	3
Early Childhood Education	2
Grade 2	2
Grade 9	2
Junior High Schools	2
Primary Education	2
Grade 1	1
Grade 10	1
Grade 11	1
Grade 12	1
More ▼

Audience

Location

Canada	2
Finland	2
Germany	2
Ohio	2
Singapore	2
United States	2
Australia	1
California	1
Costa Rica	1
Florida	1
France	1
Indiana	1
Iowa	1
Iran (Tehran)	1
Italy	1
Jordan	1
Kansas	1
Massachusetts	1
Michigan	1
Minnesota	1
Netherlands	1
Oregon	1
Romania	1
Russia	1
Tennessee	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Program for International…	8
National Assessment of…	7
Trends in International…	5
Measures of Academic Progress	3
Texas Assessment of Academic…	3
Iowa Tests of Basic Skills	2
Iowa Tests of Educational…	2
SAT (College Admission Test)	2
Stanford Achievement Tests	2
California Achievement Tests	1
College Board Achievement…	1
Florida Comprehensive…	1
Progress in International…	1
Woodcock Johnson Psycho…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 71 results Save | Export

Using Content Relevance and Representativeness Indices in Instrument Revision

Peer reviewed

Direct link

Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024

Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…

Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction

A Method for Identifying Partial Test-Taking Engagement

Peer reviewed

Direct link

Wise, Steven; Kuhfeld, Megan – Applied Measurement in Education, 2021

Effort-moderated (E-M) scoring is intended to estimate how well a disengaged test taker would have performed had they been fully engaged. It accomplishes this adjustment by excluding disengaged responses from scoring and estimating performance from the remaining responses. The scoring method, however, assumes that the remaining responses are not…

Descriptors: Scoring, Achievement Tests, Identification, Validity

Cross-Cultural Validation of the Mathematics Construct and Attribute Profiles: A Differential Item Functioning Approach

Peer reviewed

Direct link

Yi-Hsin Chen – Applied Measurement in Education, 2024

This study aims to apply the differential item functioning (DIF) technique with the deterministic inputs, noisy "and" gate (DINA) model to validate the mathematics construct and diagnostic attribute profiles across American and Singaporean students. Even with the same ability level, every single item is expected to show uniform DIF…

Descriptors: Foreign Countries, Achievement Tests, Elementary Secondary Education, International Assessment

The Impact of Test-Taking Disengagement on Item Content Representation

Peer reviewed

Direct link

Wise, Steven L. – Applied Measurement in Education, 2020

In achievement testing there is typically a practical requirement that the set of items administered should be representative of some target content domain. This is accomplished by establishing test blueprints specifying the content constraints to be followed when selecting the items for a test. Sometimes, however, students give disengaged…

Descriptors: Test Items, Test Content, Achievement Tests, Guessing (Tests)

Not-Reached Items: An Issue of Time and of Test-Taking Disengagement? The Case of PISA 2015 Reading Data

Peer reviewed

Direct link

Pools, Elodie – Applied Measurement in Education, 2022

Many low-stakes assessments, such as international large-scale surveys, are administered during time-limited testing sessions and some test-takers are not able to endorse the last items of the test, resulting in not-reached (NR) items. However, because the test has no consequence for the respondents, these NR items can also stem from quitting the…

Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students

Gauging Uncertainty in Test-to-Curriculum Alignment Indices

Peer reviewed

Direct link

Traynor, Anne; Li, Tingxuan; Zhou, Shuqi – Applied Measurement in Education, 2020

During the development of large-scale school achievement tests, panels of independent subject-matter experts use systematic judgmental methods to rate the correspondence between a given test's items and performance objective statements. The individual experts' ratings may then be used to compute summary indices to quantify the match between a…

Descriptors: Alignment (Education), Achievement Tests, Curriculum, Error of Measurement

Critically Reflecting on the Origins, Evolution, and Impact of the Cattell-Horn-Carroll (CHC) Model

Peer reviewed

Direct link

McGill, Ryan J.; Dombrowski, Stefan C. – Applied Measurement in Education, 2019

The Cattell-Horn-Carroll (CHC) model presently serves as a blueprint for both test development and a taxonomy for clinical interpretation of modern tests of cognitive ability. Accordingly, the trend among test publishers has been toward creating tests that provide users with an ever-increasing array of scores that comport with CHC. However, an…

Descriptors: Models, Cognitive Ability, Intelligence Tests, Intelligence

Identifying Disengaged Survey Responses: New Evidence Using Response Time Metadata

Peer reviewed

Direct link

Soland, James; Wise, Steven L.; Gao, Lingyun – Applied Measurement in Education, 2019

Disengaged responding is a phenomenon that often biases observed scores from achievement tests and surveys in practically and statistically significant ways. This problem has led to the development of methods to detect and correct for disengaged responses on both achievement test and survey scores. One major disadvantage when trying to detect…

Descriptors: Reaction Time, Metadata, Response Style (Tests), Student Surveys

The Effects of Effort Monitoring with Proctor Notification on Test-Taking Engagement, Test Performance, and Validity

Peer reviewed

Direct link

Wise, Steven L.; Kuhfeld, Megan R.; Soland, James – Applied Measurement in Education, 2019

When we administer educational achievement tests, we want to be confident that the resulting scores validly indicate what the test takers know and can do. However, if the test is perceived as low stakes by the test taker, disengaged test taking sometimes occurs, which poses a serious threat to score validity. When computer-based tests are used,…

Descriptors: Guessing (Tests), Computer Assisted Testing, Achievement Tests, Scores

The Trade-Off between Model Fit, Invariance, and Validity: The Case of PISA Science Assessments

Peer reviewed

Direct link

El Masri, Yasmine H.; Andrich, David – Applied Measurement in Education, 2020

In large-scale educational assessments, it is generally required that tests are composed of items that function invariantly across the groups to be compared. Despite efforts to ensure invariance in the item construction phase, for a range of reasons (including the security of items) it is often necessary to account for differential item…

Descriptors: Models, Goodness of Fit, Test Validity, Achievement Tests

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Are Multiple-Choice Items Too Fat?

Peer reviewed

Direct link

Haladyna, Thomas M.; Rodriguez, Michael C.; Stevens, Craig – Applied Measurement in Education, 2019

The evidence is mounting regarding the guidance to employ more three-option multiple-choice items. From theoretical analyses, empirical results, and practical considerations, such items are of equal or higher quality than four- or five-option items, and more items can be administered to improve content coverage. This study looks at 58 tests,…

Descriptors: Multiple Choice Tests, Test Items, Testing Problems, Guessing (Tests)

Performance Decline as an Indicator of Generalized Test-Taking Disengagement

Peer reviewed

Direct link

Wise, Steven L.; Kingsbury, G. Gage – Applied Measurement in Education, 2022

In achievement testing we assume that students will demonstrate their maximum performance as they encounter test items. Sometimes, however, student performance can decline during a test event, which implies that the test score does not represent maximum performance. This study describes a method for identifying significant performance decline and…

Descriptors: Achievement Tests, Performance, Classification, Guessing (Tests)

A General Approach to Measuring Test-Taking Effort on Computer-Based Tests

Peer reviewed

Direct link

Wise, Steven L.; Gao, Lingyun – Applied Measurement in Education, 2017

There has been an increased interest in the impact of unmotivated test taking on test performance and score validity. This has led to the development of new ways of measuring test-taking effort based on item response time. In particular, Response Time Effort (RTE) has been shown to provide an assessment of effort down to the level of individual…

Descriptors: Test Bias, Computer Assisted Testing, Item Response Theory, Achievement Tests

Effort Analysis: Individual Score Validation of Achievement Test Data

Peer reviewed

Direct link

Wise, Steven L. – Applied Measurement in Education, 2015

Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…

Descriptors: Achievement Tests, Scores, Validity, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Wise, Steven L.	8
Soland, James	3
Twing, Jon S.	3
Ercikan, Kadriye	2
Frisbie, David A.	2
Gao, Lingyun	2
Haladyna, Thomas M.	2
Linn, Robert L.	2
Traynor, Anne	2
Abulela, Mohammed A. A.	1
Andrich, David	1
Anne Traynor	1
Ansley, Timothy	1
Bailey, Alison L.	1
Barnes, Laura L. B.	1
Bazargan, Abbas	1
Becker, Douglas F.	1
Benson, Jeri	1
Bishop, N. Scott	1
Borgonovi, Francesca	1
Bovaird, James A.	1
Butler, Frances A.	1
Care, Esther	1
Cohen, Allan S.	1
More ▼