ERIC - Search Results

Showing 1 to 15 of 166 results

Using Content Relevance and Representativeness Indices in Instrument Revision

Peer reviewed

Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024

Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…

Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction

Item-Writing Guidelines on Response Option Placement: A Systematic Review

Peer reviewed

Séverin Lions; María Paz Blanco; Pablo Dartnell; Carlos Monsalve; Gabriel Ortega; Julie Lemarié – Applied Measurement in Education, 2024

Multiple-choice items are universally used in formal education. Since they should assess learning, not test-wiseness or guesswork, they must be constructed following the highest possible standards. Hundreds of item-writing guides have provided guidelines to help test developers adopt appropriate strategies to define the distribution and sequence…

Descriptors: Test Construction, Multiple Choice Tests, Guidelines, Test Items

Impact of Violating Unidimensionality on Rasch Calibration for Mixed-Format Tests

Peer reviewed

Chunyan Liu; Raja Subhiyah; Richard A. Feinberg – Applied Measurement in Education, 2024

Mixed-format tests that include both multiple-choice (MC) and constructed-response (CR) items have become widely used in many large-scale assessments. When an item response theory (IRT) model is used to score a mixed-format test, the unidimensionality assumption may be violated if the CR items measure a different construct from that measured by MC…

Descriptors: Test Format, Response Style (Tests), Multiple Choice Tests, Item Response Theory

Does the Response Options Placement Provide Clues to the Correct Answers in Multiple-Choice Tests? A Systematic Review

Peer reviewed

Lions, Séverin; Monsalve, Carlos; Dartnell, Pablo; Blanco, María Paz; Ortega, Gabriel; Lemarié, Julie – Applied Measurement in Education, 2022

Multiple-choice tests are widely used in education, often for high-stakes assessment purposes. Consequently, these tests should be constructed following the highest standards. Many efforts have been undertaken to advance item-writing guidelines intended to improve tests. One important issue is the unwanted effects of the options' position on test…

Descriptors: Multiple Choice Tests, High Stakes Tests, Test Construction, Guidelines

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

Cross-Cultural Validation of the Mathematics Construct and Attribute Profiles: A Differential Item Functioning Approach

Peer reviewed

Yi-Hsin Chen – Applied Measurement in Education, 2024

This study aims to apply the differential item functioning (DIF) technique with the deterministic inputs, noisy "and" gate (DINA) model to validate the mathematics construct and diagnostic attribute profiles across American and Singaporean students. Even with the same ability level, every single item is expected to show uniform DIF…

Descriptors: Foreign Countries, Achievement Tests, Elementary Secondary Education, International Assessment

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

The Impact of Non-Effortful Responding on Item and Person Parameters in Item-Pool Scaling Linking

Peer reviewed

Yue Liu; Zhen Li; Hongyun Liu; Xiaofeng You – Applied Measurement in Education, 2024

Low test-taking effort of examinees has been considered a source of construct-irrelevant variance in item response modeling, leading to serious consequences on parameter estimation. This study aims to investigate how non-effortful response (NER) influences the estimation of item and person parameters in item-pool scale linking (IPSL) and whether…

Descriptors: Item Response Theory, Computation, Simulation, Responses

Modeling Dimensions Converging at the Upper Anchor in Learning Progressions: An Example of Micro-Evolution

Peer reviewed

Mingfeng Xue; Mark Wilson – Applied Measurement in Education, 2024

Multidimensionality is common in psychological and educational measurements. This study focuses on dimensions that converge at the upper anchor (i.e. the highest acquisition status defined in a learning progression) and compares different ways of dealing with them using the multidimensional random coefficients multinomial logit model and scale…

Descriptors: Learning Trajectories, Educational Assessment, Item Response Theory, Evolution

Dissecting Knowledge, Guessing, and Blunder in Multiple Choice Assessments

Peer reviewed

Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023

Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…

Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models

Can Adaptive Testing Improve Test-Taking Experience? A Case Study on Educational Survey Assessment

Peer reviewed

Yi-Hsuan Lee; Yue Jia – Applied Measurement in Education, 2024

Test-taking experience is a consequence of the interaction between students and assessment properties. We define a new notion, rapid-pacing behavior, to reflect two types of test-taking experience -- disengagement and speededness. To identify rapid-pacing behavior, we extend existing methods to develop response-time thresholds for individual items…

Descriptors: Adaptive Testing, Reaction Time, Item Response Theory, Test Format

Are Online and Paper Tests Comparable? Evidence from Statewide K-12 Tests

Peer reviewed

Ben Backes; James Cowan – Applied Measurement in Education, 2024

We investigate two research questions using a recent statewide transition from paper to computer-based testing: first, the extent to which test mode effects found in prior studies can be eliminated; and second, the degree to which online and paper assessments offer different information about underlying student ability. We first find very small…

Descriptors: Computer Assisted Testing, Test Format, Differences, Academic Achievement

Change in Engagement during Test Events: An Argument for Weighted Scoring?

Peer reviewed

Steven L. Wise; G. Gage Kingsbury; Meredith L. Langi – Applied Measurement in Education, 2023

Recent research has provided evidence that performance change during a student's test event can indicate the presence of test-taking disengagement. Meaningful performance change implies that some portions of the test event reflect assumed maximum performance better than others and, because disengagement tends to diminish performance,…

Descriptors: Tests, Weighted Scores, Test Wiseness, Scoring

An Examination of Individual Ability Estimation and Classification Accuracy under Rapid Guessing Misidentifications

Peer reviewed

Rios, Joseph – Applied Measurement in Education, 2022

To mitigate the deleterious effects of rapid guessing (RG) on ability estimates, several rescoring procedures have been proposed. Underlying many of these procedures is the assumption that RG is accurately identified. At present, there have been minimal investigations examining the utility of rescoring approaches when RG is misclassified, and…

Descriptors: Accuracy, Guessing (Tests), Scoring, Classification
