ERIC - Search Results

Publication Date

In 2025	0
Since 2024	4
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	33

Descriptor

Test Items	56
Computer Assisted Testing	24
Item Response Theory	18
Adaptive Testing	15
Difficulty Level	13
Psychometrics	10
Testing	9
Comparative Analysis	8
Mathematics Tests	8
Multiple Choice Tests	8
Testing Accommodations	8
College Students	7
Scores	7
Test Construction	7
Test Format	7
Ability	6
Evaluation Methods	6
Higher Education	6
Reaction Time	6
Reading Tests	6
Simulation	6
Achievement Tests	5
Computation	5
Elementary School Students	5
Grade 8	5
More ▼

Source

Applied Measurement in…

Publication Type

Journal Articles	56
Reports - Research	42
Reports - Evaluative	14
Information Analyses	2
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Secondary Education	10
Middle Schools	9
Elementary Education	8
Junior High Schools	7
Grade 8	6
Higher Education	6
Postsecondary Education	6
Elementary Secondary Education	5
Grade 5	5
High Schools	4
Grade 7	3
Grade 3	2
Grade 4	2
Grade 6	2
Intermediate Grades	2
Primary Education	2
Early Childhood Education	1
Grade 11	1
Grade 9	1
More ▼

Audience

Practitioners

Location

Australia	1
Canada	1
Georgia	1
Japan	1
Ohio	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Major Field Achievement Test…	1
Praxis Series	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 56 results Save | Export

Identifying Careless Responses in Computer-Adaptive Affective Surveys Using Person Fit Analysis

Peer reviewed

Direct link

Stefanie A. Wind; Beyza Aksu-Dunya – Applied Measurement in Education, 2024

Careless responding is a pervasive concern in research using affective surveys. Although researchers have considered various methods for identifying careless responses, studies are limited that consider the utility of these methods in the context of computer adaptive testing (CAT) for affective scales. Using a simulation study informed by recent…

Descriptors: Response Style (Tests), Computer Assisted Testing, Adaptive Testing, Affective Measures

Bayesian Logistic Regression: A New Method to Calibrate Pretest Items in Multistage Adaptive Testing

Peer reviewed

Direct link

TsungHan Ho – Applied Measurement in Education, 2023

An operational multistage adaptive test (MST) requires the development of a large item bank and the effort to continuously replenish the item bank due to concerns about test security and validity over the long term. New items should be pretested and linked to the item bank before being used operationally. The linking item volume fluctuations in…

Descriptors: Bayesian Statistics, Regression (Statistics), Test Items, Pretesting

On the Reliable Identification and Effectiveness of Computer-Based, Pop-Up Glossaries in Large-Scale Assessments

Peer reviewed

Direct link

Cohen, Dale J.; Ballman, Alesha; Rijmen, Frank; Cohen, Jon – Applied Measurement in Education, 2020

Computer-based, pop-up glossaries are perhaps the most promising accommodation aimed at mitigating the influence of linguistic structure and cultural bias on the performance of English Learner (EL) students on statewide assessments. To date, there is no established procedure for identifying the words that require a glossary for EL students that is…

Descriptors: Glossaries, Testing Accommodations, English Language Learners, Computer Assisted Testing

Are Multiple-Choice Items Too Fat?

Peer reviewed

Direct link

Haladyna, Thomas M.; Rodriguez, Michael C.; Stevens, Craig – Applied Measurement in Education, 2019

The evidence is mounting regarding the guidance to employ more three-option multiple-choice items. From theoretical analyses, empirical results, and practical considerations, such items are of equal or higher quality than four- or five-option items, and more items can be administered to improve content coverage. This study looks at 58 tests,…

Descriptors: Multiple Choice Tests, Test Items, Testing Problems, Guessing (Tests)

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Direct link

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Can Adaptive Testing Improve Test-Taking Experience? A Case Study on Educational Survey Assessment

Peer reviewed

Direct link

Yi-Hsuan Lee; Yue Jia – Applied Measurement in Education, 2024

Test-taking experience is a consequence of the interaction between students and assessment properties. We define a new notion, rapid-pacing behavior, to reflect two types of test-taking experience -- disengagement and speededness. To identify rapid-pacing behavior, we extend existing methods to develop response-time thresholds for individual items…

Descriptors: Adaptive Testing, Reaction Time, Item Response Theory, Test Format

Bayesian Estimation and Testing of a Linear Logistic Test Model for Learning during the Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021

The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…

Descriptors: Bayesian Statistics, Computation, Learning, Testing

Computer-Based Listening Test with Full Video, Visual-Limited Video, and Audio: A Comparative Analysis Based on Difficulty, Discrimination Power, and Response Time

Peer reviewed

Direct link

Takahiro Terao – Applied Measurement in Education, 2024

This study aimed to compare item characteristics and response time between stimulus conditions in computer-delivered listening tests. Listening materials had three variants: regular videos, frame-by-frame videos, and only audios without visuals. Participants were 228 Japanese high school students who were requested to complete one of nine…

Descriptors: Computer Assisted Testing, Audiovisual Aids, Reaction Time, High School Students

Considering the Use of General and Modified Assessment Items in Computerized Adaptive Testing

Peer reviewed

Direct link

Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015

This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs

Differential Item Functioning for Accommodated Students with Disabilities: Effect of Differences in Proficiency Distributions

Peer reviewed

Direct link

Quesen, Sarah; Lane, Suzanne – Applied Measurement in Education, 2019

This study examined the effect of similar vs. dissimilar proficiency distributions on uniform DIF detection on a statewide eighth grade mathematics assessment. Results from the similar- and dissimilar-ability reference groups with an SWD focal group were compared for four models: logistic regression, hierarchical generalized linear model (HGLM),…

Descriptors: Test Items, Mathematics Tests, Grade 8, Item Response Theory

Effects of Item Modifications on Test Accessibility for Persistently Low-Performing Students with Disabilities

Peer reviewed

Direct link

Cohen, Dale J.; Zhang, Jin; Wothke, Werner – Applied Measurement in Education, 2019

Construct-irrelevant cognitive complexity of some items in the statewide grade-level assessments may impose performance barriers for students with disabilities who are ineligible for alternate assessments based on alternate achievement standards. This has spurred research into whether items can be modified to reduce complexity without affecting…

Descriptors: Test Items, Accessibility (for Disabled), Students with Disabilities, Low Achievement

A New Procedure for Detection of Students' Rapid Guessing Responses Using Response Time

Peer reviewed

Direct link

Guo, Hongwen; Rios, Joseph A.; Haberman, Shelby; Liu, Ou Lydia; Wang, Jing; Paek, Insu – Applied Measurement in Education, 2016

Unmotivated test takers using rapid guessing in item responses can affect validity studies and teacher and institution performance evaluation negatively, making it critical to identify these test takers. The authors propose a new nonparametric method for finding response-time thresholds for flagging item responses that result from rapid-guessing…

Descriptors: Guessing (Tests), Reaction Time, Nonparametric Statistics, Models

Multistage Computerized Adaptive Testing with Uniform Item Exposure

Peer reviewed

Direct link

Edwards, Michael C.; Flora, David B.; Thissen, David – Applied Measurement in Education, 2012

This article describes a computerized adaptive test (CAT) based on the uniform item exposure multi-form structure (uMFS). The uMFS is a specialization of the multi-form structure (MFS) idea described by Armstrong, Jones, Berliner, and Pashley (1998). In an MFS CAT, the examinee first responds to a small fixed block of items. The items comprising…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Test Items

Second-Generation Challenges for Making Content Assessments Accessible for ELLs

Peer reviewed

Direct link

Kopriva, Rebecca J. – Applied Measurement in Education, 2014

In this commentary, Rebecca Kopriva examines the articles in this special issue by drawing on her experience from three series of investigations examining how English language learners (ELLs) and other students perceive what test items ask and how they can successfully represent what they know. The first series examined the effect of different…

Descriptors: English Language Learners, Test Items, Educational Assessment, Access to Education

The Comparability of Scores from Different Digital Devices: A Literature Review and Synthesis with Recommendations for Practice

Peer reviewed

Direct link

Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018

Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…

Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Wise, Steven L.	4
Cohen, Dale J.	2
Penfield, Randall D.	2
Pomplun, Mark	2
Way, Walter D.	2
Abedi, Jamal	1
Ainley, John	1
Albano, Anthony D.	1
Ballman, Alesha	1
Ban, Jae-Chun	1
Banks, Kathleen	1
Bennett, Randy Elliot	1
Bergstrom, Betty A.	1
Beyza Aksu-Dunya	1
Boulais, André-Philippe	1
Brandon, Paul R.	1
Brian F. French	1
Chang, Lei	1
Chauvin, Sheila W.	1
Cho, Hyun-Jeong	1
Cohen, Jon	1
Custer, Michael	1
Dadey, Nathan	1
Davey, Beth	1
More ▼