ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	8

Source

Applied Measurement in…

Publication Type

Journal Articles	15
Reports - Research	10
Reports - Evaluative	5

Education Level

Grade 8	4
Elementary Education	3
Elementary Secondary Education	3
Middle Schools	3
Grade 5	2
Junior High Schools	2
Secondary Education	2
Grade 11	1
Grade 3	1
Grade 4	1
Grade 6	1
Grade 7	1
High Schools	1
Higher Education	1
Postsecondary Education	1
More ▼

Audience

Location

Canada	1
Massachusetts	1
Spain	1
Texas	1

Laws, Policies, & Programs

Assessments and Surveys

Massachusetts Comprehensive…

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Are Online and Paper Tests Comparable? Evidence from Statewide K-12 Tests

Peer reviewed

Direct link

Ben Backes; James Cowan – Applied Measurement in Education, 2024

We investigate two research questions using a recent statewide transition from paper to computer-based testing: first, the extent to which test mode effects found in prior studies can be eliminated; and second, the degree to which online and paper assessments offer different information about underlying student ability. We first find very small…

Descriptors: Computer Assisted Testing, Test Format, Differences, Academic Achievement

Can Adaptive Testing Improve Test-Taking Experience? A Case Study on Educational Survey Assessment

Peer reviewed

Direct link

Yi-Hsuan Lee; Yue Jia – Applied Measurement in Education, 2024

Test-taking experience is a consequence of the interaction between students and assessment properties. We define a new notion, rapid-pacing behavior, to reflect two types of test-taking experience -- disengagement and speededness. To identify rapid-pacing behavior, we extend existing methods to develop response-time thresholds for individual items…

Descriptors: Adaptive Testing, Reaction Time, Item Response Theory, Test Format

Impact of Accumulated Error on Item Response Theory Pre-Equating with Mixed Format Tests

Peer reviewed

Direct link

Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F. – Applied Measurement in Education, 2016

The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…

Descriptors: Item Response Theory, Equated Scores, Test Format, Testing Programs

Multistage Computerized Adaptive Testing with Uniform Item Exposure

Peer reviewed

Direct link

Edwards, Michael C.; Flora, David B.; Thissen, David – Applied Measurement in Education, 2012

This article describes a computerized adaptive test (CAT) based on the uniform item exposure multi-form structure (uMFS). The uMFS is a specialization of the multi-form structure (MFS) idea described by Armstrong, Jones, Berliner, and Pashley (1998). In an MFS CAT, the examinee first responds to a small fixed block of items. The items comprising…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Test Items

Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016

Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis

Measurement Properties of Two Innovative Item Formats in a Computer-Based Test

Peer reviewed

Direct link

Wan, Lei; Henly, George A. – Applied Measurement in Education, 2012

Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…

Descriptors: Test Items, Test Format, Computer Assisted Testing, Measurement

Examining the Effectiveness of Test Accommodation Using DIF and a Mixture IRT Model

Peer reviewed

Direct link

Cho, Hyun-Jeong; Lee, Jaehoon; Kingston, Neal – Applied Measurement in Education, 2012

This study examined the validity of test accommodation in third-eighth graders using differential item functioning (DIF) and mixture IRT models. Two data sets were used for these analyses. With the first data set (N = 51,591) we examined whether item type (i.e., story, explanation, straightforward) or item features were associated with item…

Descriptors: Testing Accommodations, Test Bias, Item Response Theory, Validity

Item-Level Comparative Analysis of Online and Paper Administrations of the Texas Assessment of Knowledge and Skills

Peer reviewed

Direct link

Keng, Leslie; McClarty, Katie Larsen; Davis, Laurie Laughlin – Applied Measurement in Education, 2008

This article describes a comparative study conducted at the item level for paper and online administrations of a statewide high stakes assessment. The goal was to identify characteristics of items that may have contributed to mode effects. Item-level analyses compared two modes of the Texas Assessment of Knowledge and Skills (TAKS) for up to four…

Descriptors: Computer Assisted Testing, Geometric Concepts, Grade 8, Comparative Analysis

The Effects of Test Difficulty Manipulation in Computerized Adaptive Testing and Self-Adapted Testing.

Peer reviewed

Ponsoda, Vicente; Olea, Julio; Rodriguez, Maria Soledad; Revuelta, Javier – Applied Measurement in Education, 1999

Compared easy and difficult versions of self-adapted tests (SAT) and computerized adapted tests. No significant differences were found among the tests for estimated ability or posttest state anxiety in studies with 187 Spanish high school students, although other significant differences were found. Discusses implications for interpreting test…

Descriptors: Ability, Adaptive Testing, Comparative Analysis, Computer Assisted Testing

Testing for Differences in Test Score Distributions Using Loglinear Models.

Peer reviewed

Hanson, Bradley A. – Applied Measurement in Education, 1996

Determining whether score distributions differ on two or more test forms administered to samples of examinees from a single population is explored using three statistical tests using loglinear models. Examples are presented of applying tests of distribution differences to decide if equating is needed for alternative forms of a test. (SLD)

Descriptors: Equated Scores, Scoring, Statistical Distributions, Test Format

The Effectiveness of Several Multiple-Choice Formats.

Peer reviewed

Haladyna, Thomas A. – Applied Measurement in Education, 1992

Several multiple-choice item formats are examined in the current climate of test reform. The reform movement is discussed as it affects use of the following formats: (1) complex multiple-choice; (2) alternate choice; (3) true-false; (4) multiple true-false; and (5) the context dependent item set. (SLD)

Descriptors: Cognitive Psychology, Comparative Testing, Context Effect, Educational Change

High-Stakes Testing Accommodations: Validity versus Disabled Rights.

Peer reviewed

Phillips, S. E. – Applied Measurement in Education, 1994

This article explores the measurement problems associated with granting accommodations for mental disabilities, uses existing case law to construct a legal framework for considering such accommodations, and discusses the advantages and disadvantages of alternative strategies for handling testing accommodation requests. (Author/SLD)

Descriptors: Accessibility (for Disabled), Alternative Assessment, Court Litigation, Elementary Secondary Education

The Cognitive Equivalence of Reading Comprehension Test Items via Computerized and Paper-and-Pencil Administration.

Peer reviewed

Kobrin, Jennifer L.; Young, John W. – Applied Measurement in Education, 2003

Studied the cognitive equivalence of computerized and paper-and-pencil reading comprehension tests using verbal protocol analysis. Results for 48 college students indicate that the only significant difference between the computerized and paper-and-pencil tests was in the frequency of identifying important information in the passage. (SLD)

Descriptors: Cognitive Processes, College Students, Computer Assisted Testing, Difficulty Level

Effects of Scale Anchors on Student Ratings of Instructors.

Peer reviewed

Dunham, Trudy C.; Davison, Mark L. – Applied Measurement in Education, 1990

The effects of packing or skewing the response options of a scale on the common measurement problems of leniency and range restriction in instructor ratings were assessed. Results from a sample of 130 undergraduate education students indicate that packing reduced leniency but had no effect on range restriction. (TJH)

Descriptors: Education Majors, Higher Education, Professors, Rating Scales

Test Use among Classroom Teachers and Its Relationship to Teaching Level and Teaching Practices.

Peer reviewed

Hall, Bruce W.; And Others – Applied Measurement in Education, 1988

Responses of 310 teachers in Florida to a survey about use of teacher-made tests, nationally standardized tests, and state minimum competency tests were studied. Results show that all three test types were used to some extent in eight decision categories, but none of the tests were clearly dominant. (SLD)

Descriptors: Classroom Techniques, Decision Making, Elementary Secondary Education, Minimum Competency Testing

Test Format	15
Computer Assisted Testing	7
Test Items	7
Elementary Secondary Education	5
Item Response Theory	5
Testing Problems	4
Adaptive Testing	3
Difficulty Level	3
Grade 8	3
High Stakes Tests	3
Multiple Choice Tests	3
Test Construction	3
Comparative Analysis	2
Equated Scores	2
Foreign Countries	2
Higher Education	2
Mathematics Tests	2
Psychometrics	2
Scoring	2
Standardized Tests	2
State Programs	2
Test Reliability	2
Test Validity	2
Testing Accommodations	2
Ability	1
More ▼

Ben Backes	1
Boulais, André-Philippe	1
Cho, Hyun-Jeong	1
Colvin, Kimberly F.	1
Cook, Robert J.	1
Davis, Laurie Laughlin	1
Davison, Mark L.	1
De Champlain, André	1
Dunham, Trudy C.	1
Edwards, Michael C.	1
Flora, David B.	1
Gierl, Mark J.	1
Haladyna, Thomas A.	1
Hall, Bruce W.	1
Hanson, Bradley A.	1
Henly, George A.	1
James Cowan	1
Keller, Lisa A.	1
Keller, Robert	1
Keng, Leslie	1
Kingston, Neal	1
Kobrin, Jennifer L.	1
Lai, Hollis	1
Lee, Jaehoon	1
More ▼