ERIC - Search Results

Publication Date

In 2025	0
Since 2024	9
Since 2021 (last 5 years)	17
Since 2016 (last 10 years)	45
Since 2006 (last 20 years)	92

Descriptor

Computer Assisted Testing	69
Test Items	56
Item Response Theory	37
Testing Programs	37
Scores	34
Test Construction	32
Comparative Analysis	31
Adaptive Testing	30
Mathematics Tests	26
State Programs	25
Testing Problems	25
Testing Accommodations	24
Testing	23
Psychometrics	22
Achievement Tests	20
Elementary Secondary Education	20
Standardized Tests	20
Difficulty Level	19
Educational Assessment	17
High School Students	17
Multiple Choice Tests	17
Scoring	16
Evaluation Methods	15
Higher Education	15
Reading Tests	15
More ▼

Source

Applied Measurement in…

194

Publication Type

Journal Articles	194
Reports - Research	118
Reports - Evaluative	62
Reports - Descriptive	13
Information Analyses	11
Speeches/Meeting Papers	8
Tests/Questionnaires	3
Collected Works - General	2
Opinion Papers	2
Historical Materials	1
Legal/Legislative/Regulatory…	1
More ▼

Education Level

Higher Education	19
Secondary Education	17
Postsecondary Education	15
Elementary Secondary Education	13
Elementary Education	12
Middle Schools	11
Grade 8	10
Junior High Schools	9
Grade 3	8
High Schools	8
Grade 5	7
Grade 4	6
Grade 7	6
Grade 10	4
Grade 6	4
Early Childhood Education	3
Grade 11	3
Intermediate Grades	3
Primary Education	3
Grade 12	2
Grade 2	2
Grade 9	1
Two Year Colleges	1
More ▼

Audience

Practitioners	1
Teachers	1

Location

Georgia	4
Canada	3
South Carolina	3
Texas	3
Virginia	3
Israel	2
Australia	1
Colorado	1
Florida	1
Japan	1
Kansas	1
Maryland	1
Massachusetts	1
North Carolina	1
Ohio	1
Spain	1
Turkey	1
United States	1
Vermont	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

What Works Clearinghouse Rating

Showing 1 to 15 of 194 results Save | Export

Identifying Careless Responses in Computer-Adaptive Affective Surveys Using Person Fit Analysis

Peer reviewed

Direct link

Stefanie A. Wind; Beyza Aksu-Dunya – Applied Measurement in Education, 2024

Careless responding is a pervasive concern in research using affective surveys. Although researchers have considered various methods for identifying careless responses, studies are limited that consider the utility of these methods in the context of computer adaptive testing (CAT) for affective scales. Using a simulation study informed by recent…

Descriptors: Response Style (Tests), Computer Assisted Testing, Adaptive Testing, Affective Measures

Reviewing the Test Reviews: Quality Judgments and Reviewer Agreements in the Mental Measurements Yearbook

Peer reviewed

Direct link

Hogan, Thomas; DeStefano, Marissa; Gilby, Caitlin; Kosman, Dana; Peri, Joshua – Applied Measurement in Education, 2021

Buros' "Mental Measurements Yearbook (MMY)" has provided professional reviews of commercially published psychological and educational tests for over 80 years. It serves as a kind of conscience for the testing industry. For a random sample of 50 entries in the "19th MMY" (a total of 100 separate reviews) this study determined…

Descriptors: Test Reviews, Interrater Reliability, Psychological Testing, Educational Testing

Comparing Examinee-Based and Response-Based Motivation Filtering Methods in Remote Low-Stakes Testing

Peer reviewed

Direct link

Sarah Alahmadi; Christine E. DeMars – Applied Measurement in Education, 2024

Large-scale educational assessments are sometimes considered low-stakes, increasing the possibility of confounding true performance level with low motivation. These concerns are amplified in remote testing conditions. To remove the effects of low effort levels in responses observed in remote low-stakes testing, several motivation filtering methods…

Descriptors: Multiple Choice Tests, Item Response Theory, College Students, Scores

Are Online and Paper Tests Comparable? Evidence from Statewide K-12 Tests

Peer reviewed

Direct link

Ben Backes; James Cowan – Applied Measurement in Education, 2024

We investigate two research questions using a recent statewide transition from paper to computer-based testing: first, the extent to which test mode effects found in prior studies can be eliminated; and second, the degree to which online and paper assessments offer different information about underlying student ability. We first find very small…

Descriptors: Computer Assisted Testing, Test Format, Differences, Academic Achievement

Bayesian Logistic Regression: A New Method to Calibrate Pretest Items in Multistage Adaptive Testing

Peer reviewed

Direct link

TsungHan Ho – Applied Measurement in Education, 2023

An operational multistage adaptive test (MST) requires the development of a large item bank and the effort to continuously replenish the item bank due to concerns about test security and validity over the long term. New items should be pretested and linked to the item bank before being used operationally. The linking item volume fluctuations in…

Descriptors: Bayesian Statistics, Regression (Statistics), Test Items, Pretesting

Don't Test after Lunch: The Relationship between Disengagement and the Time of Day That Low-Stakes Testing Occurs

Peer reviewed

Direct link

Steven L. Wise; Megan R. Kuhfeld; Marlit Annalena Lindner – Applied Measurement in Education, 2024

When student achievement is assessed, we seek to elicit a student's maximum performance -- a goal requiring the assumption that the student is fully engaged. Otherwise, to the extent that disengagement occurs, test performance is likely to suffer. Effectively managing test-taking disengagement requires an understanding of the testing conditions…

Descriptors: Testing, Attention Span, Learner Engagement, Time Factors (Learning)

Between- versus Within-Examinee Variability in Test-Taking Effort and Test Emotions during a Low-Stakes Test

Peer reviewed

Direct link

Perkins, Beth A.; Pastor, Dena A.; Finney, Sara J. – Applied Measurement in Education, 2021

When tests are low stakes for examinees, meaning there are little to no personal consequences associated with test results, some examinees put little effort into their performance. To understand the causes and consequences of diminished effort, researchers correlate test-taking effort with other variables, such as test-taking emotions and test…

Descriptors: Response Style (Tests), Psychological Patterns, Testing, Differences

A Census-Level, Multi-Grade Analysis of the Association between Testing Time, Breaks, and Achievement

Peer reviewed

Direct link

Rutkowski, David; Rutkowski, Leslie; Valdivia, Dubravka Svetina; Canbolat, Yusuf; Underhill, Stephanie – Applied Measurement in Education, 2023

Several states in the US have removed time limits on their state assessments. In Indiana, where this study takes place, the state assessment is both untimed during the testing window and allows unlimited breaks during the testing session. Using grade 3 and 8 math and English state assessment data, in this paper we focus on time used for testing…

Descriptors: Testing, Time, Intervals, Academic Achievement

On the Reliable Identification and Effectiveness of Computer-Based, Pop-Up Glossaries in Large-Scale Assessments

Peer reviewed

Direct link

Cohen, Dale J.; Ballman, Alesha; Rijmen, Frank; Cohen, Jon – Applied Measurement in Education, 2020

Computer-based, pop-up glossaries are perhaps the most promising accommodation aimed at mitigating the influence of linguistic structure and cultural bias on the performance of English Learner (EL) students on statewide assessments. To date, there is no established procedure for identifying the words that require a glossary for EL students that is…

Descriptors: Glossaries, Testing Accommodations, English Language Learners, Computer Assisted Testing

Gauging Q-Matrix Design and Model Selection in Applied Cognitive Diagnosis

Peer reviewed

Direct link

Youn Seon Lim – Applied Measurement in Education, 2024

Educational testing has been criticized for its disconnect from modern cognitive science and its limited role in improving instruction and student learning. Reform efforts emphasize the need for testing to provide specific diagnostic insights into students' skills and knowledge. Cognitive diagnosis (CD), an emerging paradigm in educational…

Descriptors: Q Methodology, Matrices, Models, Design

When Should Individual Ability Estimates Be Reported if Rapid Guessing Is Present?

Peer reviewed

Direct link

Rios, Joseph A. – Applied Measurement in Education, 2022

Testing programs are confronted with the decision of whether to report individual scores for examinees that have engaged in rapid guessing (RG). As noted by the "Standards for Educational and Psychological Testing," this decision should be based on a documented criterion that determines score exclusion. To this end, a number of heuristic…

Descriptors: Testing, Guessing (Tests), Academic Ability, Scores

Are Multiple-Choice Items Too Fat?

Peer reviewed

Direct link

Haladyna, Thomas M.; Rodriguez, Michael C.; Stevens, Craig – Applied Measurement in Education, 2019

The evidence is mounting regarding the guidance to employ more three-option multiple-choice items. From theoretical analyses, empirical results, and practical considerations, such items are of equal or higher quality than four- or five-option items, and more items can be administered to improve content coverage. This study looks at 58 tests,…

Descriptors: Multiple Choice Tests, Test Items, Testing Problems, Guessing (Tests)

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Direct link

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Can Adaptive Testing Improve Test-Taking Experience? A Case Study on Educational Survey Assessment

Peer reviewed

Direct link

Yi-Hsuan Lee; Yue Jia – Applied Measurement in Education, 2024

Test-taking experience is a consequence of the interaction between students and assessment properties. We define a new notion, rapid-pacing behavior, to reflect two types of test-taking experience -- disengagement and speededness. To identify rapid-pacing behavior, we extend existing methods to develop response-time thresholds for individual items…

Descriptors: Adaptive Testing, Reaction Time, Item Response Theory, Test Format

Bayesian Estimation and Testing of a Linear Logistic Test Model for Learning during the Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021

The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…

Descriptors: Bayesian Statistics, Computation, Learning, Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13

Wise, Steven L.	8
Hambleton, Ronald K.	5
Finney, Sara J.	4
Pomplun, Mark	4
Davis, Laurie Laughlin	3
Engelhard, George, Jr.	3
Haberman, Shelby	3
Huynh, Huynh	3
Kong, Xiaojing	3
Abedi, Jamal	2
Bennett, Randy Elliot	2
Buckendahl, Chad W.	2
Buzick, Heather	2
Chen, Wen-Hung	2
Clauser, Brian E.	2
Clyman, Stephen G.	2
Cohen, Dale J.	2
Cohen, Jon	2
Dodd, Barbara G.	2
Drasgow, Fritz	2
Feldt, Leonard S.	2
Ferrara, Steve	2
Forsyth, Robert A.	2
Frary, Robert B.	2
More ▼

National Assessment of…	5
SAT (College Admission Test)	5
Texas Assessment of Academic…	5
Program for International…	3
ACT Assessment	1
Armed Services Vocational…	1
California Achievement Tests	1
Georgia Criterion Referenced…	1
Graduate Record Examinations	1
Iowa Tests of Basic Skills	1
Iowa Tests of Educational…	1
Major Field Achievement Test…	1
Massachusetts Comprehensive…	1
National Teacher Examinations	1
Praxis Series	1
Progress in International…	1
Test Anxiety Inventory	1
Trends in International…	1
Wechsler Intelligence Scale…	1
Woodcock Johnson Tests of…	1
More ▼