Publication Date
In 2025 | 5
Since 2024 | 19
Since 2021 (last 5 years) | 63
Since 2016 (last 10 years) | 158
Since 2006 (last 20 years) | 422
Author
van der Linden, Wim J. | 13
Stansfield, Charles W. | 7
Tindal, Gerald | 5
Gierl, Mark J. | 4
Raykov, Tenko | 4
Sinharay, Sandip | 4
Veldkamp, Bernard P. | 4
Wainer, Howard | 4
Abedi, Jamal | 3
Alonzo, Julie | 3
Camilli, Gregory | 3
Audience
Practitioners | 59
Teachers | 55
Administrators | 23
Researchers | 20
Policymakers | 6
Community | 3
Students | 3
Counselors | 2
Parents | 2
Location
Australia | 19
Canada | 11
Florida | 6
Hong Kong | 5
Massachusetts | 5
United Kingdom (Great Britain) | 5
China | 4
Oregon | 4
United Kingdom | 4
Asia | 3
India | 3
Yanyan Fu – Educational Measurement: Issues and Practice, 2024
The template-based automated item generation (TAIG) approach, which involves template creation, item generation, item selection, field testing, and evaluation, has more steps than the traditional item development method. Consequently, there is more room for error in the process, and any template error can cascade to the generated items…
Descriptors: Error Correction, Automation, Test Items, Test Construction
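The cascading risk is easiest to see in miniature. Below is a minimal sketch of the template step, with invented slot names and fill values (nothing here is taken from the article): generated items are the cross-product of slot fillers, so one flaw in the template reaches every item.

```python
import itertools
import string

# Hypothetical toy template: slot names and fill values are invented
# for illustration, not taken from the article.
TEMPLATE = string.Template(
    "If $name buys $n pencils at $$${price} each, how much does $name spend?"
)
SLOTS = {
    "name": ["Ada", "Ben"],
    "n": [3, 7],
    "price": ["0.50", "1.25"],
}

def generate_items(template, slots):
    """Enumerate the cross-product of slot values: the core move in
    template-based generation. Any error in TEMPLATE is inherited by
    every item this yields."""
    keys = list(slots)
    for combo in itertools.product(*(slots[k] for k in keys)):
        yield template.substitute(dict(zip(keys, combo)))

for item in generate_items(TEMPLATE, SLOTS):
    print(item)
```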
Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024
Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…
Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction
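For orientation, one widely used item-relevance index from the content validity literature is Aiken's V (shown here as a generic example; the article's own indices are cut off by the truncation above). With $n$ raters judging an item on a $k$-point relevance scale and $s_j$ denoting rater $j$'s rating minus the lowest possible rating:

$$ V = \frac{\sum_{j=1}^{n} s_j}{n\,(k-1)}, \qquad 0 \le V \le 1, $$

where values near 1 indicate strong rater agreement that the item is relevant to the intended domain.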
Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024
This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…
Descriptors: Questionnaires, Test Items, Item Response Theory, Models
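The snippet truncates at "Fisher's…", evidently Fisher information. As a point of reference only (the paper derives the pair and triplet forced-choice analogues, which are not reproduced here), the familiar dichotomous 2PL versions are

$$ I_i(\theta) = a_i^{2}\,P_i(\theta)\bigl(1-P_i(\theta)\bigr), \qquad I(\theta) = \sum_i I_i(\theta), \qquad \operatorname{SE}(\hat\theta) = I(\theta)^{-1/2}, $$

where $P_i(\theta)$ is the model's probability of endorsing item $i$ at trait level $\theta$.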
Belzak, William C. M. – Educational Measurement: Issues and Practice, 2023
Test developers and psychometricians have historically examined measurement bias and differential item functioning (DIF) across a single categorical variable (e.g., gender), independently of other variables (e.g., race, age, etc.). This is problematic when more complex forms of measurement bias may adversely affect test responses and, ultimately,…
Descriptors: Test Bias, High Stakes Tests, Artificial Intelligence, Test Items
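The article examines DIF across several variables at once; as a baseline for what it moves beyond, here is a minimal sketch of the classic single-grouping-variable screen, Swaminathan-Rogers logistic-regression DIF, on simulated toy data (all names and values below are invented for illustration):

```python
import numpy as np
import statsmodels.api as sm

def logistic_dif(item, criterion, group):
    """Swaminathan-Rogers logistic-regression DIF screen for one item.
    item: 0/1 responses; criterion: matching variable (e.g., rest score);
    group: 0/1 focal-group indicator. A significant group coefficient
    suggests uniform DIF; a significant interaction, nonuniform DIF."""
    X = np.column_stack([np.ones_like(criterion),
                         criterion, group, criterion * group])
    return sm.Logit(item, X).fit(disp=0)

# toy data with uniform DIF built in against the focal group
rng = np.random.default_rng(1)
n = 2000
group = rng.integers(0, 2, n)
theta = rng.normal(0, 1, n)
p = 1 / (1 + np.exp(-(theta - 0.4 * group)))
item = rng.binomial(1, p)
print(logistic_dif(item, theta, group).params)  # index 2 = group effect
```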
Huang, Sijia; Luo, Jinwen; Cai, Li – Educational and Psychological Measurement, 2023
Random item effects item response theory (IRT) models, which treat both person and item effects as random, have received much attention for more than a decade. The random item effects approach has several advantages in many practical settings. The present study introduced an explanatory multidimensional random item effects rating scale model. The…
Descriptors: Rating Scales, Item Response Theory, Models, Test Items
Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025
Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms
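The generalized objective function itself sits behind the truncation, so here, as a hedged sketch only, is the classical baseline such algorithms extend: maximum-Fisher-information selection over a 2PL item bank (the bank below is simulated; the function names are illustrative).

```python
import numpy as np

def p_2pl(theta, a, b):
    """2PL probability of a correct response at ability theta."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item: a^2 * P * (1 - P)."""
    p = p_2pl(theta, a, b)
    return a ** 2 * p * (1.0 - p)

def select_next_item(theta_hat, a, b, administered):
    """Classic max-information CAT rule: administer the unused item
    that is most informative at the current ability estimate."""
    info = item_information(theta_hat, a, b)
    info[list(administered)] = -np.inf  # mask items already given
    return int(np.argmax(info))

# simulated 50-item 2PL bank
rng = np.random.default_rng(0)
a = rng.uniform(0.5, 2.0, size=50)   # discriminations
b = rng.normal(0.0, 1.0, size=50)    # difficulties
print(select_next_item(theta_hat=0.3, a=a, b=b, administered={4, 17}))
```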
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows students' ability to be estimated from any set of items. IRT also allows the difficulty and discrimination of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
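For reference (this is textbook IRT, not specific to the study), the dichotomous two-parameter logistic model behind those two quantities is

$$ P_i(\theta) = \frac{1}{1 + e^{-a_i(\theta - b_i)}}, $$

where $b_i$ is item $i$'s difficulty, the ability at which a correct response has probability .5, and $a_i$ its discrimination, the slope of the curve at that point.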
Jonathan Seiden – Annenberg Institute for School Reform at Brown University, 2025
Direct assessments of early childhood development (ECD) are a cornerstone of research in developmental psychology and are increasingly used to evaluate programs and policies in lower- and middle-income countries. Despite strong psychometric properties, these assessments are too expensive and time-consuming for use in large-scale monitoring or…
Descriptors: Young Children, Child Development, Performance Based Assessment, Developmental Psychology
Bolt, Daniel M.; Liao, Xiangyi – Journal of Educational Measurement, 2021
We revisit the empirically observed positive correlation between DIF and difficulty studied by Freedle and commonly seen in tests of verbal proficiency when comparing populations of different mean latent proficiency levels. It is shown that a positive correlation between DIF and difficulty estimates is actually an expected result (absent any true…
Descriptors: Test Bias, Difficulty Level, Correlation, Verbal Tests
Meike Akveld; George Kinnear – International Journal of Mathematical Education in Science and Technology, 2024
Many universities use diagnostic tests to assess incoming students' preparedness for mathematics courses. Diagnostic test results can help students to identify topics where they need more practice and give lecturers a summary of strengths and weaknesses in their class. We demonstrate a process that can be used to make improvements to a mathematics…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Items, Item Analysis
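A minimal sketch of the item statistics such an improvement cycle typically starts from, assuming a persons-by-items 0/1 response matrix (the article's actual process is truncated above): classical difficulty (proportion correct) and rest-score discrimination.

```python
import numpy as np

def item_statistics(responses):
    """Classical item analysis for a persons-by-items 0/1 matrix:
    difficulty = proportion correct; discrimination = correlation of
    each item with the rest score (total minus that item)."""
    R = np.asarray(responses, dtype=float)
    difficulty = R.mean(axis=0)
    rest = R.sum(axis=1, keepdims=True) - R
    discrimination = np.array(
        [np.corrcoef(R[:, j], rest[:, j])[0, 1] for j in range(R.shape[1])]
    )
    return difficulty, discrimination

# toy data: 200 examinees x 5 items of increasing difficulty
rng = np.random.default_rng(2)
theta = rng.normal(size=(200, 1))
b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])
R = (rng.random((200, 5)) < 1 / (1 + np.exp(-(theta - b)))).astype(int)
diff, disc = item_statistics(R)
print(np.round(diff, 2), np.round(disc, 2))
```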
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
This article discusses visual techniques for identifying, on the one hand, test items that are optimal to select for the final compilation and, on the other, items that should be screened out because they would lower the quality of the compilation. Some classic visual tools are first discussed in a practical manner for diagnosing the logical,…
Descriptors: Test Items, Item Analysis, Item Response Theory, Cutting Scores
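As a generic example of this family of visual tools (a sketch under the usual assumptions, not the article's own diagnostics): bin examinees by total score and plot each bin's proportion correct, giving an empirical item characteristic curve that should rise monotonically for a sound item.

```python
import numpy as np
import matplotlib.pyplot as plt

def plot_empirical_icc(item, total, n_bins=8):
    """Split examinees into equal-size score groups and plot the
    proportion answering the item correctly in each group; a flat or
    non-monotonic curve flags an item that may lower test quality."""
    order = np.argsort(total)
    x, y = [], []
    for chunk in np.array_split(order, n_bins):
        x.append(np.mean(total[chunk]))
        y.append(np.mean(item[chunk]))
    plt.plot(x, y, marker="o")
    plt.xlabel("total score (bin mean)")
    plt.ylabel("proportion correct")
    plt.title("Empirical item characteristic curve")
    plt.show()
```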
Maristela Petrovic-Dzerdz – Collected Essays on Learning and Teaching, 2024
Large introductory classes, with their expansive curriculum, demand assessment strategies that blend efficiency with reliability, prompting the consideration of multiple-choice (MC) tests as a viable option. Crafting a high-quality MC test, however, necessitates a meticulous process involving reflection on assessment format appropriateness, test…
Descriptors: Multiple Choice Tests, Test Construction, Test Items, Alignment (Education)
Deng, Jacky M.; Streja, Nicholas; Flynn, Alison B. – Journal of Chemical Education, 2021
Response process validity evidence can provide researchers with insight into how and why participants interpret items on instruments such as tests and questionnaires. In chemistry education research literature and the social sciences more broadly, response process validity evidence has been used and reported in a variety of ways. This paper's…
Descriptors: Chemistry, Science Education, Educational Research, Validity
Jessica M. Kramer; Evan E. Dean; Micah Peace Urquilla; Joan B. Beasley; Brad Linnenkamp – Inclusion, 2024
Researchers have implemented inclusive research for over 30 years. This article describes how two research projects collaborated with researchers with disabilities and aligns the description with four attributes of inclusive research developed by a consensus of international experts with and without disabilities. The first project, the Person…
Descriptors: Researchers, Cooperation, Intellectual Disability, Developmental Disabilities
Mark Wilson – Journal of Educational and Behavioral Statistics, 2024
This article introduces a new framework for relating educational assessments to teachers' uses of them in the classroom. It articulates three levels of assessment: macro (use of standardized tests), meso (externally developed items), and micro (on-the-fly in the classroom). The first level is the usual context for educational…
Descriptors: Educational Assessment, Measurement, Standardized Tests, Test Items