ERIC - Search Results

Showing 16 to 30 of 1,166 results

Identifying Word-Problem Difficulty: An Item Response Analysis of an Additive Word-Problem Screener

Peer reviewed

Alison M. Hardy; J. E. Miller; Sarah R. Powell; Nancy Scammacca – Psychology in the Schools, 2025

Students encounter hundreds of word problems throughout the elementary grades and on standardized assessments through high school. To demonstrate proficiency on these measures of mathematics competency, students must be skilled in solving word problems. Early detection of word-problem difficulty is essential, and screeners play an important role…

Descriptors: Elementary School Students, Word Problems (Mathematics), Mathematics Education, Secondary School Students

Examining Rating Quality in Rater-Mediated Activities for Standard-Item Alignment Research

Yvette Jackson – ProQuest LLC, 2023

Rater-mediated activities in educational research occur when an expert judge or rater utilizes an instrument to judge persons or items and generates scale scores. Scale scores are from a subjective judgment and must undergo a quality control measure called rating quality. Rating quality in this study is broadly defined as the extent to which…

Descriptors: Educational Research, Evaluators, Test Theory, Item Response Theory

A Comparison of the Efficacies of Differential Item Functioning Detection Methods

Peer reviewed

Basman, Munevver – International Journal of Assessment Tools in Education, 2023

One way to ensure the validity of a test is to check that all items yield similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…

Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory

Uncovering Student Conceptual Structure by a Multimethod Evaluation of the Energy and Momentum Conceptual Survey

Peer reviewed

Xian Wu; Yaoguang Li; Sanjay N. Rebello – Physical Review Physics Education Research, 2025

The Energy and Momentum Conceptual Survey (EMCS) is an instrument designed to assess students' understanding of key physics concepts in introductory mechanics. In this study, we applied a combination of classical test theory (CTT), item response theory (IRT), and exploratory factor analysis (EFA) to a dataset of over 10 000 students collected…

Descriptors: Physics, Scientific Concepts, Concept Formation, Energy

Identifying Enemy Item Pairs Using Natural Language Processing

Peer reviewed

Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022

Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…

Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring

Examining the Impact of Violations of Local Item Independence Assumption on Test Equating Methods

Peer reviewed

Mehmet Fatih Doguyurt; Seref Tan – International Journal of Assessment Tools in Education, 2025

This study investigates the impact of violating the local item independence assumption by loading certain items onto a second dimension on test equating errors in unidimensional and dichotomous tests. The research was designed as a simulation study, using data generated based on the PISA 2018 mathematics exam. Analyses were conducted under 36…

Descriptors: Equated Scores, Test Items, Mathematics Tests, International Assessment

Assessing the Fairness of Mathematical Literacy Test in Indonesia: Evidence from Gender-Based Differential Item Function Analysis

Peer reviewed

Kartianom Kartianom; Heri Retnawati; Kana Hidayati – Journal of Pedagogical Research, 2024

Conducting a fair test is important for educational research. Unfair assessments can lead to gender disparities in academic achievement, ultimately resulting in disparities in opportunities, wages, and career choice. Differential Item Function [DIF] analysis is presented to provide evidence of whether the test is truly fair, where it does not harm…

Descriptors: Foreign Countries, Test Bias, Item Response Theory, Test Theory

Literary Responses in Spanish Adolescents: Adaptation, Validation, and Analysis of the Literary Response Questionnaire

Peer reviewed

Diana Muela-Bermejo; Irene Mendoza-Cercadillo; Lucía Hernández-Heras – Journal of Adolescent & Adult Literacy, 2024

This study involves translating, cross-culturally adapting, and validating the "Literary Response Questionnaire" (LRQ) for 413 Spanish adolescents. It explores the evolution of literary education in Spain and its alignment with the Reading Responses paradigm. The LRQ, adapted across various locations, is validated in Spanish through…

Descriptors: Reader Response, Adolescents, Questionnaires, Translation

The Tall Order of Teaching Measurement Reliability: Introducing Classical Test Theory through Observations of Human Height

Peer reviewed

Richards, Adam S. – Communication Teacher, 2021

Course: Communication Research Methods. Objectives: This activity provides students with an experiential introduction to measurement theory and the methods for assessing measurement reliability. First, multiple measurements of a person's height are interpreted according to classical test theory. Second, the measurement of human height is used as…

Descriptors: Body Height, Measurement, Communication Research, Test Theory

Evidence for Validity and Reliability of a Research-Based Assessment Instrument on Measurement Uncertainty

Peer reviewed

Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H. J. Lewandowski – Physical Review Physics Education Research, 2024

The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…

Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics

Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items

Peer reviewed

Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020

The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…

Descriptors: Test Bias, Interrater Reliability, Responses, Correlation

The Riddle Knowledge Inference Test (R-Kit)

Peer reviewed

Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025

Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…

Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability

Programme Evaluation in Action: Theory to Practice from an Asian Educational Context

Peer reviewed

Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024

Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…

Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria

Comparison of Classical Test Theory vs. Multi-Facet Rasch Theory

Peer reviewed

Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022

Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…

Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
