Publication Date: In 2025 (16); Since 2024 (50); Since 2021, last 5 years (253); Since 2016, last 10 years (578); Since 2006, last 20 years (985)
Laws, Policies, & Programs: No Child Left Behind Act 2001 (2); Head Start (1)
Showing 1 to 15 of 985 results
Peer reviewed
Direct link
Rodgers, Emily; D'Agostino, Jerome V.; Berenbon, Rebecca; Johnson, Tracy; Winkler, Christa – Journal of Early Childhood Literacy, 2023
Running Records are thought to be an excellent formative assessment tool because they generate results that educators can use to make their teaching more responsive. Despite the technical nature of scoring Running Records and the important decisions attached to their analysis, few studies have investigated assessor accuracy. We…
Descriptors: Formative Evaluation, Scoring, Accuracy, Difficulty Level
Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025
This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…
Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level
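For readers new to the setup, one schematic way to write such a model, assuming a Rasch-type form purely for illustration (the paper's exact parameterization may differ):

    P(X_{pi} = 1 \mid \theta_p, b_i) = \operatorname{logit}^{-1}(\theta_p - b_i), \qquad
    \theta_p \sim N(0, \sigma_\theta^2), \qquad
    b_i = \beta_{\ell(i)} + \varepsilon_i, \quad \varepsilon_i \sim N(0, \sigma_b^2)

Here \ell(i) denotes the target performance level of item i, so both persons and items are treated as random while level membership predicts item difficulty.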
Peer reviewed
Direct link
Patrik Havan; Michal Kohút; Peter Halama – International Journal of Testing, 2025
Acquiescence is the tendency of participants to shift their responses toward agreement. Lechner et al. (2019) introduced the following mechanisms of acquiescence: social deference and cognitive processing. We added their interaction to a theoretical framework. The sample consists of 557 participants. We found a significant, moderately strong relationship…
Descriptors: Cognitive Processes, Attention, Difficulty Level, Reflection
Peer reviewed
Download full text (PDF on ERIC)
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to reveal the accuracy with which multiple-choice test item parameters are estimated under item response theory models. Materials/methods: The researchers relied on measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
Sherwin E. Balbuena – Online Submission, 2024
This study introduces a new chi-square test statistic for testing the equality of response frequencies among distracters in multiple-choice tests. The formula uses information from the numbers of correct and wrong answers, which become the basis for calculating the expected response frequencies per distracter. The method was…
Descriptors: Multiple Choice Tests, Statistics, Test Validity, Testing
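The abstract gives the core of the computation. A minimal sketch of that idea in Python, assuming a null hypothesis under which all distracters are equally attractive; the helper name and the equal-shares expected values are illustrative, and the paper's exact statistic may differ:

    from scipy.stats import chi2

    def distractor_chi_square(option_counts, correct_option):
        # option_counts: dict mapping option label -> observed response count
        # correct_option: label of the keyed (correct) option
        distractors = {k: v for k, v in option_counts.items() if k != correct_option}
        wrong_total = sum(distractors.values())        # wrong answers = all non-keyed responses
        expected = wrong_total / len(distractors)      # equal shares under the null
        stat = sum((obs - expected) ** 2 / expected for obs in distractors.values())
        df = len(distractors) - 1                      # cells minus the fixed-total constraint
        return stat, df, chi2.sf(stat, df)

    # Example: a four-option item answered by 100 examinees, 40 of them correctly.
    stat, df, p = distractor_chi_square({"A": 40, "B": 30, "C": 20, "D": 10}, "A")
    print(stat, df, p)   # 10.0, 2, p ~= 0.0067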
Peer reviewed
Download full text (PDF on ERIC)
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to radically underestimate reliability for tests common in educational achievement testing. These tests are often composed of items with widely varying difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
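For context, the most familiar of these traditional estimators, coefficient alpha, is

    \alpha = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k} \sigma_i^2}{\sigma_X^2}\right)

where k is the number of items, \sigma_i^2 the variance of item i, and \sigma_X^2 the variance of the total score. Widely deviating item difficulties attenuate inter-item correlations, which is one mechanism that can push such estimates downward.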
Peer reviewed
Direct link
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
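As background, a minimal sketch of how cut scores are conventionally aggregated under an unmodified Yes/No Angoff procedure; the judgment matrix below is invented for illustration, and the study's proposed modifications are not reproduced here:

    # Rows are judges, columns are items; 1 means "yes, a minimally
    # competent examinee would answer this item correctly."
    judgments = [
        [1, 1, 0, 1, 0, 1, 1, 0],
        [1, 0, 0, 1, 1, 1, 1, 0],
        [1, 1, 0, 1, 0, 1, 0, 1],
    ]

    # Each judge's implied cut score is simply their count of "yes" items;
    # the panel's recommended cut score is the mean across judges.
    per_judge = [sum(row) for row in judgments]
    cut_score = sum(per_judge) / len(per_judge)
    print(per_judge, cut_score)   # [5, 5, 5] 5.0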
Peer reviewed
Direct link
Samah AlKhuzaey; Floriana Grasso; Terry R. Payne; Valentina Tamma – International Journal of Artificial Intelligence in Education, 2024
Designing and constructing pedagogical tests that contain items (i.e., questions) which measure various types of skills for different levels of students equitably is a challenging task. Teachers and item writers alike need to ensure that the quality of assessment materials is consistent if student evaluations are to be objective and effective.…
Descriptors: Test Items, Test Construction, Difficulty Level, Prediction
Peer reviewed
Direct link
Kuan-Yu Jin; Thomas Eckes – Educational and Psychological Measurement, 2024
Insufficient effort responding (IER) refers to a lack of effort when answering survey or questionnaire items. Such items typically offer more than two ordered response categories, with Likert-type scales as the most prominent example. The underlying assumption is that the successive categories reflect increasing levels of the latent variable…
Descriptors: Item Response Theory, Test Items, Test Wiseness, Surveys
Peer reviewed
Direct link
Wuji Lin; Chenxi Lv; Jiejie Liao; Yuan Hu; Yutong Liu; Jingyuan Lin – npj Science of Learning, 2024
The debate about whether the capacity of working memory (WM) varies with the complexity of memory items continues. This study employed novel experimental materials to investigate the role of complexity in WM capacity. Across seven experiments, we explored the relationship between complexity and WM capacity. The results indicated that the…
Descriptors: Short Term Memory, Difficulty Level, Retention (Psychology), Test Items
Peer reviewed
Andreea Dutulescu; Stefan Ruseti; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024
Assessing the difficulty of reading comprehension questions is crucial to educational methodologies and language understanding technologies. Traditional methods of assessing question difficulty rely frequently on human judgments or shallow metrics, often failing to accurately capture the intricate cognitive demands of answering a question. This…
Descriptors: Difficulty Level, Reading Tests, Test Items, Reading Comprehension
Peer reviewed
Direct link
Berenbon, Rebecca F.; McHugh, Bridget C. – Educational Measurement: Issues and Practice, 2023
To assemble a high-quality test, psychometricians rely on subject matter experts (SMEs) to write high-quality items. However, SMEs are not typically given the opportunity to provide input on which content standards are most suitable for multiple-choice questions (MCQs). In the present study, we explored the relationship between perceived MCQ…
Descriptors: Test Items, Multiple Choice Tests, Standards, Difficulty Level
Peer reviewed
Download full text (PDF on ERIC)
Soysal, Sumeyra; Yilmaz Kogar, Esin – International Journal of Assessment Tools in Education, 2022
A testlet comprises a set of items based on a common stimulus. When testlets are used in tests, the local independence assumption may be violated, and in that case it would not be appropriate to use traditional item response theory models for tests that include testlets. When the testlet is discussed, one of the most…
Descriptors: Test Items, Test Theory, Models, Sample Size
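For context, one widely used way to accommodate this local dependence is the testlet response theory model, which augments a two-parameter IRT model with a person-specific testlet effect:

    \operatorname{logit} P(X_{pi} = 1) = a_i\left(\theta_p - b_i - \gamma_{p\,d(i)}\right)

Here d(i) indexes the testlet containing item i, and the \gamma_{p\,d(i)} term absorbs the extra dependence among items that share a stimulus.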
Peer reviewed
Direct link
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows the estimation of students' ability from any set of items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
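For reference, under the two-parameter logistic model the probability that a student with ability \theta answers item i correctly is

    P_i(\theta) = \frac{1}{1 + e^{-a_i(\theta - b_i)}}

where b_i is the item's difficulty and a_i its discrimination, both estimable from response data alongside \theta.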
Peer reviewed
Direct link
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023
When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…
Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity
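As a concrete illustration of the regular/reversed distinction, reversed items are conventionally recoded before scoring so that high values mean the same thing on every item. A minimal sketch, assuming a five-point scale for illustration:

    # Recode a reversed Likert item: on a 1-5 scale, 1 -> 5, 2 -> 4, ..., 5 -> 1.
    def reverse_code(raw, scale_min=1, scale_max=5):
        return scale_min + scale_max - raw

    # "I am not satisfied with my job" answered "4 = agree" becomes a 2,
    # aligning it with the regular item "I am satisfied with my job".
    print(reverse_code(4))   # 2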