Publication Date
In 2025: 20
Since 2024: 48
Since 2021 (last 5 years): 225
Since 2016 (last 10 years): 503
Since 2006 (last 20 years): 826
Descriptor
Difficulty Level: 1030
Test Items: 1030
Foreign Countries: 362
Item Response Theory: 341
Test Construction: 228
Item Analysis: 201
Multiple Choice Tests: 188
Test Reliability: 180
Test Validity: 158
Scores: 149
Comparative Analysis: 133
Author
Bulut, Okan: 7
Guo, Hongwen: 7
Sinharay, Sandip: 6
Baghaei, Purya: 5
DeMars, Christine E.: 5
Dorans, Neil J.: 5
Liu, Ou Lydia: 5
Long, Caroline: 5
Plake, Barbara S.: 5
Retnawati, Heri: 5
Wilson, Mark: 5
Location
Turkey: 44
Indonesia: 29
Germany: 27
Australia: 18
Canada: 16
United States: 14
Nigeria: 13
South Africa: 13
United Kingdom: 13
Iran: 11
China: 10
Laws, Policies, & Programs
No Child Left Behind Act 2001: 2
Rodgers, Emily; D'Agostino, Jerome V.; Berenbon, Rebecca; Johnson, Tracy; Winkler, Christa – Journal of Early Childhood Literacy, 2023
Running Records are thought to be an excellent formative assessment tool because they generate results that educators can use to make their teaching more responsive. Despite the technical nature of scoring Running Records and the kinds of important decisions that are attached to their analysis, few studies have investigated assessor accuracy. We…
Descriptors: Formative Evaluation, Scoring, Accuracy, Difficulty Level
Patrik Havan; Michal Kohút; Peter Halama – International Journal of Testing, 2025
Acquiescence is the tendency of participants to shift their responses toward agreement. Lechner et al. (2019) introduced two mechanisms of acquiescence: social deference and cognitive processing. We added their interaction to a theoretical framework. The sample consists of 557 participants. We found a significant, moderately strong relationship…
Descriptors: Cognitive Processes, Attention, Difficulty Level, Reflection
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to assess how accurately multiple-choice test item parameters are estimated under item response theory models. Materials/methods: The researchers relied on measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
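The accuracy indicator described in this entry reduces to simple arithmetic: the absolute difference between estimated and true item parameters. A minimal sketch, assuming a 2PL IRT model and entirely hypothetical true vs. estimated difficulty (b) values (the data and function names are illustrative, not from the study):

```python
import math

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2PL IRT model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def mean_abs_error(estimated, actual):
    """Accuracy indicator: mean absolute difference between
    estimated and true item parameters."""
    return sum(abs(e, ) if False else abs(e - t) for e, t in zip(estimated, actual)) / len(estimated)

# Hypothetical true vs. estimated difficulty (b) parameters for five items
true_b = [-1.2, -0.5, 0.0, 0.6, 1.4]
est_b = [-1.1, -0.6, 0.1, 0.5, 1.5]
print(round(mean_abs_error(est_b, true_b), 3))  # 0.1
```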
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to radically underestimate reliability for tests commonly used to measure educational achievement. Such tests are often built from items with widely varying difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
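For readers unfamiliar with coefficient alpha, the best known of the estimators this entry says can underestimate reliability, here is a minimal from-scratch sketch; the 0/1 score matrix is hypothetical and chosen only to show items of varying difficulty:

```python
def cronbach_alpha(scores):
    """Coefficient alpha for a persons-by-items score matrix (list of rows):
    alpha = k/(k-1) * (1 - sum(item variances) / variance(total scores))."""
    k = len(scores[0])                  # number of items
    def var(xs):                        # population variance
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / len(xs)
    item_vars = sum(var([row[i] for row in scores]) for i in range(k))
    total_var = var([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - item_vars / total_var)

# Hypothetical 0/1 responses: 5 examinees x 4 items of varying difficulty
scores = [
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
print(round(cronbach_alpha(scores), 3))  # 0.8
```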
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
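The Yes/No Angoff method this entry modifies has a simple arithmetic core: each panelist marks, item by item, whether a minimally competent examinee would answer correctly, and the recommended cut score is the mean Yes count across panelists. A hedged sketch with made-up ratings (not data from the study):

```python
def yes_no_angoff_cut(ratings):
    """Recommended cut score under the Yes/No Angoff method: the mean,
    across panelists, of each panelist's count of 'Yes' judgments
    (1 = a minimally competent examinee would answer the item correctly)."""
    per_panelist = [sum(row) for row in ratings]
    return sum(per_panelist) / len(per_panelist)

# Hypothetical ratings: 3 panelists x 5 items
ratings = [
    [1, 1, 0, 1, 0],   # panelist A: 3 Yes
    [1, 0, 1, 1, 0],   # panelist B: 3 Yes
    [1, 1, 1, 1, 0],   # panelist C: 4 Yes
]
print(round(yes_no_angoff_cut(ratings), 2))  # 3.33
```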
Samah AlKhuzaey; Floriana Grasso; Terry R. Payne; Valentina Tamma – International Journal of Artificial Intelligence in Education, 2024
Designing and constructing pedagogical tests that contain items (i.e., questions) which equitably measure various types of skills for students at different levels is a challenging task. Teachers and item writers alike need to ensure that the quality of assessment materials is consistent, if student evaluations are to be objective and effective.…
Descriptors: Test Items, Test Construction, Difficulty Level, Prediction
Kuan-Yu Jin; Thomas Eckes – Educational and Psychological Measurement, 2024
Insufficient effort responding (IER) refers to a lack of effort when answering survey or questionnaire items. Such items typically offer more than two ordered response categories, with Likert-type scales as the most prominent example. The underlying assumption is that the successive categories reflect increasing levels of the latent variable…
Descriptors: Item Response Theory, Test Items, Test Wiseness, Surveys
Wuji Lin; Chenxi Lv; Jiejie Liao; Yuan Hu; Yutong Liu; Jingyuan Lin – npj Science of Learning, 2024
The debate about whether the capacity of working memory (WM) varies with the complexity of memory items continues. This study employed novel experimental materials to investigate the role of complexity in WM capacity. Across seven experiments, we explored the relationship between complexity and WM capacity. The results indicated that the…
Descriptors: Short Term Memory, Difficulty Level, Retention (Psychology), Test Items
Berenbon, Rebecca F.; McHugh, Bridget C. – Educational Measurement: Issues and Practice, 2023
To assemble a high-quality test, psychometricians rely on subject matter experts (SMEs) to write high-quality items. However, SMEs are not typically given the opportunity to provide input on which content standards are most suitable for multiple-choice questions (MCQs). In the present study, we explored the relationship between perceived MCQ…
Descriptors: Test Items, Multiple Choice Tests, Standards, Difficulty Level
Soysal, Sumeyra; Yilmaz Kogar, Esin – International Journal of Assessment Tools in Education, 2022
A testlet comprises a set of items based on a common stimulus. When testlets are used in a test, the local independence assumption may be violated; in that case, it would not be appropriate to apply traditional item response theory models to tests that include testlets. When the testlet is discussed, one of the most…
Descriptors: Test Items, Test Theory, Models, Sample Size
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows students' ability to be estimated from any set of items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
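This entry contrasts IRT with classical item statistics. As a point of comparison, the classical counterparts of difficulty and discrimination are the p-value (proportion correct) and the point-biserial correlation between an item and the total score; a minimal sketch with hypothetical response vectors:

```python
import math

def item_difficulty(responses):
    """Classical difficulty index: proportion of examinees answering correctly."""
    return sum(responses) / len(responses)

def point_biserial(item, totals):
    """Classical discrimination index: Pearson correlation between a
    0/1 item-score vector and examinees' total test scores."""
    n = len(item)
    mi, mt = sum(item) / n, sum(totals) / n
    cov = sum((x - mi) * (t - mt) for x, t in zip(item, totals)) / n
    si = math.sqrt(sum((x - mi) ** 2 for x in item) / n)
    st = math.sqrt(sum((t - mt) ** 2 for t in totals) / n)
    return cov / (si * st)

# Hypothetical: 4 examinees' scores on one item, and their total scores
print(item_difficulty([0, 0, 1, 1]))                          # 0.5
print(round(point_biserial([0, 0, 1, 1], [1, 2, 3, 4]), 3))   # 0.894
```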
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023
When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…
Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity
Nedungadi, Sachin; Rinco Michels, Olga; Kreke, Patricia J.; Raker, Jeffrey R.; Murphy, Kristen L. – Journal of Chemical Education, 2022
Practice examinations developed at the ACS Examinations Institute ask students to self-report mental effort when answering items. This self-reported mental effort together with performance can be represented in the form of a cognitive efficiency graph for each student giving information on the utilization of cognitive resources and content…
Descriptors: Cognitive Processes, Science Tests, Test Items, Difficulty Level
Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025
This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…
Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation