ERIC - Search Results

Publication Date

In 2026	0
Since 2025	215
Since 2022 (last 5 years)	1084
Since 2017 (last 10 years)	2594
Since 2007 (last 20 years)	4955

Descriptor

Test Items	9547
Test Construction	2723
Foreign Countries	2184
Item Response Theory	1872
Difficulty Level	1623
Item Analysis	1502
Test Validity	1416
Test Reliability	1187
Multiple Choice Tests	1158
Scores	1137
Computer Assisted Testing	1058
Comparative Analysis	1024
Test Format	956
Higher Education	877
Achievement Tests	855
Statistical Analysis	852
Mathematics Tests	845
Psychometrics	833
Test Bias	772
Models	754
Student Evaluation	736
Language Tests	699
Correlation	695
Evaluation Methods	674
Scoring	633
More ▼

Author

van der Linden, Wim J.	69
Tindal, Gerald	50
Hambleton, Ronald K.	45
Alonzo, Julie	41
Chang, Hua-Hua	40
Plake, Barbara S.	40
Sinharay, Sandip	37
Reckase, Mark D.	36
Wainer, Howard	33
Dorans, Neil J.	32
Gierl, Mark J.	30
Sireci, Stephen G.	28
Wang, Wen-Chung	26
Cohen, Allan S.	25
Meijer, Rob R.	25
Samejima, Fumiko	24
Stocking, Martha L.	24
Anderson, Daniel	23
Zwick, Rebecca	23
Veldkamp, Bernard P.	22
Haladyna, Thomas M.	21
Kim, Seock-Ho	21
Wise, Steven L.	21
Kim, Sooyeon	20
More ▼

Education Level

Higher Education	1314
Postsecondary Education	1064
Secondary Education	927
Elementary Education	716
Middle Schools	420
High Schools	363
Elementary Secondary Education	359
Junior High Schools	320
Grade 8	256
Intermediate Grades	209
Grade 4	183
Early Childhood Education	178
Grade 5	134
Primary Education	126
Grade 7	113
Grade 3	111
Grade 6	107
Grade 9	69
Grade 2	56
Grade 10	53
Grade 12	52
Kindergarten	50
Adult Education	39
Grade 11	38
Grade 1	36
More ▼

Audience

Practitioners	653
Teachers	563
Researchers	250
Students	201
Administrators	81
Policymakers	22
Parents	17
Counselors	8
Community	7
Support Staff	3
Media Staff	1
More ▼

Location

Turkey	226
Canada	223
Australia	155
Germany	116
United States	99
China	90
Florida	86
Indonesia	82
Taiwan	78
United Kingdom	73
California	66
Japan	65
Netherlands	64
Iran	62
United Kingdom (England)	57
South Africa	48
New York	46
Missouri	45
Oklahoma	44
South Korea	44
Malaysia	42
Texas	42
Sweden	38
Israel	37
Singapore	37
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	4
Meets WWC Standards with or without Reservations	4
Does not meet standards	1

Showing 46 to 60 of 9,547 results Save | Export

Assessing Confidence Reduces the Benefits of Response Revisions in a General Knowledge Test

Peer reviewed

Direct link

Marta Siedlecka; Piotr Litwin; Paulina Szyszka; Boryslaw Paulewicz – European Journal of Psychology of Education, 2025

Students change their responses during tests, and these revisions are often correct. Some studies have suggested that decisions regarding revisions are informed by metacognitive monitoring. We investigated whether assessing and reporting response confidence increases the accuracy of revisions and the final test score, and whether confidence in a…

Descriptors: Student Evaluation, Decision Making, Responses, Achievement Tests

Generating Language Assessment Content Free from Representational Harms

Peer reviewed

Direct link

Ikkyu Choi; Jiyun Zu – Language Testing, 2025

Today's language models can produce syntactically accurate and semantically coherent texts. This capability presents new opportunities for generating content for language assessments, which have traditionally required intensive expert resources. However, these models are also known to generate biased texts, leading to representational harms.…

Descriptors: Artificial Intelligence, Language Tests, Test Bias, Test Construction

Revisiting the Lexical Differences between Academic and General Training IELTS Reading Tests

Peer reviewed

Direct link

Linh Thi Thao Le; Nam Thi Phuong Ho; Nguyen Huynh Trang; Hung Tan Ha – SAGE Open, 2025

The International English Language Testing System (IELTS) has served as one of the most reliable proofs of people's English language proficiency. There have been rumors about the discrepancy in difficulty between the two modules of IELTS, namely Academic (AC) and General Training (GT); however, there is little empirical evidence to confirm such a…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Reading Tests

Assessing Model Fit of the Generalized Graded Unfolding Model

Peer reviewed
PDF on ERIC

Download full text

Abdulla Alzarouni; R. J. De Ayala – Practical Assessment, Research & Evaluation, 2025

The assessment of model fit in latent trait modeling is an integral part of correctly applying the model. Still the assessment of model fit has been less utilized for ideal point models such as the Generalized Graded Unfolding Models (GGUM). The current study assesses the performance of the relative fit indices "AIC" and "BIC,"…

Descriptors: Goodness of Fit, Models, Statistical Analysis, Sample Size

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Evaluating Youth Empowerment: The Construction and Validation of an Inventory of Dimensions and Indicators

Peer reviewed

Direct link

Anna Planas-Lladó; Xavier Úcar – American Journal of Evaluation, 2024

Empowerment is a concept that has become increasingly used over recent years. However, little research has been undertaken into how empowerment can be evaluated, particularly in the case of young people. The aim of this article is to present an inventory of dimensions and indicators of youth empowerment. The article describes the various phases in…

Descriptors: Youth, Empowerment, Test Construction, Test Validity

The Impact of Measurement Noninvariance across Time and Group in Longitudinal Item Response Modeling

Peer reviewed

Direct link

In-Hee Choi – Asia Pacific Education Review, 2024

Longitudinal item response data often exhibit two types of measurement noninvariance: the noninvariance of item parameters between subject groups and that of item parameters across multiple time points. This study proposes a comprehensive approach to the simultaneous modeling of both types of measurement noninvariance in terms of longitudinal item…

Descriptors: Longitudinal Studies, Item Response Theory, Growth Models, Error of Measurement

An Experimental Comparison of Multiple-Choice and Short-Answer Questions on a High-Stakes Test for Medical Students

Peer reviewed

Direct link

Janet Mee; Ravi Pandian; Justin Wolczynski; Amy Morales; Miguel Paniagua; Polina Harik; Peter Baldwin; Brian E. Clauser – Advances in Health Sciences Education, 2024

Recent advances in automated scoring technology have made it practical to replace multiple-choice questions (MCQs) with short-answer questions (SAQs) in large-scale, high-stakes assessments. However, most previous research comparing these formats has used small examinee samples testing under low-stakes conditions. Additionally, previous studies…

Descriptors: Multiple Choice Tests, High Stakes Tests, Test Format, Test Items

DISTO: Textual Distractors for Multiple Choice Reading Comprehension Questions Using Negative Sampling

Peer reviewed
PDF on ERIC

Download full text

Bilal Ghanem; Alona Fyshe – International Educational Data Mining Society, 2024

Multiple choice questions (MCQs) are a common way to assess reading comprehension. Every MCQ needs a set of distractor answers that are incorrect, but plausible enough to test student knowledge. However, good distractors are hard to create. Distractor generation (DG) models have been proposed, and their performance is typically evaluated using…

Descriptors: Multiple Choice Tests, Reading Comprehension, Test Items, Testing

Embedded Standard Setting for Credentialing

Peer reviewed

Direct link

Daniel Lewis; Melanie Graw; Michael Baker – Journal of Applied Testing Technology, 2024

Embedded Standard Setting (ESS; Lewis & Cook, 2020) transforms standard setting from a standalone workshop to an active part of the assessment development lifecycle. ESS purports to lower costs by eliminating the standard-setting workshop and enhance the validity argument by maintaining a consistent focus on the evidentiary relationship…

Descriptors: Standard Setting (Scoring), Test Items, Test Construction, Food Service

A Deterministic Gated Lognormal Response Time Model to Identify Examinees with Item Preknowledge

Peer reviewed

Direct link

Kasli, Murat; Zopluoglu, Cengiz; Toton, Sarah L. – Journal of Educational Measurement, 2023

Response times (RTs) have recently attracted a significant amount of attention in the literature as they may provide meaningful information about item preknowledge. In this study, a new model, the Deterministic Gated Lognormal Response Time (DG-LNRT) model, is proposed to identify examinees with item preknowledge using RTs. The proposed model was…

Descriptors: Reaction Time, Test Items, Models, Familiarity

Open-Ended versus Closed Probes: Assessing Different Formats of Web Probing

Peer reviewed

Direct link

Neuert, Cornelia E.; Meitinger, Katharina; Behr, Dorothée – Sociological Methods & Research, 2023

The method of web probing integrates cognitive interviewing techniques into web surveys and is increasingly used to evaluate survey questions. In a usual web probing scenario, probes are administered immediately after the question to be tested (concurrent probing), typically as open-ended questions. A second possibility of administering probes is…

Descriptors: Internet, Online Surveys, Test Items, Evaluation

Using Response Time in Multidimensional Computerized Adaptive Testing

Peer reviewed

Direct link

He, Yinhong; Qi, Yuanyuan – Journal of Educational Measurement, 2023

In multidimensional computerized adaptive testing (MCAT), item selection strategies are generally constructed based on responses, and they do not consider the response times required by items. This study constructed two new criteria (referred to as DT-inc and DT) for MCAT item selection by utilizing information from response times. The new designs…

Descriptors: Reaction Time, Adaptive Testing, Computer Assisted Testing, Test Items

A Simple Sorting Selection Method for Picking Results from an Automatic Item Generator

Peer reviewed

Direct link

Ben Babcock; Kim Brunnert – Journal of Applied Testing Technology, 2023

Automatic Item Generation (AIG) is an extremely useful tool to construct many high-quality exam items more efficiently than traditional item writing methods. A large pool of items, however, presents challenges like identifying a particular item to meet a specific need. For example, when making a fixed form exam, best practices forbid item stems…

Descriptors: Test Items, Automation, Algorithms, Artificial Intelligence

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 637

Educational and Psychological…	416
Journal of Educational…	367
ProQuest LLC	246
Applied Psychological…	234
Applied Measurement in…	231
ETS Research Report Series	146
Educational Measurement:…	128
Journal of Educational and…	122
Online Submission	115
International Journal of…	105
Grantee Submission	98
Language Testing	93
Psychometrika	93
International Journal of…	80
Journal of Psychoeducational…	72
Educational Assessment	70
Practical Assessment,…	60
Measurement:…	57
Language Assessment Quarterly	55
Journal of Chemical Education	54
Behavioral Research and…	50
Journal of Experimental…	45
Physical Review Physics…	38
Journal of Experimental…	36
International Journal of…	35
More ▼

Journal Articles	5882
Reports - Research	5592
Reports - Evaluative	1556
Speeches/Meeting Papers	1168
Reports - Descriptive	796
Tests/Questionnaires	768
Guides - Classroom - Teacher	472
Guides - Non-Classroom	259
Dissertations/Theses -…	251
Numerical/Quantitative Data	185
Information Analyses	179
Opinion Papers	164
Guides - Classroom - Learner	162
Books	54
Collected Works - General	33
Multilingual/Bilingual…	32
Guides - General	31
Reports - General	21
Book/Product Reviews	20
ERIC Publications	20
Non-Print Media	16
ERIC Digests in Full Text	14
Collected Works - Proceedings	13
Reference Materials - General	13
Collected Works - Serials	12
More ▼

No Child Left Behind Act 2001	36
Individuals with Disabilities…	20
Every Student Succeeds Act…	5
Elementary and Secondary…	4
Race to the Top	4
Rehabilitation Act 1973…	4
Elementary and Secondary…	3
Head Start	3
Americans with Disabilities…	2
Comprehensive Education…	2
Higher Education Act…	2
Immigration Reform and…	2
Civil Rights Act 1964	1
Civil Rights Act 1964 Title…	1
Comprehensive Employment and…	1
Education Consolidation…	1
Education for All Handicapped…	1
Fair Labor Standards Act	1
Higher Education Act Title II	1
Higher Education Opportunity…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Jeanne Clery Disclosure of…	1
Job Training Partnership Act…	1
Kentucky Education Reform Act…	1
More ▼

National Assessment of…	182
Program for International…	179
SAT (College Admission Test)	137
Trends in International…	114
Test of English as a Foreign…	85
Graduate Record Examinations	74
ACT Assessment	44
Advanced Placement…	34
Texas Educational Assessment…	32
Law School Admission Test	30
Wechsler Intelligence Scale…	26
Iowa Tests of Basic Skills	25
Progress in International…	25
Stanford Achievement Tests	24
Raven Progressive Matrices	22
Armed Services Vocational…	20
International English…	20
Peabody Picture Vocabulary…	20
California Achievement Tests	18
Comprehensive Tests of Basic…	18
Test of English for…	17
Metropolitan Achievement Tests	15
General Educational…	14
Graduate Management Admission…	14
Wechsler Adult Intelligence…	13
More ▼