ERIC - Search Results

Publication Date

In 2026	0
Since 2025	6
Since 2022 (last 5 years)	16
Since 2017 (last 10 years)	36
Since 2007 (last 20 years)	83

Descriptor

Test Items	83
Sampling	74
Foreign Countries	32
Item Response Theory	30
Test Construction	21
Difficulty Level	18
Sample Size	17
Computation	16
Statistical Analysis	16
Comparative Analysis	15
Equated Scores	14
Error of Measurement	13
Scores	13
Data Analysis	12
Questionnaires	11
Scaling	11
Accuracy	10
Item Sampling	10
Psychometrics	10
Achievement Tests	9
International Assessment	9
Reliability	9
Scoring	9
Simulation	9
Test Bias	9
More ▼

Publication Type

Journal Articles	58
Reports - Research	56
Reports - Descriptive	9
Reports - Evaluative	9
Numerical/Quantitative Data	7
Tests/Questionnaires	7
Dissertations/Theses -…	5
Collected Works - General	3
Speeches/Meeting Papers	3
Reports - General	2
Books	1
Guides - Non-Classroom	1
Information Analyses	1
More ▼

Education Level

Secondary Education	13
Elementary Education	9
Elementary Secondary Education	8
Higher Education	8
Postsecondary Education	8
Grade 8	5
Junior High Schools	5
Middle Schools	5
Grade 4	4
Grade 6	3
High Schools	3
Intermediate Grades	3
Grade 11	2
Grade 12	2
Grade 9	2
Grade 10	1
Grade 7	1
More ▼

Audience

Researchers

Location

Germany	7
Australia	6
Japan	4
United States	4
Canada	3
Chile	3
Italy	3
Asia	2
China	2
Denmark	2
France	2
Indonesia	2
South Korea	2
Spain	2
Austria	1
Belgium	1
Bosnia and Herzegovina	1
Brazil	1
Bulgaria	1
Colombia (Bogota)	1
Croatia	1
Cyprus	1
Czech Republic	1
Estonia	1
Europe	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1
Perkins Loan Program	1

Assessments and Surveys

Program for International…	9
National Assessment of…	2
Progress in International…	2
Trends in International…	2
Child Behavior Checklist	1
Flesch Kincaid Grade Level…	1
International Association for…	1
National Longitudinal Study…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 83 results Save | Export

Evaluation of Exam Questions Using Bootstrapping: Practical Applications in R and SPSS with a Case Study

Peer reviewed

Direct link

Changiz Mohiyeddini – Anatomical Sciences Education, 2025

This article presents a step-by-step guide to using R and SPSS to bootstrap exam questions. Bootstrapping, a versatile nonparametric analytical technique, can help to improve the psychometric qualities of exam questions in the process of quality assurance. Bootstrapping is particularly useful in disciplines such as medical education, where student…

Descriptors: Test Items, Sampling, Statistical Inference, Nonparametric Statistics

DISTO: Textual Distractors for Multiple Choice Reading Comprehension Questions Using Negative Sampling

Peer reviewed
PDF on ERIC

Download full text

Bilal Ghanem; Alona Fyshe – International Educational Data Mining Society, 2024

Multiple choice questions (MCQs) are a common way to assess reading comprehension. Every MCQ needs a set of distractor answers that are incorrect, but plausible enough to test student knowledge. However, good distractors are hard to create. Distractor generation (DG) models have been proposed, and their performance is typically evaluated using…

Descriptors: Multiple Choice Tests, Reading Comprehension, Test Items, Testing

Leveraging LLM Respondents for Item Evaluation: A Psychometric Analysis

Peer reviewed

Direct link

Yunting Liu; Shreya Bhandari; Zachary A. Pardos – British Journal of Educational Technology, 2025

Effective educational measurement relies heavily on the curation of well-designed item pools. However, item calibration is time consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT-3.5, GPT-4,…

Descriptors: Artificial Intelligence, Test Items, Psychometrics, Educational Assessment

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

Enhancing Exam Question Quality in Medical Education through Bootstrapping

Peer reviewed

Direct link

Changiz Mohiyeddini – Anatomical Sciences Education, 2025

Medical schools are required to assess and evaluate their curricula and to develop exam questions with strong reliability and validity evidence, often based on data derived from statistically small samples of medical students. Achieving a large enough sample to reliably and validly evaluate courses, assessments, and exam questions would require…

Descriptors: Medical Education, Medical Students, Medical Schools, Tests

Evaluating Population Invariance of Test Equating during the COVID-19 Pandemic

Peer reviewed

Direct link

Li, Dongmei; Kapoor, Shalini – Educational Measurement: Issues and Practice, 2022

Population invariance is a desirable property of test equating which might not hold when significant changes occur in the test population, such as those brought about by the COVID-19 pandemic. This research aims to investigate whether equating functions are reasonably invariant when the test population is impacted by the pandemic. Based on…

Descriptors: Test Items, Equated Scores, COVID-19, Pandemics

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

Multi-Group Regularized Gaussian Variational Estimation: Fast Detection of DIF

Peer reviewed

Direct link

Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…

Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory

The Effects of Open-Ended Probes on Closed Survey Questions in Web Surveys

Peer reviewed

Direct link

Patricia Hadler – Sociological Methods & Research, 2025

Probes are follow-ups to survey questions used to gain insights on respondents' understanding of and responses to these questions. They are usually administered as open-ended questions, primarily in the context of questionnaire pretesting. Due to the decreased cost of data collection for open-ended questions in web surveys, researchers have argued…

Descriptors: Online Surveys, Discovery Processes, Test Items, Data Collection

Regarding Item Parameter Invariance for the Rasch and the 2-Parameter Logistic Models: An Investigation under Finite Non-Representative Sample Calibrations

Peer reviewed

Direct link

Paek, Insu; Liang, Xinya; Lin, Zhongtian – Measurement: Interdisciplinary Research and Perspectives, 2021

The property of item parameter invariance in item response theory (IRT) plays a pivotal role in the applications of IRT such as test equating. The scope of parameter invariance when using estimates from finite biased samples in the applications of IRT does not appear to be clearly documented in the IRT literature. This article provides information…

Descriptors: Item Response Theory, Computation, Test Items, Bias

Designing and Evaluating Tasks to Measure Individual Differences in Experimental Psychology: A Tutorial

Peer reviewed

Direct link

Marc Brysbaert – Cognitive Research: Principles and Implications, 2024

Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…

Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis

Application of the Professional Maturity Scale as a Computerized Adaptive Testing

Peer reviewed
PDF on ERIC

Download full text

Süleyman Demir; Derya Çobanoglu Aktan; Nese Güler – International Journal of Assessment Tools in Education, 2023

This study has two main purposes. Firstly, to compare the different item selection methods and stopping rules used in Computerized Adaptive Testing (CAT) applications with simulative data generated based on the item parameters of the Vocational Maturity Scale. Secondly, to test the validity of CAT application scores. For the first purpose,…

Descriptors: Computer Assisted Testing, Adaptive Testing, Vocational Maturity, Measures (Individuals)

Applying Multiphase Sampling to Selecting Testlets with Constraints on Item Blocks. Research Report. ETS RR-19-03

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe; Gu, Lixiong; Li, Shuhong – ETS Research Report Series, 2019

In assembling testlets (i.e., test forms) with a pool of new and used item blocks, test security is one of the main issues of concern. Strict constraints are often imposed on repeated usage of the same item blocks. Nevertheless, for an assessment administering multiple testlets, a goal is to select as large a sample of testlets as possible. In…

Descriptors: Test Construction, Sampling, Test Items, Mathematics

Response Quality in Nonprobability and Probability-Based Online Panels

Peer reviewed

Direct link

Cornesse, Carina; Blom, Annelies G. – Sociological Methods & Research, 2023

Recent years have seen a growing number of studies investigating the accuracy of nonprobability online panels; however, response quality in nonprobability online panels has not yet received much attention. To fill this gap, we investigate response quality in a comprehensive study of seven nonprobability online panels and three probability-based…

Descriptors: Probability, Sampling, Social Science Research, Research Methodology

Impact of Differential Item Functioning on Group Score Reporting in the Context of Large-Scale Assessments

Peer reviewed

Direct link

Joo, Sean; Ali, Usama; Robin, Frederic; Shin, Hyo Jeong – Large-scale Assessments in Education, 2022

We investigated the potential impact of differential item functioning (DIF) on group-level mean and standard deviation estimates using empirical and simulated data in the context of large-scale assessment. For the empirical investigation, PISA 2018 cognitive domains (Reading, Mathematics, and Science) data were analyzed using Jackknife sampling to…

Descriptors: Test Items, Item Response Theory, Scores, Student Evaluation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

ETS Research Report Series	7
Educational and Psychological…	5
ProQuest LLC	5
Applied Measurement in…	4
International Association for…	4
Journal of Educational and…	4
Applied Psychological…	3
Educational Measurement:…	3
OECD Publishing	3
Online Submission	3
Anatomical Sciences Education	2
International Journal of…	2
International Journal of…	2
Journal of Education and…	2
Large-scale Assessments in…	2
Measurement:…	2
Ministerial Council on…	2
National Center for Education…	2
Sociological Methods &…	2
AERA Online Paper Repository	1
British Journal of…	1
Chemistry Education Research…	1
Cognitive Research:…	1
Council of the Great City…	1
Educational Testing Service	1
More ▼

Kim, Sooyeon	5
Livingston, Samuel A.	3
Ainley, John, Ed.	2
Changiz Mohiyeddini	2
Donovan, Jenny	2
Fraillon, Julian, Ed.	2
Guo, Hongwen	2
Lennon, Melissa	2
Lu, Ru	2
Qian, Jiahe	2
Robitzsch, Alexander	2
Schulz, Wolfram, Ed.	2
Adeleke, A. A.	1
Agbelevor, Emelia Afi	1
Ali, Usama	1
Alona Fyshe	1
Anita Habók	1
Anwyll, Steve	1
Appiah, Samson Obed	1
Arikan, Çigdem Akin	1
Aviani, Ivica	1
Babcock, Ben	1
Baird, Jo-Anne	1
Bashkov, Bozhidar M.	1
Bilal Ghanem	1
More ▼