ERIC - Search Results

Showing 1 to 15 of 147 results

Leveraging LLM Respondents for Item Evaluation: A Psychometric Analysis

Peer reviewed

Yunting Liu; Shreya Bhandari; Zachary A. Pardos – British Journal of Educational Technology, 2025

Effective educational measurement relies heavily on the curation of well-designed item pools. However, item calibration is time consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT-3.5, GPT-4,…

Descriptors: Artificial Intelligence, Test Items, Psychometrics, Educational Assessment

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

Evaluating Population Invariance of Test Equating during the COVID-19 Pandemic

Peer reviewed

Li, Dongmei; Kapoor, Shalini – Educational Measurement: Issues and Practice, 2022

Population invariance is a desirable property of test equating which might not hold when significant changes occur in the test population, such as those brought about by the COVID-19 pandemic. This research aims to investigate whether equating functions are reasonably invariant when the test population is impacted by the pandemic. Based on…

Descriptors: Test Items, Equated Scores, COVID-19, Pandemics

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

Multi-Group Regularized Gaussian Variational Estimation: Fast Detection of DIF

Peer reviewed

Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…

Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory

The Effects of Open-Ended Probes on Closed Survey Questions in Web Surveys

Peer reviewed

Patricia Hadler – Sociological Methods & Research, 2025

Probes are follow-ups to survey questions used to gain insights on respondents' understanding of and responses to these questions. They are usually administered as open-ended questions, primarily in the context of questionnaire pretesting. Due to the decreased cost of data collection for open-ended questions in web surveys, researchers have argued…

Descriptors: Online Surveys, Discovery Processes, Test Items, Data Collection

Regarding Item Parameter Invariance for the Rasch and the 2-Parameter Logistic Models: An Investigation under Finite Non-Representative Sample Calibrations

Peer reviewed

Paek, Insu; Liang, Xinya; Lin, Zhongtian – Measurement: Interdisciplinary Research and Perspectives, 2021

The property of item parameter invariance in item response theory (IRT) plays a pivotal role in the applications of IRT such as test equating. The scope of parameter invariance when using estimates from finite biased samples in the applications of IRT does not appear to be clearly documented in the IRT literature. This article provides information…

Descriptors: Item Response Theory, Computation, Test Items, Bias

Applying Multiphase Sampling to Selecting Testlets with Constraints on Item Blocks. Research Report. ETS RR-19-03

Peer reviewed

Qian, Jiahe; Gu, Lixiong; Li, Shuhong – ETS Research Report Series, 2019

In assembling testlets (i.e., test forms) with a pool of new and used item blocks, test security is one of the main issues of concern. Strict constraints are often imposed on repeated usage of the same item blocks. Nevertheless, for an assessment administering multiple testlets, a goal is to select as large a sample of testlets as possible. In…

Descriptors: Test Construction, Sampling, Test Items, Mathematics

Response Quality in Nonprobability and Probability-Based Online Panels

Peer reviewed

Cornesse, Carina; Blom, Annelies G. – Sociological Methods & Research, 2023

Recent years have seen a growing number of studies investigating the accuracy of nonprobability online panels; however, response quality in nonprobability online panels has not yet received much attention. To fill this gap, we investigate response quality in a comprehensive study of seven nonprobability online panels and three probability-based…

Descriptors: Probability, Sampling, Social Science Research, Research Methodology

Impact of Differential Item Functioning on Group Score Reporting in the Context of Large-Scale Assessments

Peer reviewed

Joo, Sean; Ali, Usama; Robin, Frederic; Shin, Hyo Jeong – Large-scale Assessments in Education, 2022

We investigated the potential impact of differential item functioning (DIF) on group-level mean and standard deviation estimates using empirical and simulated data in the context of large-scale assessment. For the empirical investigation, PISA 2018 cognitive domains (Reading, Mathematics, and Science) data were analyzed using Jackknife sampling to…

Descriptors: Test Items, Item Response Theory, Scores, Student Evaluation

The Digital Literacy Academic Writing Scale: Exploratory Factor Analysis

Peer reviewed

Salim Nabhan; Anita Habók – SAGE Open, 2025

As the integration of digital technologies continues to shape academic landscapes, assessing digital literacy in the context of academic writing becomes paramount. Several instruments and frameworks are available for measuring digital literacy and examining it from different perspectives; however, none are suitable for measuring the digital…

Descriptors: Digital Literacy, Academic Language, Writing (Composition), Measures (Individuals)

A Bias-Corrected RMSD Item Fit Statistic: An Evaluation and Comparison to Alternatives

Peer reviewed

Köhler, Carmen; Robitzsch, Alexander; Hartig, Johannes – Journal of Educational and Behavioral Statistics, 2020

Testing whether items fit the assumptions of an item response theory model is an important step in evaluating a test. In the literature, numerous item fit statistics exist, many of which show severe limitations. The current study investigates the root mean squared deviation (RMSD) item fit statistic, which is used for evaluating item fit in…

Descriptors: Test Items, Goodness of Fit, Statistics, Bias

OECD Survey on Social and Emotional Skills 2023 Technical Report

OECD Publishing, 2025

The OECD's Survey on Social and Emotional Skills (SSES) 2023 represents the largest global initiative to gather comparable data on the development of social and emotional skills among 10- and 15-year-old students. In the 2023 cycle of SSES, 16 sites implemented an assessment of students' social and emotional skills and collected contextual…

Descriptors: Social Development, Emotional Development, Interpersonal Competence, Surveys

Evaluating the Effects of Analytical Decisions in Large-Scale Assessments: Analyzing PISA Mathematics 2003-2012

Peer reviewed

Heine, Jörg-Henrik; Robitzsch, Alexander – Large-scale Assessments in Education, 2022

Research Question: This paper examines the overarching question of to what extent different analytic choices may influence the inference about country-specific cross-sectional and trend estimates in international large-scale assessments. We take data from the assessment of PISA mathematics proficiency from the four rounds from 2003 to 2012 as a…

Descriptors: Foreign Countries, International Assessment, Achievement Tests, Secondary School Students

Toward Education Quality Improvement in China: A Brief Overview of the National Assessment of Education Quality

Peer reviewed

Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019

This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…

Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests
