ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	41
Since 2006 (last 20 years)	109

Descriptor

Statistical Analysis	178
Test Items	178
Computer Assisted Testing	64
Test Construction	44
Item Analysis	40
Item Response Theory	36
Foreign Countries	35
Testing	35
Comparative Analysis	34
Scores	32
Difficulty Level	31
Adaptive Testing	27
Testing Problems	25
Multiple Choice Tests	24
Language Tests	22
Test Bias	22
Simulation	21
Test Format	21
Hypothesis Testing	20
English (Second Language)	18
Item Banks	18
Mathematical Models	18
Test Reliability	18
Test Validity	18
Latent Trait Theory	17

Publication Type

Reports - Research	132
Journal Articles	121
Reports - Evaluative	28
Speeches/Meeting Papers	28
Reports - Descriptive	10
Tests/Questionnaires	9
Dissertations/Theses -…	3
Numerical/Quantitative Data	3
Collected Works - Proceedings	2
Opinion Papers	2
Books	1
Collected Works - General	1
Guides - Classroom - Learner	1
Guides - Non-Classroom	1

Education Level

Higher Education	32
Postsecondary Education	30
Elementary Education	14
Secondary Education	11
Middle Schools	9
Junior High Schools	7
Elementary Secondary Education	6
Grade 7	4
Grade 8	4
High Schools	4
Grade 4	3
Grade 6	3
Early Childhood Education	1
Grade 3	1
Grade 5	1
Intermediate Grades	1

Audience

Researchers	10
Practitioners	1
Teachers	1

Location

Germany	5
Netherlands	4
Australia	3
Japan	3
Turkey	3
California	2
Canada	2
Israel	2
New Jersey	2
Nigeria	2
Pennsylvania	2
Texas	2
United States	2
Africa	1
Austria	1
Botswana	1
China	1
Delaware	1
Europe	1
Florida	1
Indiana	1
Malaysia	1
Maryland	1
Massachusetts	1
Netherlands (Amsterdam)	1

Assessments and Surveys

Test of English as a Foreign…	5
Graduate Record Examinations	4
SAT (College Admission Test)	3
California Achievement Tests	1
Comprehensive Tests of Basic…	1
Defining Issues Test	1
Florida Comprehensive…	1
International English…	1
Iowa Tests of Basic Skills	1
Praxis Series	1
Program for International…	1
Stanford Binet Intelligence…	1
Test of English for…	1
Texas Essential Knowledge and…	1
Trends in International…	1
United States Medical…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 178 results

Comparison of Kernel Equating Methods under NEAT and NEC Designs

Peer reviewed
PDF on ERIC

Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023

In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…

Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis

Use of the Lagrange Multiplier Test for Assessing Measurement Invariance under Model Misspecification

Peer reviewed

Guastadisegni, Lucia; Cagnone, Silvia; Moustaki, Irini; Vasdekis, Vassilis – Educational and Psychological Measurement, 2022

This article studies the Type I error, false positive rates, and power of four versions of the Lagrange multiplier test to detect measurement noninvariance in item response theory (IRT) models for binary data under model misspecification. The tests considered are the Lagrange multiplier test computed with the Hessian and cross-product approach,…

Descriptors: Measurement, Statistical Analysis, Item Response Theory, Test Items

On the Generalized S-X²-Test of Item Fit: Some Variants, Residuals, and a Graphical Visualization

Peer reviewed

Ranger, Jochen; Brauer, Kay – Journal of Educational and Behavioral Statistics, 2022

The generalized S-X²-test is a test of item fit for items with a polytomous response format. The test is based on a comparison of the observed and expected number of responses in strata defined by the test score. In this article, we make four contributions. We demonstrate that the performance of the generalized S-X²-test…

Descriptors: Goodness of Fit, Test Items, Statistical Analysis, Item Response Theory

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, post-stratification kernel equating, and circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

The Lack of Robustness of a Statistic Based on the Neyman-Pearson Lemma to Violations of Its Underlying Assumptions

Peer reviewed
PDF on ERIC

Sinharay, Sandip – Grantee Submission, 2021

Drasgow, Levine, and Zickar (1996) suggested a statistic based on the Neyman-Pearson lemma (e.g., Lehmann & Romano, 2005, p. 60) for detecting preknowledge on a known set of items. The statistic is a special case of the optimal appropriateness indices of Levine and Drasgow (1988) and is the most powerful statistic for detecting item…

Descriptors: Robustness (Statistics), Hypothesis Testing, Statistics, Test Items

Evaluating CAT-Adjusted Approaches for Suspected Item Parameter Drift Detection

Peer reviewed

Cappaert, Kevin J.; Wen, Yao; Chang, Yu-Feng – Measurement: Interdisciplinary Research and Perspectives, 2018

Events such as curriculum changes or practice effects can lead to item parameter drift (IPD) in computer adaptive testing (CAT). The current investigation introduced a point- and weight-adjusted D² method for IPD detection for use in a CAT environment when items are suspected of drifting across test administrations. Type I error and…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Identification

Score Comparability Issues with At-Home Testing and How to Address Them

Peer reviewed

Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022

As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…

Descriptors: Scores, Scoring, Comparative Analysis, Testing

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

The Pearson product-moment correlation coefficient between item g and test score X, known as the item-test or item-total correlation ("Rit"), and the item-rest correlation ("Rir") are two of the most widely used classical estimators of item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level
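
For readers unfamiliar with the two coefficients named in the abstract above, the following is a minimal Python sketch of the item-test ("Rit") and item-rest ("Rir") correlations. It is not taken from the article: the function names and the small 0/1 response matrix are hypothetical, and "Rir" is computed in the usual way, by correlating the item with the total score after that item has been removed.

```python
from statistics import mean, pstdev


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    sx, sy = pstdev(x), pstdev(y)
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return cov / (sx * sy)


def item_test_and_rest(responses, g):
    """Return (Rit, Rir) for item index g.

    responses: list of rows, one per examinee, each a list of item scores.
    """
    item = [row[g] for row in responses]
    total = [sum(row) for row in responses]          # test score X
    rest = [t - i for t, i in zip(total, item)]      # X with item g removed
    return pearson(item, total), pearson(item, rest)


# Hypothetical data: 6 examinees (rows) by 4 dichotomous items (columns).
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]
rit, rir = item_test_and_rest(responses, 0)
print(f"Rit = {rit:.3f}, Rir = {rir:.3f}")
```

Because the item contributes to the total score, "Rit" is typically larger than "Rir" for the same item, which is why the two are reported separately.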

A Comparison of Methods for Detecting Examinee Preknowledge of Items

Peer reviewed

Wang, Xi; Liu, Yang; Robin, Frederic; Guo, Hongwen – International Journal of Testing, 2019

In an on-demand testing program, some items are repeatedly used across test administrations. This poses a risk to test security. In this study, we considered a scenario wherein a test was divided into two subsets: one consisting of secure items and the other consisting of possibly compromised items. In a simulation study of multistage adaptive…

Descriptors: Identification, Methods, Test Items, Cheating

Bayesian Estimation and Testing of a Linear Logistic Test Model for Learning during the Test

Peer reviewed

Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021

The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…

Descriptors: Bayesian Statistics, Computation, Learning, Testing

Generalized Discrimination Index

Peer reviewed
PDF on ERIC

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Kelley's Discrimination Index (DI) is a simple and robust classical non-parametric shortcut for estimating item discrimination power (IDP) in practical educational settings. Unlike the item-total correlation, DI can reach the extreme values of +1 and -1, and it is stable against outliers. Because of its computational ease, DI is…

Descriptors: Test Items, Computation, Item Analysis, Nonparametric Statistics
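
As a rough illustration of the index described in the abstract above, here is a minimal Python sketch of a Kelley-style discrimination index. It assumes the conventional definition, which the excerpt itself does not spell out: the proportion answering the item correctly in the top-scoring group minus the proportion in the bottom-scoring group, with each group taken as 27% of examinees. The function name, the `fraction` parameter, and the data are hypothetical, and this is the classical index rather than the generalized version the article proposes.

```python
def kelley_di(responses, g, fraction=0.27):
    """Discrimination index for dichotomous item g.

    responses: list of rows, one per examinee, each a list of 0/1 scores.
    fraction:  share of examinees in each extreme group (assumed 27%).
    """
    n = len(responses)
    k = max(1, round(n * fraction))        # size of each extreme group
    # Rank examinees by total score; ties keep their input order.
    ranked = sorted(responses, key=sum)
    lower, upper = ranked[:k], ranked[-k:]
    p_upper = sum(row[g] for row in upper) / k
    p_lower = sum(row[g] for row in lower) / k
    return p_upper - p_lower


# Hypothetical data: 5 examinees (rows) by 4 dichotomous items (columns).
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
di = kelley_di(responses, 0)
print(f"DI for item 0 = {di:.2f}")
```

With only the two extreme groups compared, an item that every top scorer answers correctly and every bottom scorer misses reaches +1, and the reverse pattern reaches -1, which matches the range stated in the abstract.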

Detection of Item Preknowledge Using Response Times

Peer reviewed
PDF on ERIC

Sinharay, Sandip – Grantee Submission, 2019

Benefiting from item preknowledge (e.g., McLeod, Lewis, & Thissen, 2003) is a major type of fraudulent behavior during educational assessments. This paper suggests a new statistic that can be used for detecting the examinees who may have benefitted from item preknowledge using their response times. The statistic quantifies the difference in…

Descriptors: Test Items, Cheating, Reaction Time, Identification

Optimizing Practice Scheduling Requires Quantitative Tracking of Individual Item Performance

Peer reviewed
PDF on ERIC

Luke G. Eglington; Philip I. Pavlik – Grantee Submission, 2020

Decades of research have shown that spacing practice trials over time can improve later memory, but there are few concrete recommendations concerning how to optimally space practice. We show that existing recommendations are inherently suboptimal due to their insensitivity to time costs and individual- and item-level differences. We introduce an…

Descriptors: Scheduling, Drills (Practice), Memory, Testing

Optimizing Practice Scheduling Requires Quantitative Tracking of Individual Item Performance

Peer reviewed

Luke G. Eglington; Philip I. Pavlik Jr. – npj Science of Learning, 2020

Descriptors: Scheduling, Drills (Practice), Memory, Testing

Investigating the Performance of Omega Index According to Item Parameters and Ability Levels

Peer reviewed
PDF on ERIC

Sunbul, Onder; Yormaz, Seha – Eurasian Journal of Educational Research, 2018

Purpose: Several studies in the literature investigate the performance of ω under various conditions. However, no study of the effects of item difficulty, item discrimination, and ability restrictions on the performance of ω could be found. The current study aims to investigate the performance of ω for the conditions given below.…

Descriptors: Test Items, Difficulty Level, Ability, Cheating

Source

ETS Research Report Series	14
Educational and Psychological…	13
Applied Psychological…	7
Journal of Educational and…	7
Language Testing	6
Journal of Educational…	5
Applied Measurement in…	4
Grantee Submission	4
International Journal of…	4
Eurasian Journal of…	3
Language Assessment Quarterly	3
ProQuest LLC	3
Educational Measurement:…	2
European Journal of…	2
International Journal of…	2
Journal of Educational…	2
Journal of Experimental…	2
Journal of Interactive Online…	2
Peabody Journal of Education	2
Practical Assessment,…	2
Research-publishing.net	2
African Higher Education…	1
Australasian Journal of…	1
Behavioral Research and…	1
Bilingual Research Journal	1

Author

Chang, Hua-Hua	3
Guo, Hongwen	3
Kim, Sooyeon	3
Sinharay, Sandip	3
Dorans, Neil	2
Dorans, Neil J.	2
Han, Kyung T.	2
Huebner, Alan	2
Kelderman, Henk	2
Lee, Yi-Hsuan	2
Legg, Sue M.	2
Liu, Jinghua	2
Livingston, Samuel A.	2
Luke G. Eglington	2
Macready, George B.	2
Metsämuuronen, Jari	2
Puhan, Gautam	2
Ranger, Jochen	2
Robin, Frederic	2
Wang, Chun	2
Wilcox, Rand R.	2
van der Linden, Wim J.	2
Abayeva, Nella F.	1
Adedokun, Omolola A.	1