ERIC - Search Results

Publication Date

In 2025	3
Since 2024	7
Since 2021 (last 5 years)	32
Since 2016 (last 10 years)	76
Since 2006 (last 20 years)	134

Descriptor

Test Items	221
Test Length	221
Item Response Theory	89
Sample Size	65
Test Construction	65
Computer Assisted Testing	52
Simulation	51
Adaptive Testing	50
Test Reliability	43
Comparative Analysis	40
Difficulty Level	40
Error of Measurement	40
Item Analysis	37
Accuracy	36
Test Format	36
Correlation	30
Computation	29
Statistical Analysis	29
Test Validity	29
Monte Carlo Methods	28
Test Bias	28
Models	27
Scores	27
Item Banks	24
Goodness of Fit	22

Publication Type

Reports - Research	151
Journal Articles	133
Reports - Evaluative	40
Speeches/Meeting Papers	32
Dissertations/Theses -…	19
Reports - Descriptive	7
Numerical/Quantitative Data	6
Guides - Non-Classroom	4
Information Analyses	2
Opinion Papers	2
Tests/Questionnaires	2
Historical Materials	1
Reference Materials -…	1

Education Level

Higher Education	13
Postsecondary Education	12
Elementary Education	6
Elementary Secondary Education	6
Secondary Education	5
Early Childhood Education	3
Grade 3	3
High Schools	3
Grade 6	2
Intermediate Grades	2
Middle Schools	2
Primary Education	2
Grade 11	1
Grade 12	1
Preschool Education	1

Audience

Researchers	9
Administrators	1
Community	1
Practitioners	1

Location

Turkey	2
Alabama	1
Asia	1
Australia	1
Germany	1
Illinois (Chicago)	1
Indiana	1
Iran	1
Israel	1
Japan	1
Netherlands	1
New Jersey	1
South Korea	1
Taiwan	1
Ukraine	1

Laws, Policies, & Programs

Job Training Partnership Act…	1
Race to the Top	1

Showing 1 to 15 of 221 results

The NEAT Equating via Chaining Random Forests in the Context of Small Sample Sizes: A Machine-Learning Method

Peer reviewed

Jiang, Zhehan; Han, Yuting; Xu, Lingling; Shi, Dexin; Liu, Ren; Ouyang, Jinying; Cai, Fen – Educational and Psychological Measurement, 2023

The part of responses that is absent in the nonequivalent groups with anchor test (NEAT) design can be treated as a planned missing scenario. In the context of small sample sizes, we present a machine learning (ML)-based imputation technique called chaining random forests (CRF) to perform equating tasks within the NEAT design. Specifically, seven…

Descriptors: Test Items, Equated Scores, Sample Size, Artificial Intelligence

The Effect of Polytomous Item Ratio on Ability Estimation in Multistage Tests

Peer reviewed
PDF on ERIC

Hasibe Yahsi Sari; Hulya Kelecioglu – International Journal of Assessment Tools in Education, 2025

The aim of the study is to examine the effect of polytomous item ratio on ability estimation in different conditions in multistage tests (MST) using mixed tests. The study is simulation-based research. In the PISA 2018 application, the ability parameters of the individuals and the item pool were created by using the item parameters estimated from…

Descriptors: Test Items, Test Format, Accuracy, Test Length

The Impact of Scoring Later on Mixed Format Adaptive Testing

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, and numbers and locations of polytomous items. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model

Peer reviewed

Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…

Descriptors: True Scores, Equated Scores, Test Items, Sample Size

A Simulation Study on the Performance of Different Reliability Estimation Methods

Peer reviewed

Edwards, Ashley A.; Joyner, Keanan J.; Schatschneider, Christopher – Educational and Psychological Measurement, 2021

The accuracy of certain internal consistency estimators has been questioned in recent years. The present study tests the accuracy of six reliability estimators (Cronbach's alpha, omega, omega hierarchical, Revelle's omega, and greatest lower bound) in 140 simulated conditions of unidimensional continuous data with uncorrelated errors with varying…

Descriptors: Reliability, Computation, Accuracy, Sample Size

A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams

Peer reviewed

Lang, Joseph B. – Journal of Educational and Behavioral Statistics, 2023

This article is concerned with the statistical detection of copying on multiple-choice exams. As an alternative to existing permutation- and model-based copy-detection approaches, a simple randomization p-value (RP) test is proposed. The RP test, which is based on an intuitive match-score statistic, makes no assumptions about the distribution of…

Descriptors: Identification, Cheating, Multiple Choice Tests, Item Response Theory

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Type I Error and Power Rates: A Comparative Analysis of Techniques in Differential Item Functioning

Peer reviewed
PDF on ERIC

Ayse Bilicioglu Gunes; Bayram Bicak – International Journal of Assessment Tools in Education, 2023

The main purpose of this study is to examine the Type I error and statistical power ratios of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted by using Mantel-Haenszel (MH), Logistic Regression (LR), Lord's [chi-squared], and Raju's Areas…

Descriptors: Test Items, Item Response Theory, Error of Measurement, Test Bias

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Exploring Number of Response Categories in Factor Analysis: Implications for Sample Size

Peer reviewed
PDF on ERIC

Fatih Orçan – International Journal of Assessment Tools in Education, 2025

Factor analysis is a statistical method to explore the relationships among observed variables and identify latent structures. It is crucial in scale development and validity analysis. Key factors affecting the accuracy of factor analysis results include the type of data, sample size, and the number of response categories. While some studies…

Descriptors: Factor Analysis, Factor Structure, Item Response Theory, Sample Size

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

Evaluation of Factors Affecting the Performance of the "S - X[superscript 2]" Item-Fit Index

Peer reviewed

Kim, Hyung Jin; Lee, Won-Chan – Journal of Educational Measurement, 2022

Orlando and Thissen (2000) introduced the "S - X[superscript 2]" item-fit index for testing goodness-of-fit with dichotomous item response theory (IRT) models. This study considers and evaluates an alternative approach for computing "S - X[superscript 2]" values and other factors associated with collapsing tables of observed…

Descriptors: Goodness of Fit, Test Items, Item Response Theory, Computation

A Comparison of the Efficacies of Differential Item Functioning Detection Methods

Peer reviewed
PDF on ERIC

Basman, Munevver – International Journal of Assessment Tools in Education, 2023

Ensuring the validity of a test requires checking that all items yield similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…

Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory

The Enhanced ACT Linking Study Report. ACT Research. Research Paper. R2515

Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025

Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…

Descriptors: College Entrance Examinations, Testing, Change, Scoring

Source

Educational and Psychological…	33
ProQuest LLC	19
Journal of Educational…	15
Applied Measurement in…	9
Applied Psychological…	9
ETS Research Report Series	9
International Journal of…	7
International Journal of…	6
Journal of Educational and…	5
Measurement:…	4
Assessment & Evaluation in…	2
Educational Sciences: Theory…	2
Eurasian Journal of…	2
Grantee Submission	2
Journal of Experimental…	2
Journal of Psychoeducational…	2
Journal of Technology,…	2
ACT Education Corp.	1
AERA Online Paper Repository	1
Advanced Education	1
Anatomical Sciences Education	1
Asia Pacific Education Review	1
Assessment and Evaluation in…	1
College Entrance Examination…	1
College Student Journal	1

Author

Wainer, Howard	6
Hambleton, Ronald K.	4
Wang, Wen-Chung	4
Berk, Ronald A.	3
Burton, Richard F.	3
Cohen, Allan S.	3
Huggins-Manley, Anne Corinne	3
Lee, Won-Chan	3
Lee, Yi-Hsuan	3
Pommerich, Mary	3
Reckase, Mark D.	3
Sijtsma, Klaas	3
Wang, Chun	3
Weiss, David J.	3
Zhang, Jinming	3
Bradshaw, Laine	2
Bulut, Okan	2
Chen, Shu-Ying	2
Cheng, Ying	2
Chernyshenko, Oleksandr S.	2
Cui, Ying	2
De Ayala, R. J.	2
Diao, Qi	2
Dogan, Nuri	2

Assessments and Surveys

Test of English as a Foreign…	3
Trends in International…	3
Program for International…	2
SAT (College Admission Test)	2
ACT Assessment	1
Advanced Placement…	1
Armed Forces Qualification…	1
COMPASS (Computer Assisted…	1
Comprehensive Tests of Basic…	1
Force Concept Inventory	1
Iowa Tests of Basic Skills	1
MacArthur Communicative…	1
Medical College Admission Test	1
National Longitudinal Study…	1
New Jersey College Basic…	1
Otis Lennon School Ability…	1
Raven Advanced Progressive…	1
School and College Ability…	1
Stanford Binet Intelligence…	1
Texas Assessment of Basic…	1
Texas Educational Assessment…	1
Wechsler Intelligence Scale…	1
Wechsler Intelligence Scales…	1