ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	37
Since 2006 (last 20 years)	95

Descriptor

Comparative Analysis	107
Item Response Theory	107
Statistical Analysis	107
Test Items	41
Foreign Countries	27
Scores	27
Simulation	23
Computation	19
Correlation	16
Error of Measurement	16
Models	16
Test Bias	16
Factor Analysis	14
Goodness of Fit	14
Sample Size	14
Difficulty Level	13
Computer Assisted Testing	12
Elementary School Students	12
Mathematics Tests	12
Psychometrics	12
Test Format	12
English (Second Language)	10
Item Analysis	9
Monte Carlo Methods	9
Regression (Statistics)	9
More ▼

Publication Type

Journal Articles	90
Reports - Research	81
Reports - Evaluative	13
Speeches/Meeting Papers	8
Dissertations/Theses -…	6
Reports - Descriptive	5
Tests/Questionnaires	3
Numerical/Quantitative Data	2
Collected Works - Proceedings	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Education Level

Higher Education	22
Postsecondary Education	17
Elementary Education	14
Secondary Education	10
Middle Schools	8
Junior High Schools	6
High Schools	4
Early Childhood Education	3
Elementary Secondary Education	3
Grade 10	1
Grade 12	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 8	1
Intermediate Grades	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Researchers	2
Teachers	1

Location

Australia	4
Germany	4
Japan	3
Turkey	3
United States	3
Canada	2
France	2
Hong Kong	2
Italy	2
Norway	2
Taiwan	2
Austria	1
Bahrain	1
Belgium	1
Botswana	1
Brazil	1
China	1
China (Shanghai)	1
Finland	1
Georgia	1
Greece	1
Indiana	1
Indonesia	1
Iran	1
Kuwait	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	4
Test of English as a Foreign…	3
Law School Admission Test	2
Program for International…	2
SAT (College Admission Test)	2
Defining Issues Test	1
General Aptitude Test Battery	1
Graduate Record Examinations	1
Indiana Statewide Testing for…	1
Iowa Tests of Basic Skills	1
National Assessment of…	1
Progress in International…	1
Trends in International…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 107 results Save | Export

The Comparison of Estimation Methods for the Four-Parameter Logistic Item Response Theory Model

Peer reviewed

Direct link

Kalkan, Ömür Kaya – Measurement: Interdisciplinary Research and Perspectives, 2022

The four-parameter logistic (4PL) Item Response Theory (IRT) model has recently been reconsidered in the literature due to the advances in the statistical modeling software and the recent developments in the estimation of the 4PL IRT model parameters. The current simulation study evaluated the performance of expectation-maximization (EM),…

Descriptors: Comparative Analysis, Sample Size, Test Length, Algorithms

Detecting Differential Item Functioning: Item Response Theory Methods versus the Mantel-Haenszel Procedure

Peer reviewed
PDF on ERIC

Download full text

Diaz, Emily; Brooks, Gordon; Johanson, George – International Journal of Assessment Tools in Education, 2021

This Monte Carlo study assessed Type I error in differential item functioning analyses using Lord's chi-square (LC), Likelihood Ratio Test (LRT), and Mantel-Haenszel (MH) procedure. Two research interests were investigated: item response theory (IRT) model specification in LC and the LRT and continuity correction in the MH procedure. This study…

Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Comparative Analysis

An Investigation of Item Position Effects by Means of IRT-Based Differential Item Functioning Methods

Peer reviewed
PDF on ERIC

Download full text

Soysal, Sumeyra; Yilmaz Kogar, Esin – International Journal of Assessment Tools in Education, 2021

In this study, whether item position effects lead to DIF in the condition where different test booklets are used was investigated. To do this the methods of Lord's chi-square and Raju's unsigned area with the 3PL model under with and without item purification were used. When the performance of the methods was compared, it was revealed that…

Descriptors: Item Response Theory, Test Bias, Test Items, Comparative Analysis

Improvement of Norm Score Quality via Regression-Based Continuous Norming

Peer reviewed

Direct link

Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021

The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…

Descriptors: Test Norms, Scores, Regression (Statistics), Test Items

Tree-Based Global Model Tests for Polytomous Rasch Models

Peer reviewed

Direct link

Komboz, Basil; Strobl, Carolin; Zeileis, Achim – Educational and Psychological Measurement, 2018

Psychometric measurement models are only valid if measurement invariance holds between test takers of different groups. Global model tests, such as the well-established likelihood ratio (LR) test, are sensitive to violations of measurement invariance, such as differential item functioning and differential step functioning. However, these…

Descriptors: Item Response Theory, Models, Tests, Measurement

The Effect of Mini and Midi Anchor Tests on Test Equating

Peer reviewed
PDF on ERIC

Download full text

Arikan, Çigdem Akin – International Journal of Progressive Education, 2018

The main purpose of this study is to compare the test forms to the midi anchor test and the mini anchor test performance based on item response theory. The research was conducted with using simulated data which were generated based on Rasch model. In order to equate two test forms the anchor item nonequivalent groups (internal anchor test) was…

Descriptors: Equated Scores, Comparative Analysis, Item Response Theory, Tests

Investigating Separate and Concurrent Approaches for Item Parameter Drift in 3PL Item Response Theory Equating

Peer reviewed

Direct link

Arce-Ferrer, Alvaro J.; Bulut, Okan – International Journal of Testing, 2017

This study examines separate and concurrent approaches to combine the detection of item parameter drift (IPD) and the estimation of scale transformation coefficients in the context of the common item nonequivalent groups design with the three-parameter item response theory equating. The study uses real and synthetic data sets to compare the two…

Descriptors: Item Response Theory, Equated Scores, Identification, Computation

IRT Item Parameter Scaling for Developing New Item Pools

Peer reviewed

Direct link

Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017

Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items

The Consequences of Ignoring Item Parameter Drift in Longitudinal Item Response Models

Peer reviewed

Direct link

Lee, Wooyeol; Cho, Sun-Joo – Applied Measurement in Education, 2017

Utilizing a longitudinal item response model, this study investigated the effect of item parameter drift (IPD) on item parameters and person scores via a Monte Carlo study. Item parameter recovery was investigated for various IPD patterns in terms of bias and root mean-square error (RMSE), and percentage of time the 95% confidence interval covered…

Descriptors: Item Response Theory, Test Items, Bias, Computation

Using the Stan Program for Bayesian Item Response Theory

Peer reviewed

Direct link

Luo, Yong; Jiao, Hong – Educational and Psychological Measurement, 2018

Stan is a new Bayesian statistical software program that implements the powerful and efficient Hamiltonian Monte Carlo (HMC) algorithm. To date there is not a source that systematically provides Stan code for various item response theory (IRT) models. This article provides Stan code for three representative IRT models, including the…

Descriptors: Bayesian Statistics, Item Response Theory, Probability, Computer Software

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Direct link

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Evaluation of Different Scoring Rules for a Noncognitive Test in Development. Research Report. ETS RR-16-03

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016

In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…

Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics

Differential Item Functioning Detection across Two Methods of Defining Group Comparisons: Pairwise and Composite Group Comparisons

Peer reviewed

Direct link

Sari, Halil Ibrahim; Huggins, Anne Corinne – Educational and Psychological Measurement, 2015

This study compares two methods of defining groups for the detection of differential item functioning (DIF): (a) pairwise comparisons and (b) composite group comparisons. We aim to emphasize and empirically support the notion that the choice of pairwise versus composite group definitions in DIF is a reflection of how one defines fairness in DIF…

Descriptors: Test Bias, Comparative Analysis, Statistical Analysis, College Entrance Examinations

A Comparison of IRT Proficiency Estimation Methods under Adaptive Multistage Testing

Peer reviewed

Direct link

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015

This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…

Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy

Use of Item Response Curves of the Force and Motion Conceptual Evaluation to Compare Japanese and American Students' Views on Force and Motion

Peer reviewed

Direct link

Ishimoto, Michi; Davenport, Glen; Wittmann, Michael C. – Physical Review Physics Education Research, 2017

Student views of force and motion reflect the personal experiences and physics education of the student. With a different language, culture, and educational system, we expect that Japanese students' views on force and motion might be different from those of American students. The Force and Motion Conceptual Evaluation (FMCE) is an instrument used…

Descriptors: Item Response Theory, Physics, Motion, Comparative Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

ETS Research Report Series	11
Educational and Psychological…	10
Applied Psychological…	8
ProQuest LLC	6
Grantee Submission	5
Applied Measurement in…	4
Journal of Educational…	4
Language Testing	3
ACT, Inc.	2
CBE - Life Sciences Education	2
International Journal of…	2
International Journal of…	2
Journal of Educational and…	2
Structural Equation Modeling:…	2
Adults Learning Mathematics	1
American Educational Research…	1
Assessment & Evaluation in…	1
Chemistry Education Research…	1
College Board	1
Deafness & Education…	1
Early Child Development and…	1
Educational Evaluation and…	1
Educational Psychology	1
Educational Research and…	1
Educational Research and…	1
More ▼

Cho, Sun-Joo	4
Cai, Li	3
Puhan, Gautam	3
Sinharay, Sandip	3
Chang, Hua-Hua	2
Choi, Seung W.	2
Chuang, Chi-ching	2
DeBoer, George E.	2
DeMars, Christine E.	2
Fujiki, Mayo	2
Herman, Keith	2
Herrmann-Abell, Cari F.	2
Kim, Sooyeon	2
Maydeu-Olivares, Alberto	2
Moses, Tim	2
Qian, Jiahe	2
Reinke, Wendy	2
Rohrer, David	2
Wang, Ze	2
Woods, Carol M.	2
Young, John W.	2
Abdelfattah, Faisal	1
Abduljabbar, Adel Salah	1
Abu-Hilal, Maher M.	1
Adedoyin, O. O.	1
More ▼