Publication Date
In 2025: 0
Since 2024: 1
Since 2021 (last 5 years): 8
Since 2016 (last 10 years): 21
Since 2006 (last 20 years): 46
Descriptor
Difficulty Level: 69
Error of Measurement: 69
Test Items: 69
Item Response Theory: 36
Test Construction: 16
Test Reliability: 16
Comparative Analysis: 14
Item Analysis: 13
Statistical Analysis: 13
Simulation: 12
Computation: 11
Author
Alonzo, Julie: 6
Tindal, Gerald: 6
Feigenbaum, Miriam: 3
Finch, Holmes: 3
Liu, Jinghua: 3
Paek, Insu: 3
Schoen, Robert C.: 3
Sinharay, Sandip: 3
Yang, Xiaotong: 3
Curley, Edward: 2
Holland, Paul: 2
Publication Type
Reports - Research: 56
Journal Articles: 39
Speeches/Meeting Papers: 12
Reports - Evaluative: 10
Numerical/Quantitative Data: 7
Reports - Descriptive: 3
Information Analyses: 1
Tests/Questionnaires: 1
Education Level
Elementary Education: 10
Secondary Education: 6
Grade 4: 5
Higher Education: 5
Middle Schools: 5
Postsecondary Education: 5
Early Childhood Education: 4
Grade 3: 4
Grade 5: 4
Junior High Schools: 4
Primary Education: 4
Audience
Researchers: 4
Location
Austria: 1
Belgium: 1
Canada: 1
Chile: 1
Florida: 1
Germany: 1
Indonesia: 1
Luxembourg: 1
New Jersey: 1
Philippines: 1
United Kingdom (England): 1
Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study uses an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision under the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
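For context (a standard statement of the model, not taken from the paper itself), Masters' Partial Credit Model gives the probability that a person with ability θ scores in category x of item i:

```latex
% Masters' Partial Credit Model: probability of scoring in category x
% (of categories 0..m_i) on item i, with step difficulties \delta_{ij}.
P_{ix}(\theta) =
  \frac{\exp\!\Big(\sum_{j=0}^{x}(\theta-\delta_{ij})\Big)}
       {\sum_{h=0}^{m_i}\exp\!\Big(\sum_{j=0}^{h}(\theta-\delta_{ij})\Big)},
\qquad \sum_{j=0}^{0}(\theta-\delta_{ij}) \equiv 0 .
```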
Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, poststratification kernel equating, and circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
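As background on the SEE criterion named in this entry (a standard definition, not quoted from the article), the standard error of equating at score x is the sampling standard deviation of the estimated equating function, and bias is its systematic deviation from the criterion function:

```latex
% SEE and bias at raw score x, where \hat{e}_Y(x) is the estimated
% X-to-Y equating function and e_Y(x) the criterion (true) function:
\mathrm{SEE}(x) = \sqrt{\operatorname{Var}\big(\hat{e}_Y(x)\big)},
\qquad
\mathrm{Bias}(x) = \mathbb{E}\big[\hat{e}_Y(x)\big] - e_Y(x).
```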
Akin-Arikan, Çigdem; Gelbal, Selahattin – Eurasian Journal of Educational Research, 2021
Purpose: This study aims to compare the performance of item response theory (IRT) equating and kernel equating (KE) methods based on equating error (RMSD) and the standard error of equating (SEE) using the anchor-item nonequivalent groups design. Method: Within this scope, a set of conditions, including ability distribution, type of anchor items…
Descriptors: Equated Scores, Item Response Theory, Test Items, Statistical Analysis
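A common way simulation studies of this kind compute the RMSD criterion (offered as background, since the abstract is truncated) is against the known population equating function over the N score points:

```latex
% RMSD of an estimated equating function relative to the criterion e_Y:
\mathrm{RMSD} = \sqrt{\frac{1}{N}\sum_{i=1}^{N}
  \big(\hat{e}_Y(x_i) - e_Y(x_i)\big)^{2}} .
```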
Al-zboon, Habis Saad; Alrekebat, Amjad Farhan – International Journal of Higher Education, 2021
This study aims to identify the effect of the difficulty level of multiple-choice test items on the reliability coefficient and the standard error of measurement within item response theory (IRT). To achieve the objectives of the study, the WinGen3 software was used to generate the IRT parameters (difficulty, discrimination, guessing) for four…
Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Error of Measurement
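Two textbook expressions connect the quantities in this entry (standard forms, not the paper's own derivation): the classical standard error of measurement, and its IRT analogue based on test information:

```latex
% Classical SEM from reliability \rho_{XX'} and score SD \sigma_X;
% IRT conditional standard error from the test information I(\theta):
\mathrm{SEM} = \sigma_X\sqrt{1-\rho_{XX'}},
\qquad
\mathrm{SE}(\hat\theta) = \frac{1}{\sqrt{I(\theta)}} .
```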
Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020
A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, limited research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…
Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items
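For background on the Bookmark method discussed here (a textbook mapping; the RP = 0.67 criterion and D = 1 scaling are common conventions, not details from this article): the item at which a panelist places the bookmark maps to the ability at which that item is answered correctly with probability RP. For a 2PL item with discrimination a and difficulty b:

```latex
% Cut score implied by a bookmark placed at a 2PL item (a, b),
% using response probability criterion RP (commonly 0.67):
\theta_{\text{cut}} = b + \frac{1}{a}\,\ln\!\frac{RP}{1-RP} .
```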
Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021
Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and the standardized proportion-correct metric for DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…
Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis
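As background on the Mantel-Haenszel procedure named in this entry (standard formulas, not specific to this report): with reference-group counts A_k (right) and B_k (wrong), focal-group counts C_k (right) and D_k (wrong), and total N_k at matched score level k,

```latex
% MH common odds ratio and the ETS delta-scale DIF statistic:
\hat{\alpha}_{MH} =
  \frac{\sum_k A_k D_k / N_k}{\sum_k B_k C_k / N_k},
\qquad
\text{MH D-DIF} = -2.35\,\ln\hat{\alpha}_{MH} .
```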
Sahin, Melek Gulsah – International Journal of Assessment Tools in Education, 2020
Computer adaptive multistage testing (ca-MST), which takes advantage of computer technology and adaptive test forms, is widely used and is now a popular topic in assessment and evaluation. This study aims to analyze the effect of different panel designs, module lengths, and different sequences of the a-parameter value across stages and change in…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Response Theory
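To make the ca-MST terminology concrete, here is a minimal routing sketch, assuming a 2PL item pool, EAP provisional scoring, and illustrative cut points; the module labels, parameters, and panel layout are hypothetical, not the study's design:

```python
import numpy as np

def p_2pl(theta, a, b):
    """2PL probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def eap_theta(responses, a, b, grid=np.linspace(-4, 4, 81)):
    """EAP ability estimate under a standard-normal prior."""
    prior = np.exp(-0.5 * grid ** 2)
    like = np.ones_like(grid)
    for u, ai, bi in zip(responses, a, b):
        p = p_2pl(grid, ai, bi)
        like = like * (p if u else 1.0 - p)
    post = prior * like
    return float(np.sum(grid * post) / np.sum(post))

def next_module(theta_hat, cuts=(-0.5, 0.5)):
    """Route to the easy/medium/hard module of the next stage."""
    if theta_hat < cuts[0]:
        return "easy"
    if theta_hat < cuts[1]:
        return "medium"
    return "hard"

# After a 3-item routing module (hypothetical item parameters):
theta = eap_theta([1, 0, 1], a=[1.0, 1.2, 0.8], b=[0.0, -0.5, 0.5])
print(next_module(theta))
```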
Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020
In this study, the efficiency of various random sampling methods for reducing the number of items rated by judges in an Angoff standard-setting study was examined, and the methods were compared with one another. First, the full-length test was formed by combining the Placement Test 2012 and 2013 mathematics subsets. Then, simple random sampling…
Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement
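A minimal illustration of the idea (fabricated ratings; not the study's data or exact procedure): draw a simple random sample of items, sum the mean Angoff ratings over the sample, and rescale to the full test length:

```python
import random

random.seed(42)

# Hypothetical mean Angoff ratings (expected proportion correct for a
# minimally competent examinee), one per item of a 60-item test.
full_ratings = [random.uniform(0.2, 0.9) for _ in range(60)]
full_cut = sum(full_ratings)  # Angoff cut score on the raw-score scale

# Have judges rate only a simple random sample of 20 items, then rescale.
sample = random.sample(full_ratings, 20)
sampled_cut = sum(sample) * len(full_ratings) / len(sample)

print(f"full-test cut score:  {full_cut:.2f}")
print(f"20-item estimate:     {sampled_cut:.2f}")
```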
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy of a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
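For reference, the small-sample method named here, chained linear equating through an anchor test A, takes the textbook form (population 1 takes X and A, population 2 takes Y and A):

```latex
% Step 1: put X on the anchor scale in population 1; step 2: put the
% anchor score on the Y scale in population 2.
l_A(x) = \mu_{A1} + \frac{\sigma_{A1}}{\sigma_{X1}}\,(x-\mu_{X1}),
\qquad
e_Y(x) = \mu_{Y2} + \frac{\sigma_{Y2}}{\sigma_{A2}}\,
  \big(l_A(x)-\mu_{A2}\big) .
```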
Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021
This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…
Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods
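A minimal sketch of the invariance test underlying this approach (standard MGCFA notation, not the authors' own equations): DIF for item i is examined by testing whether its loading and intercept can be constrained equal across the focal (F) and reference (R) groups:

```latex
% Single-factor MGCFA model per group g, with intercept \tau, loading
% \lambda, common factor \xi, and unique factor \delta:
x_i^{(g)} = \tau_i^{(g)} + \lambda_i^{(g)}\,\xi^{(g)} + \delta_i^{(g)},
\qquad
H_0:\ \lambda_i^{(F)}=\lambda_i^{(R)},\ \ \tau_i^{(F)}=\tau_i^{(R)} .
```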
Lions, Séverin; Dartnell, Pablo; Toledo, Gabriela; Godoy, María Inés; Córdova, Nora; Jiménez, Daniela; Lemarié, Julie – Educational and Psychological Measurement, 2023
Even though the impact of the position of response options on answers to multiple-choice items has been investigated for decades, it remains debated. Research on this topic is inconclusive, perhaps because too few studies have obtained experimental data from large-sized samples in a real-world context and have manipulated the position of both…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Responses
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis
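To make the rapid-guessing construct concrete, here is one common flagging rule based on item response times, a normative time threshold (10% of the item's median time, a variant of the "NT10" rule); the threshold fraction and times are assumptions for illustration, not the study's conditions:

```python
import statistics

def rg_flags(response_times, fraction=0.10):
    """Flag responses faster than fraction * median item time as RG."""
    threshold = fraction * statistics.median(response_times)
    return [t < threshold for t in response_times]

times = [42.0, 3.1, 55.7, 2.4, 38.9, 61.2]  # seconds on one item
print(rg_flags(times))  # [False, True, False, True, False, False]
```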
Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019
The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard three-parameter logistic (3PL) model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…
Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level
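The 3PL model mentioned in this entry has the standard form (with scaling constant D, often set to 1.7):

```latex
% 3PL: probability of a correct response given ability \theta, with
% discrimination a_i, difficulty b_i, and pseudo-chance c_i:
P_i(\theta) = c_i + \frac{1-c_i}{1+\exp\!\big(-D\,a_i(\theta-b_i)\big)} .
```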
FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions
Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018
This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…
Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level
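The recovery criteria named in this entry are conventionally computed over R simulation replications against the generating (true) parameter (a standard definition, not quoted from the article):

```latex
% Bias and mean square error of an estimator \hat\delta of \delta:
\mathrm{Bias}(\hat\delta) = \frac{1}{R}\sum_{r=1}^{R}(\hat\delta_r-\delta),
\qquad
\mathrm{MSE}(\hat\delta) = \frac{1}{R}\sum_{r=1}^{R}(\hat\delta_r-\delta)^2 .
```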
Istiyono, Edi; Dwandaru, Wipsar Sunu Brams; Lede, Yulita Adelfin; Rahayu, Farida; Nadapdap, Amipa – International Journal of Instruction, 2019
The objective of this study was to develop a physics critical thinking skills test using a computerized adaptive test (CAT) based on item response theory (IRT). This was development research using the 4-D model (define, design, develop, and disseminate). The content validity of the items was established using Aiken's V. The test trial involved 252 students…
Descriptors: Critical Thinking, Thinking Skills, Cognitive Tests, Physics
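Aiken's V, used for the content-validity evidence in this entry, is conventionally computed from n raters' ratings r_i on a c-point scale with lowest category l (a standard formula, not quoted from the article):

```latex
% Aiken's V content-validity coefficient; V ranges from 0 to 1:
V = \frac{\sum_{i=1}^{n}(r_i - l)}{n\,(c-1)} .
```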