Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 26 |
Since 2006 (last 20 years) | 58 |
Descriptor
Difficulty Level | 58 |
Error of Measurement | 58 |
Test Items | 46 |
Item Response Theory | 36 |
Comparative Analysis | 14 |
Computation | 12 |
Statistical Analysis | 12 |
Equated Scores | 11 |
Foreign Countries | 11 |
Sample Size | 10 |
Simulation | 10 |
Author
Alonzo, Julie | 6 |
Tindal, Gerald | 6 |
Finch, Holmes | 3 |
Paek, Insu | 3 |
Schoen, Robert C. | 3 |
Sinharay, Sandip | 3 |
Yang, Xiaotong | 3 |
Curley, Edward | 2 |
Feigenbaum, Miriam | 2 |
Holland, Paul | 2 |
Liu, Jinghua | 2 |
Publication Type
Reports - Research | 45 |
Journal Articles | 44 |
Reports - Evaluative | 9 |
Numerical/Quantitative Data | 7 |
Reports - Descriptive | 3 |
Tests/Questionnaires | 2 |
Dissertations/Theses -… | 1 |
Speeches/Meeting Papers | 1 |
Location
Austria | 1 |
Belgium | 1 |
Canada | 1 |
Chile | 1 |
Cyprus | 1 |
Florida | 1 |
Germany | 1 |
Indonesia | 1 |
Japan | 1 |
Luxembourg | 1 |
New Jersey | 1 |
Assessments and Surveys
SAT (College Admission Test) | 2 |
Cognitive Assessment System | 1 |
National Assessment of… | 1 |
Program for International… | 1 |
Progress in International… | 1 |
Trends in International… | 1 |
Wechsler Intelligence Scale… | 1 |
Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study uses an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision under the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
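For context, the Masters' partial credit model named in this entry is usually written (generic notation, not taken from the study) as
P(X_i = x \mid \theta) = \frac{\exp\left[\sum_{k=0}^{x}(\theta - \delta_{ik})\right]}{\sum_{r=0}^{m_i}\exp\left[\sum_{k=0}^{r}(\theta - \delta_{ik})\right]}, \qquad x = 0,\dots,m_i,
with the convention that the sum for k = 0 is zero; the step parameters \delta_{ik} are what the sample size must support estimating.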
Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine the effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE), using both real and simulated data. Chained kernel equating, poststratification kernel equating, and circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
Akin-Arikan, Çigdem; Gelbal, Selahattin – Eurasian Journal of Educational Research, 2021
Purpose: This study aims to compare the performances of Item Response Theory (IRT) equating and kernel equating (KE) methods based on equating errors (RMSD) and standard error of equating (SEE) using the anchor item nonequivalent groups design. Method: Within this scope, a set of conditions, including ability distribution, type of anchor items…
Descriptors: Equated Scores, Item Response Theory, Test Items, Statistical Analysis
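As a point of reference, simulation studies of equating commonly define these criteria at each score point x over R replications (a generic formulation, not the study's own notation):
\text{SEE}(x) = \sqrt{\tfrac{1}{R}\sum_{r=1}^{R}\left(\hat e_r(x) - \bar e(x)\right)^2}, \qquad \text{RMSD}(x) = \sqrt{\tfrac{1}{R}\sum_{r=1}^{R}\left(\hat e_r(x) - e(x)\right)^2},
where e(x) is the criterion equating function and \bar e(x) is the mean of the estimates.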
Al-zboon, Habis Saad; Alrekebat, Amjad Farhan – International Journal of Higher Education, 2021
This study aims to identify the effect of multiple-choice test items' difficulty level on the reliability coefficient and the standard error of measurement within the framework of item response theory (IRT). To achieve the objectives of the study, the WinGen3 software was used to generate the IRT parameters (difficulty, discrimination, guessing) for four…
Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Error of Measurement
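In IRT, the quantities examined here follow from test information: the conditional standard error of measurement is
\text{SEM}(\theta) = \frac{1}{\sqrt{I(\theta)}}, \qquad I(\theta) = \sum_i I_i(\theta),
and under the three-parameter logistic model each item contributes I_i(\theta) = a_i^2\,\frac{Q_i(\theta)}{P_i(\theta)}\left(\frac{P_i(\theta)-c_i}{1-c_i}\right)^2, which is how the generated difficulty, discrimination, and guessing parameters enter the error of measurement (standard formulas, not quoted from the article).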
Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020
A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, little research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…
Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items
Koçak, Duygu – International Journal of Progressive Education, 2020
The aim of this study was to determine the effect of chance success on test equating. For this purpose, artificially generated data sets with sample sizes of 500 and 1,000 were equated using linear equating and equipercentile equating methods. In the simulated data, a total of four cases were created with no…
Descriptors: Test Theory, Equated Scores, Error of Measurement, Sample Size
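The two equating methods named in this entry are conventionally defined as follows (standard definitions, not the study's wording): linear equating sets
l_Y(x) = \mu_Y + \frac{\sigma_Y}{\sigma_X}(x - \mu_X),
while equipercentile equating sets e_Y(x) = F_Y^{-1}\!\left(F_X(x)\right), matching the cumulative score distributions of the two forms.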
Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021
Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is based on observed scores, such as the Mantel-Haenszel (MH) procedure and the standardized proportion-correct metric; the other is based on latent ability, in which the statistic is a measure of departure from…
Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis
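For reference, the observed-score MH procedure mentioned here pools 2 x 2 tables over score strata k (counts A_k, B_k for the reference group correct/incorrect and C_k, D_k for the focal group) and is often reported in the ETS delta metric:
\hat\alpha_{MH} = \frac{\sum_k A_k D_k / N_k}{\sum_k B_k C_k / N_k}, \qquad \text{MH D-DIF} = -2.35\,\ln\hat\alpha_{MH}
(a standard formulation; the particular weighting studied in the report is not reproduced here).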
Joo, Seang-Hwane; Ferron, John M.; Moeyaert, Mariola; Beretvas, S. Natasha; Van den Noortgate, Wim – Journal of Experimental Education, 2019
Multilevel modeling has been utilized for combining single-case experimental design (SCED) data assuming simple level-1 error structures. The purpose of this study is to compare various multilevel analysis approaches for handling potential complexity in the level-1 error structure within SCED data, including approaches assuming simple and complex…
Descriptors: Hierarchical Linear Modeling, Synthesis, Data Analysis, Accuracy
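One common example of the "complex" level-1 error structure at issue is a first-order autoregressive process for the within-case residuals (an illustrative case, not necessarily the exact structures compared in the article):
e_t = \rho\, e_{t-1} + u_t, \qquad u_t \sim N(0, \sigma_u^2),
where the "simple" structure corresponds to \rho = 0 with a constant residual variance.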
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
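A sketch of why items play a "more complicated role": if the cut score is the mean Angoff rating over n_j judges and n_i items in a crossed design, a generalizability-theory decomposition (assumed here for illustration, not quoted from the article) gives an error variance of roughly
\sigma^2(\hat c) \approx \frac{\sigma^2_{j}}{n_j} + \frac{\sigma^2_{ji,e}}{n_j n_i} \ \text{(items fixed)}, \qquad \sigma^2(\hat c) \approx \frac{\sigma^2_{i}}{n_i} + \frac{\sigma^2_{j}}{n_j} + \frac{\sigma^2_{ji,e}}{n_j n_i} \ \text{(items random)},
so whether item variance contributes depends on how the item facet is treated.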
Sahin, Melek Gulsah – International Journal of Assessment Tools in Education, 2020
Computer adaptive multistage testing (ca-MST), which takes advantage of computer technology and adaptive test forms, is widely used and is now a popular topic in assessment and evaluation. This study aims to analyze the effect of different panel designs, module lengths, and different sequences of the a-parameter values across stages and change in…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Response Theory
Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020
In this study, the efficiency of various random sampling methods for reducing the number of items rated by judges in an Angoff standard-setting study was examined, and the methods were compared with each other. First, the full-length test was formed by combining the mathematics subsets of the 2012 and 2013 Placement Tests. Then, simple random sampling…
Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy of a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021
This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…
Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods
Lions, Séverin; Dartnell, Pablo; Toledo, Gabriela; Godoy, María Inés; Córdova, Nora; Jiménez, Daniela; Lemarié, Julie – Educational and Psychological Measurement, 2023
Even though the impact of the position of response options on answers to multiple-choice items has been investigated for decades, it remains debated. Research on this topic is inconclusive, perhaps because too few studies have obtained experimental data from large-sized samples in a real-world context and have manipulated the position of both…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Responses
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis