ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	60
Since 2006 (last 20 years)	125

Descriptor

Difficulty Level	177
Statistical Analysis	177
Test Items	177
Item Analysis	60
Item Response Theory	59
Foreign Countries	45
Test Construction	44
Multiple Choice Tests	35
Comparative Analysis	30
Scores	27
Test Validity	27
Test Reliability	25
Correlation	23
Models	21
English (Second Language)	20
Test Bias	20
Achievement Tests	19
Language Tests	19
Mathematics Tests	19
Computation	18
Equated Scores	18
Goodness of Fit	17
Mathematical Models	17
College Entrance Examinations	16
Second Language Learning	16
More ▼

Publication Type

Reports - Research	141
Journal Articles	122
Reports - Evaluative	19
Speeches/Meeting Papers	16
Tests/Questionnaires	13
Numerical/Quantitative Data	8
Reports - Descriptive	5
Dissertations/Theses -…	3
Information Analyses	3
Guides - Non-Classroom	2
Books	1
Collected Works - General	1
Guides - General	1
Opinion Papers	1
Reports - General	1
More ▼

Education Level

Higher Education	35
Postsecondary Education	27
Secondary Education	26
Middle Schools	18
Elementary Education	16
Junior High Schools	13
Grade 8	9
High Schools	9
Elementary Secondary Education	6
Grade 5	6
Grade 7	6
Grade 6	4
Grade 9	4
Intermediate Grades	4
Grade 4	3
Early Childhood Education	2
Grade 12	2
Grade 2	2
Grade 3	2
Primary Education	2
Grade 1	1
Grade 10	1
Grade 11	1
Kindergarten	1
More ▼

Audience

Researchers	4
Practitioners	1
Teachers	1

Location

Australia	7
Germany	4
Turkey	4
Canada	3
Japan	3
Minnesota	3
Austria	2
Belgium	2
California	2
Colorado	2
France	2
Greece	2
India	2
Indiana	2
Kansas	2
Massachusetts	2
Michigan	2
Nigeria	2
Ohio	2
Oregon	2
South Africa	2
United Kingdom (England)	2
Vermont	2
Alabama	1
Brazil	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	7
SAT (College Admission Test)	6
Graduate Record Examinations	4
Trends in International…	4
National Assessment of…	3
Program for International…	3
Defining Issues Test	1
Iowa Tests of Basic Skills	1
Law School Admission Test	1
Progress in International…	1
Raven Advanced Progressive…	1
Stanford Binet Intelligence…	1
Test of English for…	1
United States Medical…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 177 results Save | Export

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

Detecting Local Dependence: A Threshold-Autoregressive Item Response Theory (TAR-IRT) Approach for Polytomous Items

Peer reviewed

Direct link

Tang, Xiaodan; Karabatsos, George; Chen, Haiqin – Applied Measurement in Education, 2020

In applications of item response theory (IRT) models, it is known that empirical violations of the local independence (LI) assumption can significantly bias parameter estimates. To address this issue, we propose a threshold-autoregressive item response theory (TAR-IRT) model that additionally accounts for order dependence among the item responses…

Descriptors: Item Response Theory, Test Items, Models, Computation

A Comparison of Kernel Equating and Item Response Theory Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Akin-Arikan, Çigdem; Gelbal, Selahattin – Eurasian Journal of Educational Research, 2021

Purpose: This study aims to compare the performances of Item Response Theory (IRT) equating and kernel equating (KE) methods based on equating errors (RMSD) and standard error of equating (SEE) using the anchor item nonequivalent groups design. Method: Within this scope, a set of conditions, including ability distribution, type of anchor items…

Descriptors: Equated Scores, Item Response Theory, Test Items, Statistical Analysis

How Useful Is Comparative Judgement of Item Difficulty for Standard Maintaining?

Download full text

Benton, Tom – Research Matters, 2020

This article reviews the evidence on the extent to which experts' perceptions of item difficulties, captured using comparative judgement, can predict empirical item difficulties. This evidence is drawn from existing published studies on this topic and also from statistical analysis of data held by Cambridge Assessment. Having reviewed the…

Descriptors: Test Items, Difficulty Level, Expertise, Comparative Analysis

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

Bayesian Estimation and Testing of a Linear Logistic Test Model for Learning during the Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021

The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…

Descriptors: Bayesian Statistics, Computation, Learning, Testing

Optimizing Practice Scheduling Requires Quantitative Tracking of Individual Item Performance

Peer reviewed
PDF on ERIC

Download full text

Direct link

Luke G. Eglington; Philip I. Pavlik – Grantee Submission, 2020

Decades of research has shown that spacing practice trials over time can improve later memory, but there are few concrete recommendations concerning how to optimally space practice. We show that existing recommendations are inherently suboptimal due to their insensitivity to time costs and individual- and item-level differences. We introduce an…

Descriptors: Scheduling, Drills (Practice), Memory, Testing

Optimizing Practice Scheduling Requires Quantitative Tracking of Individual Item Performance

Peer reviewed

Direct link

Luke G. Eglington; Philip I. Pavlik Jr. – npj Science of Learning, 2020

Descriptors: Scheduling, Drills (Practice), Memory, Testing

Does Comparative Judgement of Scripts Provide an Effective Means of Maintaining Standards in Mathematics? Research Report

Download full text

Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020

In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al (2020) describes a method…

Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level

Improvement of Norm Score Quality via Regression-Based Continuous Norming

Peer reviewed

Direct link

Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021

The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…

Descriptors: Test Norms, Scores, Regression (Statistics), Test Items

Effects of Test Level Discrimination and Difficulty on Answer-Copying Indices

Peer reviewed
PDF on ERIC

Download full text

Sunbul, Onder; Yormaz, Seha – International Journal of Evaluation and Research in Education, 2018

In this study Type I Error and the power rates of omega (?) and GBT (generalized binomial test) indices were investigated for several nominal alpha levels and for 40 and 80-item test lengths with 10,000-examinee sample size under several test level restrictions. As a result, Type I error rates of both indices were found to be below the acceptable…

Descriptors: Difficulty Level, Cheating, Duplication, Test Length

Investigating the Performance of Omega Index According to Item Parameters and Ability Levels

Peer reviewed
PDF on ERIC

Download full text

Sunbul, Onder; Yormaz, Seha – Eurasian Journal of Educational Research, 2018

Purpose: Several studies can be found in the literature that investigate the performance of ? under various conditions. However no study for the effects of item difficulty, item discrimination, and ability restrictions on the performance of ? could be found. The current study aims to investigate the performance of ? for the conditions given below.…

Descriptors: Test Items, Difficulty Level, Ability, Cheating

An Empirical Study for the Statistical Adjustment of Rater Bias

Peer reviewed
PDF on ERIC

Download full text

Ilhan, Mustafa – International Journal of Assessment Tools in Education, 2019

This study investigated the effectiveness of statistical adjustments applied to rater bias in many-facet Rasch analysis. Some changes were first made in the dataset that did not include "rater × examinee" bias to cause to have "rater × examinee" bias. Later, bias adjustment was applied to rater bias included in the data file,…

Descriptors: Statistical Analysis, Item Response Theory, Evaluators, Bias

The Effect of Mini and Midi Anchor Tests on Test Equating

Peer reviewed
PDF on ERIC

Download full text

Arikan, Çigdem Akin – International Journal of Progressive Education, 2018

The main purpose of this study is to compare the test forms to the midi anchor test and the mini anchor test performance based on item response theory. The research was conducted with using simulated data which were generated based on Rasch model. In order to equate two test forms the anchor item nonequivalent groups (internal anchor test) was…

Descriptors: Equated Scores, Comparative Analysis, Item Response Theory, Tests

Is the Factor Observed in Investigations on the Item-Position Effect Actually the Difficulty Factor?

Peer reviewed

Direct link

Schweizer, Karl; Troche, Stefan – Educational and Psychological Measurement, 2018

In confirmatory factor analysis quite similar models of measurement serve the detection of the difficulty factor and the factor due to the item-position effect. The item-position effect refers to the increasing dependency among the responses to successively presented items of a test whereas the difficulty factor is ascribed to the wide range of…

Descriptors: Investigations, Difficulty Level, Factor Analysis, Models

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

ETS Research Report Series	15
Educational and Psychological…	12
Applied Psychological…	5
Language Testing	5
Applied Measurement in…	4
Behavioral Research and…	4
International Journal of…	4
Practical Assessment,…	4
CBE - Life Sciences Education	3
Educational Testing Service	3
International Journal of…	3
Journal of Educational…	3
ProQuest LLC	3
Universal Journal of…	3
African Journal of Research…	2
Chemistry Education Research…	2
Eurasian Journal of…	2
Grantee Submission	2
International Journal of…	2
Journal of Education and…	2
Journal of Educational…	2
Journal of Educational and…	2
Journal of Psychoeducational…	2
Online Submission	2
Psychometrika	2
More ▼

Tindal, Gerald	4
Alonzo, Julie	3
Livingston, Samuel A.	3
Sinharay, Sandip	3
Baird, Jo-Anne	2
Bejar, Isaac I.	2
Benton, Tom	2
Bernholt, Sascha	2
DeMars, Christine E.	2
Feigenbaum, Miriam	2
Futagi, Yoko	2
Graf, Edith Aurora	2
Guo, Hongwen	2
Kostin, Irene	2
Lawless, René	2
Legg, Sue M.	2
Liu, Jinghua	2
Liu, Kimy	2
Long, Caroline	2
Luke G. Eglington	2
Oh, Hyeonjoo J.	2
Parchmann, Ilka	2
Rudner, Lawrence M.	2
Sunbul, Onder	2
More ▼