ERIC - Search Results

Showing 1 to 15 of 209 results

Seeking the Real Reliability: Why the Traditional Estimators of Reliability Usually Fail in Achievement Testing and Why the Deflation-Corrected Coefficients Could Be Better Options

Peer reviewed

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023

Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…

Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
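For orientation, coefficient alpha, one of the traditional estimators named in the abstract above, can be sketched as follows. This is a minimal illustration only: the response matrix is made up, the function name is hypothetical, and the deflation-corrected coefficients discussed in the article are not implemented here.

```python
from statistics import pvariance

def coefficient_alpha(responses):
    """Coefficient (Cronbach's) alpha for a persons-by-items score matrix."""
    k = len(responses[0])  # number of items
    # Variance of each item column across persons.
    item_vars = [pvariance([row[i] for row in responses]) for i in range(k)]
    # Variance of the total (sum) score across persons.
    total_var = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical binary responses (rows: examinees, columns: items),
# with item difficulties that deviate widely from one another.
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]
alpha = coefficient_alpha(responses)
print(round(alpha, 3))
```

Population variances are used for both the items and the total score; using sample variances throughout would give the same ratio and therefore the same alpha.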

Parameters and Models of Item Response Theory (IRT): A Review of Literature

Peer reviewed

Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023

Introduction: Item response theory (IRT) has received much attention in validation of assessment instrument because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…

Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
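As a rough illustration of how item difficulty and discrimination enter an IRT model, the sketch below evaluates a two-parameter logistic (2PL) item response function. The choice of the 2PL model, the function name, and the parameter values are assumptions made for illustration; the truncated abstract does not say which models the review covers.

```python
from math import exp

def p_correct_2pl(theta, a, b):
    """Probability of a correct response under an assumed 2PL model.

    theta: examinee ability
    a:     item discrimination
    b:     item difficulty
    """
    return 1.0 / (1.0 + exp(-a * (theta - b)))

# Hypothetical items: same difficulty, different discrimination.
for a in (0.5, 1.5):
    print(a, round(p_correct_2pl(theta=1.0, a=a, b=0.0), 3))
```

When ability equals difficulty the probability is 0.5 regardless of discrimination; a larger discrimination makes the curve steeper around that point.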

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Does Timed Testing Affect the Interpretation of Efficiency Scores?--A GLMM Analysis of Reading Components

Peer reviewed

Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024

The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…

Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation

Argument-Based Validation of Chulalongkorn University Language Institute (CULI) Test: A Rasch-Based Evidence Investigation

Peer reviewed

Apichat Khamboonruang – Language Testing in Asia, 2025

Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…

Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of difficulty and discriminating values of binary items and test consisting of such items and find their relationships including estimation of test error variance and thereby the test reliability, as per definition using cosine similarities. The measures use entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Batelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

IRTrees for Skipping Items in PIRLS

Peer reviewed

Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024

In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…

Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment

A Novel Examination of None-of-the-Above as It Influences Examinee Item Responses

Thompson, Kathryn N. – ProQuest LLC, 2023

It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…

Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores

Better Remedies for Bad Exams: Correcting for Difficult Questions in a Fair and Systematic Way

Peer reviewed

Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022

Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…

Descriptors: College Students, Student Evaluation, Tests, Test Items

Does Question Order Matter on Online Math Assessments? A Big Data Analysis of Undergraduate Mathematics Final Exams

Peer reviewed

Gruss, Richard; Clemons, Josh – Journal of Computer Assisted Learning, 2023

Background: The sudden growth in online instruction due to COVID-19 restrictions has given renewed urgency to questions about remote learning that have remained unresolved. Web-based assessment software provides instructors an array of options for varying testing parameters, but the pedagogical impacts of some of these variations has yet to be…

Descriptors: Test Items, Test Format, Computer Assisted Testing, Mathematics Tests

The Impact of Cheating on Score Comparability via Pool-Based IRT Pre-Equating

Peer reviewed

Liu, Jinghua; Becker, Kirk – Journal of Educational Measurement, 2022

For any testing programs that administer multiple forms across multiple years, maintaining score comparability via equating is essential. With continuous testing and high-stakes results, especially with less secure online administrations, testing programs must consider the potential for cheating on their exams. This study used empirical and…

Descriptors: Cheating, Item Response Theory, Scores, High Stakes Tests

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level
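The two classical estimators named in the abstract above, the item-test correlation (Rit) and the item-rest correlation (Rir), can be sketched as plain Pearson correlations. The data and function names below are hypothetical, and Somers' D, the alternative the article proposes, is not implemented here.

```python
from math import sqrt

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)

def item_test_and_item_rest(responses, g):
    """Return (Rit, Rir) for item g in a persons-by-items score matrix."""
    item = [row[g] for row in responses]
    total = [sum(row) for row in responses]         # test score X, item g included
    rest = [t - i for t, i in zip(total, item)]     # score on the remaining items
    return pearson(item, total), pearson(item, rest)

# Hypothetical binary responses (rows: examinees, columns: items).
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]
rit, rir = item_test_and_item_rest(responses, g=1)
print(round(rit, 3), round(rir, 3))
```

In this example Rit exceeds Rir because the item's own score is part of the total it is correlated with; Rir removes that overlap by correlating the item with the rest score only.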

The Experimental Study of the Effect of Functional-Variational Factors on the Results of Linguistic Testing

Peer reviewed

Hryvko, Antonina V.; Zhuk, Yurii O. – Journal of Curriculum and Teaching, 2022

A feature of the presented study is a comprehensive approach to studying the reliability problem of linguistic testing results due to the several functional and variable factors impact. Contradictions and ambiguous views of scientists on the researched issues determine the relevance of this study. The article highlights the problem of equivalence…

Descriptors: Student Evaluation, Language Tests, Test Format, Test Items

Influence of Selected-Response Format Variants on Test Characteristics and Test-Taking Effort: An Empirical Study. Research Report. ETS RR-22-01

Peer reviewed

Guo, Hongwen; Rios, Joseph A.; Ling, Guangming; Wang, Zhen; Gu, Lin; Yang, Zhitong; Liu, Lydia O. – ETS Research Report Series, 2022

Different variants of the selected-response (SR) item type have been developed for various reasons (i.e., simulating realistic situations, examining critical-thinking and/or problem-solving skills). Generally, the variants of SR item format are more complex than the traditional multiple-choice (MC) items, which may be more challenging to test…

Descriptors: Test Format, Test Wiseness, Test Items, Item Response Theory
