ERIC - Search Results

Publication Date

In 2025	8
Since 2024	19
Since 2021 (last 5 years)	107
Since 2016 (last 10 years)	260
Since 2006 (last 20 years)	466

Descriptor

Difficulty Level	578
Item Response Theory	578
Test Items	457
Foreign Countries	158
Test Construction	109
Psychometrics	98
Models	95
Item Analysis	90
Comparative Analysis	88
Scores	78
Test Reliability	78
Multiple Choice Tests	77
Mathematics Tests	73
Statistical Analysis	72
Computer Assisted Testing	66
Correlation	61
Test Validity	58
Computation	54
Simulation	53
Test Format	51
Science Tests	49
Elementary School Students	46
Error of Measurement	45
Achievement Tests	44
Language Tests	44

Publication Type

Journal Articles	424
Reports - Research	421
Reports - Evaluative	100
Speeches/Meeting Papers	67
Dissertations/Theses -…	31
Numerical/Quantitative Data	23
Reports - Descriptive	20
Tests/Questionnaires	14
Information Analyses	6
Collected Works - Proceedings	3
ERIC Digests in Full Text	1
ERIC Publications	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1

Education Level

Higher Education	93
Secondary Education	92
Elementary Education	85
Postsecondary Education	81
Middle Schools	44
Junior High Schools	39
High Schools	35
Grade 8	31
Elementary Secondary Education	29
Early Childhood Education	23
Intermediate Grades	22
Primary Education	22
Grade 3	20
Grade 7	20
Grade 5	18
Grade 4	17
Grade 6	17
Grade 1	10
Grade 2	10
Kindergarten	9
Grade 9	8
Grade 12	7
Grade 10	5
Adult Education	3
Grade 11	2

Audience

Practitioners

Location

Turkey	18
Germany	14
Indonesia	11
Taiwan	9
United States	9
Australia	8
Nigeria	8
Canada	7
Florida	7
South Africa	6
Brazil	5
California	5
Japan	5
Belgium	4
China	4
Greece	4
Iran	4
Malaysia	4
Netherlands	4
South Korea	4
United Kingdom	4
Hong Kong	3
Illinois	3
Indiana	3
Israel	3

Showing 1 to 15 of 578 results

Embedding Embedded Standard Setting: An Application of Cross-Classified Item Response Theory. CRESST Report 876

Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025

This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…

Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to assess the accuracy of estimating multiple-choice test item parameters under item response theory models. Materials/methods: The researchers relied on measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

The Impact of Insufficient Effort Responses on the Order of Category Thresholds in the Polytomous Rasch Model

Peer reviewed

Kuan-Yu Jin; Thomas Eckes – Educational and Psychological Measurement, 2024

Insufficient effort responding (IER) refers to a lack of effort when answering survey or questionnaire items. Such items typically offer more than two ordered response categories, with Likert-type scales as the most prominent example. The underlying assumption is that the successive categories reflect increasing levels of the latent variable…

Descriptors: Item Response Theory, Test Items, Test Wiseness, Surveys

Parameters and Models of Item Response Theory (IRT): A Review of Literature

Peer reviewed

Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023

Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows students' ability to be estimated from any set of items. It also allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…

Descriptors: Item Response Theory, Models, Test Items, Difficulty Level

An Investigation of the Nature and Consequence of the Relationship between IRT Difficulty and Discrimination

Peer reviewed

Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022

The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…

Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Application of Two-Parameter Item Response Theory for Determining Form-Dependent Items on Exams Using Different Item Orders

Peer reviewed

Pentecost, Thomas C.; Raker, Jeffery R.; Murphy, Kristen L. – Practical Assessment, Research & Evaluation, 2023

Using multiple versions of an assessment has the potential to introduce item environment effects. These types of effects result in version dependent item characteristics (i.e., difficulty and discrimination). Methods to detect such effects and resulting implications are important for all levels of assessment where multiple forms of an assessment…

Descriptors: Item Response Theory, Test Items, Test Format, Science Tests

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model

Peer reviewed

Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…

Descriptors: True Scores, Equated Scores, Test Items, Sample Size

Supercharging BKT with Multidimensional Generalizable IRT and Skill Discovery

Peer reviewed

Mohammad M. Khajah – Journal of Educational Data Mining, 2024

Bayesian Knowledge Tracing (BKT) is a popular interpretable computational model in the educational mining community that can infer a student's knowledge state and predict future performance based on practice history, enabling tutoring systems to adaptively select exercises to match the student's competency level. Existing BKT implementations do…

Descriptors: Students, Bayesian Statistics, Intelligent Tutoring Systems, Cognitive Development

A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams

Peer reviewed

Lang, Joseph B. – Journal of Educational and Behavioral Statistics, 2023

This article is concerned with the statistical detection of copying on multiple-choice exams. As an alternative to existing permutation- and model-based copy-detection approaches, a simple randomization p-value (RP) test is proposed. The RP test, which is based on an intuitive match-score statistic, makes no assumptions about the distribution of…

Descriptors: Identification, Cheating, Multiple Choice Tests, Item Response Theory

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

IRTrees for Skipping Items in PIRLS

Peer reviewed

Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024

In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…

Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment

Meeting Students Where They Are: Using Rasch Modeling for Improving the Measurement of Active Research in Higher Education

Peer reviewed

Dahl, Laura S.; Staples, B. Ashley; Mayhew, Matthew J.; Rockenbach, Alyssa N. – Innovative Higher Education, 2023

Surveys with rating scales are often used in higher education research to measure student learning and development, yet testing and reporting on the longitudinal psychometric properties of these instruments is rare. Rasch techniques allow scholars to map item difficulty and individual aptitude on the same linear, continuous scale to compare…

Descriptors: Surveys, Rating Scales, Higher Education, Educational Research

Source

Educational and Psychological…	41
ProQuest LLC	31
Journal of Educational…	28
Applied Measurement in…	19
Applied Psychological…	18
ETS Research Report Series	18
Behavioral Research and…	16
Grantee Submission	14
International Journal of…	11
Online Submission	10
Psychometrika	10
International Journal of…	9
Journal of Educational and…	8
Language Testing	8
Practical Assessment,…	8
International Educational…	7
Journal of Psychoeducational…	6
Journal of Speech, Language,…	6
Language Assessment Quarterly	6
International Journal of…	5
African Journal of Research…	4
Assessment for Effective…	4
Assessment in Education:…	4
Educational Measurement:…	4
Eurasian Journal of…	4

Author

Tindal, Gerald	16
Alonzo, Julie	12
Anderson, Daniel	9
Park, Bitnara Jasmine	8
Paek, Insu	7
Irvin, P. Shawn	6
Petscher, Yaacov	6
Saven, Jessica L.	6
Schoen, Robert C.	6
Bulut, Okan	5
DeBoer, George E.	5
DeMars, Christine E.	5
Engelhard, George, Jr.	5
Herrmann-Abell, Cari F.	5
Finch, Holmes	4
Guo, Hongwen	4
He, Wei	4
Jin, Kuan-Yu	4
Liu, Kimy	4
Long, Caroline	4
Sinharay, Sandip	4
Wise, Steven L.	4
Yang, Xiaotong	4
Andrich, David	3

Assessments and Surveys

Program for International…	11
Trends in International…	8
Test of English as a Foreign…	6
Graduate Record Examinations	5
Advanced Placement…	3
International English…	3
National Assessment of…	3
Progress in International…	3
Raven Progressive Matrices	3
SAT (College Admission Test)	3
Measures of Academic Progress	2
Peabody Picture Vocabulary…	2
Raven Advanced Progressive…	2
Remote Associates Test	2
ACT Assessment	1
Child Behavior Checklist	1
Childrens Manifest Anxiety…	1
Connecticut Mastery Testing…	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
English Proficiency Test	1
Gates MacGinitie Reading Tests	1
General Aptitude Test Battery	1
Hidden Figures Test	1
Iowa Tests of Basic Skills	1