ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	41

Descriptor

Scores	97
Test Theory	97
Test Reliability	25
Comparative Analysis	23
Test Items	21
Correlation	15
Item Response Theory	15
Error of Measurement	14
Foreign Countries	14
Statistical Analysis	14
Psychometrics	13
Mathematical Models	12
Testing	12
Higher Education	11
Reliability	11
Criterion Referenced Tests	10
Item Analysis	10
Statistical Studies	10
Test Interpretation	10
Test Validity	10
Difficulty Level	9
Measurement Techniques	9
Test Construction	9
Latent Trait Theory	8
Testing Problems	8

Publication Type

Reports - Research	97
Journal Articles	64
Speeches/Meeting Papers	17
Tests/Questionnaires	2
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Opinion Papers	1

Education Level

Higher Education	10
Postsecondary Education	5
Secondary Education	5
Elementary Education	4
High Schools	3
Early Childhood Education	2
Grade 2	2
Grade 4	2
Primary Education	2
Grade 1	1
Grade 5	1
Grade 8	1
Grade 9	1
Kindergarten	1
Middle Schools	1

Audience

Researchers	6
Policymakers	1
Practitioners	1
Teachers	1

Location

Canada	2
Indonesia	2
Texas	2
Turkey (Ankara)	2
United States	2
Chile	1
China	1
Europe	1
Florida	1
Hawaii	1
Hong Kong	1
Nigeria	1
Pennsylvania	1
Sweden	1
Turkey	1
United Kingdom (Great Britain)	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

SAT (College Admission Test)	4
Childrens Depression Inventory	2
Comprehensive Tests of Basic…	2
ACT Assessment	1
Alabama High School…	1
Armed Services Vocational…	1
California Achievement Tests	1
Program for International…	1
Stanford Early School…	1
Strengths and Difficulties…	1
Student Descriptive…	1
Test of English as a Foreign…	1
Youth Risk Behavior Survey	1

Showing 1 to 15 of 97 results

Latent Trait Item Response Models for Continuous Responses

Peer reviewed

Gerhard Tutz; Pascal Jordan – Journal of Educational and Behavioral Statistics, 2024

A general framework of latent trait item response models for continuous responses is given. In contrast to classical test theory (CTT) models, which traditionally distinguish between true scores and error scores, the responses are clearly linked to latent traits. It is shown that CTT models can be derived as special cases, but the model class is…

Descriptors: Item Response Theory, Responses, Scores, Models

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of difficulty and discriminating values for binary items and for tests consisting of such items, and finds their relationships, including estimation of test error variance and thereby test reliability, as per definition using cosine similarities. The measures use the entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Assessing the Fairness of Mathematical Literacy Test in Indonesia: Evidence from Gender-Based Differential Item Function Analysis

Peer reviewed

Kartianom Kartianom; Heri Retnawati; Kana Hidayati – Journal of Pedagogical Research, 2024

Conducting a fair test is important for educational research. Unfair assessments can lead to gender disparities in academic achievement, ultimately resulting in disparities in opportunities, wages, and career choice. Differential Item Function [DIF] analysis is presented to provide evidence of whether the test is truly fair, where it does not harm…

Descriptors: Foreign Countries, Test Bias, Item Response Theory, Test Theory

Comparison of Classical Test Theory vs. Multi-Facet Rasch Theory

Peer reviewed

Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022

Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…

Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness

Estimating Treatment Effects with the Explanatory Item Response Model. EdWorkingPaper No. 22-677

Joshua B. Gilbert – Annenberg Institute for School Reform at Brown University, 2022

This simulation study examines the characteristics of the Explanatory Item Response Model (EIRM) when estimating treatment effects when compared to classical test theory (CTT) sum and mean scores and item response theory (IRT)-based theta scores. Results show that the EIRM and IRT theta scores provide generally equivalent bias and false positive…

Descriptors: Item Response Theory, Models, Test Theory, Computation

Comparison of Performance Measures Obtained from Foreign Language Tests According to Item Response Theory vs Classical Test Theory

Peer reviewed

Polat, Murat – International Online Journal of Education and Teaching, 2022

Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…

Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests

Invariance Person Estimate of Basic Education Certificate Examination: Classical Test Theory and Item Response Theory Scoring Perspective

Peer reviewed

Ayanwale, Musa Adekunle; Adeleke, Joshua Oluwatoyin; Mamadelo, Titilayo Iyabode – Journal of the International Society for Teacher Education, 2019

A scoring framework that does not reflect true performance of an examinee would ultimately result in an abnormal score. This study assessed invariance person estimates of 2017 Nigerian National Examinations Council Basic Education Certificate Examination Mathematics Multiple Choice using classical test theory (CTT) and item response theory (IRT)…

Descriptors: Test Theory, Item Response Theory, Scoring, National Competency Tests

Do Students Rapidly Guess Repeatedly over Time? A Longitudinal Analysis of Student Test Disengagement, Background, and Attitudes

Peer reviewed

Soland, James; Kuhfeld, Megan – Educational Assessment, 2019

Considerable research has examined the use of rapid guessing measures to identify disengaged item responses. However, little is known about students who rapidly guess over the course of several tests. In this study, we use achievement test data from six administrations over three years to investigate whether rapid guessing is a stable trait-like…

Descriptors: Testing, Guessing (Tests), Reaction Time, Achievement Tests

Using Generalizability Theory to Assess the Score Reliability of Communication Skills of Dentistry Students

Peer reviewed

Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018

The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…

Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study was to determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Peer reviewed

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores

"TechCheck": Development and Validation of an Unplugged Assessment of Computational Thinking in Early Childhood Education

Peer reviewed

Relkin, Emily; de Ruiter, Laura; Bers, Marina Umaschi – Journal of Science Education and Technology, 2020

There is a need for developmentally appropriate Computational Thinking (CT) assessments that can be implemented in early childhood classrooms. We developed a new instrument called "TechCheck" for assessing CT skills in young children that does not require prior knowledge of computer programming. "TechCheck" is based on…

Descriptors: Developmentally Appropriate Practices, Computation, Thinking Skills, Early Childhood Education

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

Source

Educational and Psychological…	13
Journal of Educational…	5
Psychometrika	3
ETS Research Report Series	2
Educational Assessment	2
Journal of Educational and…	2
Journal of Experimental…	2
Language Testing	2
Annenberg Institute for…	1
Asian Journal of Education…	1
Assessment for Effective…	1
Behavioral Disorders	1
Biochemistry and Molecular…	1
College Board	1
Dyslexia	1
Educational Measurement:…	1
Educational Sciences: Theory…	1
ICHPER-SD Journal of Research	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Online Journal…	1
Journal of Adolescence	1
Journal of Computers in…	1

Author

Haberman, Shelby J.	4
Zimmerman, Donald W.	4
Sinharay, Sandip	3
Crowley, Susan L.	2
Polat, Murat	2
Price, Gary G.	2
Yen, Wendy M.	2
Acevedo, Daniela	1
Adeleke, Joshua Oluwatoyin	1
Adler, Nurit	1
Aktas, Mehtap	1
Aleamoni, Lawrence, M.	1
Asiret, Semih	1
Ayanwale, Musa Adekunle	1
Bailey, Janelle M.	1
Balch, William R.	1
Bandalos, Deborah L.	1
Banerji, Madhabi	1
Barton-Weston, Heather M.	1
Beaujean, A. Alexander	1
Bers, Marina Umaschi	1
Blixt, Sonya L.	1
Boldt, R. F.	1
Bormuth, John R.	1