Publication Date
In 2025 | 3
Since 2024 | 9
Since 2021 (last 5 years) | 29
Since 2016 (last 10 years) | 86
Since 2006 (last 20 years) | 208
Descriptor
Test Theory | 615
Test Items | 166
Test Reliability | 148
Test Validity | 127
Foreign Countries | 116
Test Construction | 115
Item Analysis | 101
Scores | 97
Item Response Theory | 95
Mathematical Models | 93
Latent Trait Theory | 87
Author
Zimmerman, Donald W. | 8
Wilcox, Rand R. | 7
Haladyna, Tom | 6
Yen, Wendy M. | 6
Dorans, Neil J. | 4
Haberman, Shelby J. | 4
Roid, Gale | 4
van der Linden, Wim J. | 4
Ackerman, Terry A. | 3
Andrich, David | 3
Cliff, Norman | 3
Audience
Researchers | 61
Practitioners | 13
Teachers | 7
Administrators | 1
Policymakers | 1
Location
Canada | 11
Australia | 10
Turkey | 10
United States | 9
United Kingdom (England) | 8
Spain | 5
Texas | 5
Florida | 4
New York | 4
Taiwan | 4
Tennessee | 4
Laws, Policies, & Programs
Individuals with Disabilities… | 2
No Child Left Behind Act 2001 | 2
Gerhard Tutz; Pascal Jordan – Journal of Educational and Behavioral Statistics, 2024
A general framework of latent trait item response models for continuous responses is given. In contrast to classical test theory (CTT) models, which traditionally distinguish between true scores and error scores, the responses are clearly linked to latent traits. It is shown that CTT models can be derived as special cases, but the model class is…
Descriptors: Item Response Theory, Responses, Scores, Models
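As a rough illustration of how continuous responses tied to a latent trait reduce to the CTT true-score-plus-error decomposition, here is a minimal simulation sketch; the linear link and all parameter values are assumptions for illustration, not the authors' model.

```python
import numpy as np

rng = np.random.default_rng(0)
n_persons, n_items = 1000, 10

theta = rng.normal(size=n_persons)          # latent trait
a = rng.uniform(0.5, 1.5, size=n_items)     # illustrative item loadings
b = rng.uniform(-0.5, 0.5, size=n_items)    # illustrative item intercepts

# Continuous response X_ij = b_j + a_j * theta_i + e_ij; under CTT the
# true score is the error-free part: T = E[X | theta], E = X - T.
T = b + np.outer(theta, a)
X = T + rng.normal(scale=0.8, size=(n_persons, n_items))
E = X - T
print(round(float(np.corrcoef(T[:, 0], E[:, 0])[0, 1]), 3))  # ~ 0 by construction
```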
Kentaro Fukushima; Nao Uchida; Kensuke Okada – Journal of Educational and Behavioral Statistics, 2025
Diagnostic tests are typically administered in a multiple-choice (MC) format due to their advantages of objectivity and time efficiency. The MC-deterministic input, noisy "and" gate (DINA) family of models, a representative class of cognitive diagnostic models for MC items, efficiently and parsimoniously estimates the mastery profiles of…
Descriptors: Diagnostic Tests, Cognitive Measurement, Multiple Choice Tests, Educational Assessment
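The DINA response function mentioned in the abstract is compact enough to sketch directly; the Q-matrix row, guess, and slip values below are illustrative assumptions.

```python
import numpy as np

def dina_prob(alpha, q_row, guess, slip):
    """P(correct) under DINA: the latent response is 1 only if the examinee
    masters every attribute the item requires (its Q-matrix row)."""
    eta = float(np.all(alpha >= q_row))
    return eta * (1.0 - slip) + (1.0 - eta) * guess

q_row = np.array([1, 0, 1])  # item requires attributes 0 and 2
print(dina_prob(np.array([1, 1, 1]), q_row, guess=0.2, slip=0.1))  # 0.9
print(dina_prob(np.array([1, 1, 0]), q_row, guess=0.2, slip=0.1))  # 0.2
```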
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
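Haberman's criterion is usually presented as a comparison of proportional reductions in mean squared error (PRMSE): report a subscore only if it predicts its true subscore better than the total score does. The sketch below follows one common CTT formulation and is an assumption-laden illustration, not the paper's exact derivation.

```python
import numpy as np

def haberman_check(sub, total, rel_sub):
    """sub/total: observed subscores and total scores; rel_sub: an external
    reliability estimate for the subscore (e.g., coefficient alpha)."""
    var_s = np.var(sub, ddof=1)
    var_x = np.var(total, ddof=1)
    cov_sx = np.cov(sub, total)[0, 1]
    var_true_s = rel_sub * var_s            # Var(T_s) under CTT
    var_err_s = (1.0 - rel_sub) * var_s     # Var(E_s)
    prmse_sub = rel_sub                     # observed subscore as estimator of T_s
    # The subscore's error is part of the total's error, hence the correction.
    prmse_tot = (cov_sx - var_err_s) ** 2 / (var_true_s * var_x)
    return prmse_sub > prmse_tot            # True: subscore has added value
```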
Soysal, Sumeyra; Yilmaz Kogar, Esin – International Journal of Assessment Tools in Education, 2022
A testlet comprises a set of items based on a common stimulus. When testlets are used in a test, the local independence assumption may be violated, and in that case it is not appropriate to apply traditional item response theory models to tests that include testlets. When the testlet is discussed, one of the most…
Descriptors: Test Items, Test Theory, Models, Sample Size
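A common remedy in the testlet literature is to add a person-by-testlet random effect to the item response function, so the shared-stimulus dependence is modeled rather than ignored; a minimal 2PL-style sketch (all values illustrative):

```python
import numpy as np

def testlet_2pl_prob(theta, gamma_testlet, a, b):
    """P(correct) when a testlet effect gamma absorbs the local dependence
    among items that share a stimulus."""
    return 1.0 / (1.0 + np.exp(-a * (theta + gamma_testlet - b)))

print(testlet_2pl_prob(theta=0.5, gamma_testlet=0.0, a=1.2, b=0.0))  # no testlet effect
print(testlet_2pl_prob(theta=0.5, gamma_testlet=0.4, a=1.2, b=0.0))  # positive effect
```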
Osman Tat; Abdullah Faruk Kilic – Turkish Online Journal of Distance Education, 2024
The widespread availability of internet access in daily life has resulted in a greater acceptance of online assessment methods. E-assessment platforms offer various features such as randomizing questions and answers, utilizing extensive question banks, setting time limits, and managing access during online exams. Electronic assessment enables…
Descriptors: Test Construction, Test Validity, Test Reliability, Anxiety
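Two of the platform features named here, drawing from a question bank and randomizing answer order, are easy to make concrete; this toy sketch (data structures are my own) shows the idea:

```python
import copy
import random

def randomized_form(question_bank, n_questions, seed=None):
    """Draw a random subset of questions and shuffle each one's options,
    as e-assessment platforms do to deter copying."""
    rng = random.Random(seed)
    form = copy.deepcopy(rng.sample(question_bank, n_questions))
    for q in form:
        rng.shuffle(q["options"])
    return form

bank = [{"stem": f"Q{i}", "options": ["A", "B", "C", "D"]} for i in range(20)]
print([q["stem"] for q in randomized_form(bank, 5, seed=42)])
```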
Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients
Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022
The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…
Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory
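For a fully crossed persons-by-raters design, the generalizability coefficient that such studies compare against agreement indices can be estimated from ANOVA mean squares; a minimal one-facet sketch with simulated data (one score per cell):

```python
import numpy as np

def g_coefficient(scores):
    """Relative G coefficient for a persons x raters design with one
    observation per cell, via expected mean squares."""
    n_p, n_r = scores.shape
    grand = scores.mean()
    ms_p = n_r * np.sum((scores.mean(axis=1) - grand) ** 2) / (n_p - 1)
    resid = (scores - scores.mean(axis=1, keepdims=True)
             - scores.mean(axis=0, keepdims=True) + grand)
    ms_pr = np.sum(resid ** 2) / ((n_p - 1) * (n_r - 1))
    var_p = max((ms_p - ms_pr) / n_r, 0.0)   # person variance component
    return var_p / (var_p + ms_pr / n_r)

rng = np.random.default_rng(1)
true_scores = rng.normal(size=(30, 1))
ratings = true_scores + rng.normal(scale=0.5, size=(30, 4))
print(round(g_coefficient(ratings), 3))
```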
Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021
The paper proposes new measures of the difficulty and discriminating values of binary items, and of tests consisting of such items, and finds their relationships, including estimation of test error variance and thereby test reliability, as per definition, using cosine similarities. The measures use the entire data. The difficulty value of the test and of an item is defined…
Descriptors: Test Items, Difficulty Level, Scores, Test Reliability
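The exact cosine-based definitions are the paper's own, but the general device is simple to demonstrate: for 0/1 scoring, the cosine between an item's response vector and the all-ones vector equals the square root of its classical difficulty (p-value). A small check:

```python
import numpy as np

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

rng = np.random.default_rng(2)
X = (rng.random((100, 4)) < [0.3, 0.5, 0.7, 0.9]).astype(float)  # persons x items

ones = np.ones(X.shape[0])
for j in range(X.shape[1]):
    # For binary items, cos(x_j, 1) = sqrt(p_j).
    print(j, round(cosine(X[:, j], ones), 3), round(float(np.sqrt(X[:, j].mean())), 3))
```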
Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023
We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed that for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…
Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length
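Coefficient alpha and omega total, two of the indices compared here, are short enough to compute from scratch; the simulated one-factor data below are illustrative only.

```python
import numpy as np

def cronbach_alpha(X):
    """Coefficient alpha from a persons x items score matrix."""
    k = X.shape[1]
    return k / (k - 1) * (1 - X.var(axis=0, ddof=1).sum() / X.sum(axis=1).var(ddof=1))

def omega_total(loadings, uniquenesses):
    """Omega total from standardized one-factor estimates."""
    num = loadings.sum() ** 2
    return num / (num + uniquenesses.sum())

rng = np.random.default_rng(3)
lam = np.array([0.8, 0.7, 0.6, 0.5])
theta = rng.normal(size=(500, 1))
X = theta * lam + rng.normal(scale=np.sqrt(1 - lam ** 2), size=(500, 4))
print(round(cronbach_alpha(X), 3), round(omega_total(lam, 1 - lam ** 2), 3))
```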
Basman, Munevver – International Journal of Assessment Tools in Education, 2023
Ensuring the validity of a test requires checking that all items yield similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…
Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory
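One standard CTT-side DIF screen is the Mantel-Haenszel common odds ratio, computed across score strata; a minimal sketch (the stratification choice is an assumption):

```python
import numpy as np

def mantel_haenszel_alpha(correct, group, strata):
    """MH odds ratio for one dichotomous item: correct is 0/1, group is
    0 (reference) / 1 (focal), strata is a matching score per examinee.
    A value near 1.0 suggests no uniform DIF."""
    num = den = 0.0
    for s in np.unique(strata):
        m = strata == s
        a = np.sum((group[m] == 0) & (correct[m] == 1))   # reference correct
        b = np.sum((group[m] == 0) & (correct[m] == 0))
        c = np.sum((group[m] == 1) & (correct[m] == 1))   # focal correct
        d = np.sum((group[m] == 1) & (correct[m] == 0))
        n = a + b + c + d
        if n > 0:
            num += a * d / n
            den += b * c / n
    return num / den
```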
Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022
Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…
Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring
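Document-similarity methods of the kind surveyed here take only a few lines with standard tooling; a hedged sketch using scikit-learn's TF-IDF vectors (item texts invented for illustration):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

items = [
    "Solve for x in the linear equation 2x + 3 = 11.",
    "Find x when the equation 2x + 3 equals 11.",
    "Name the capital city of France.",
]
tfidf = TfidfVectorizer().fit_transform(items)  # one row per item text
print(cosine_similarity(tfidf).round(2))        # near-duplicates score high
```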
Kartianom Kartianom; Heri Retnawati; Kana Hidayati – Journal of Pedagogical Research, 2024
Conducting a fair test is important for educational research. Unfair assessments can lead to gender disparities in academic achievement, ultimately resulting in disparities in opportunities, wages, and career choice. Differential item functioning (DIF) analysis is presented to provide evidence of whether the test is truly fair, where it does not harm…
Descriptors: Foreign Countries, Test Bias, Item Response Theory, Test Theory
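A different DIF screen from the Mantel-Haenszel sketch above is the logistic-regression approach, which separates uniform from nonuniform DIF; a minimal sketch with simulated data (statsmodels for the fit):

```python
import numpy as np
import statsmodels.api as sm

def logistic_dif(correct, total, group):
    """Regress item correctness on matching score, group, and their
    interaction: the group term flags uniform DIF, the interaction
    term flags nonuniform DIF."""
    X = sm.add_constant(np.column_stack([total, group, total * group]))
    fit = sm.Logit(correct, X).fit(disp=0)
    return fit.params[2], fit.params[3]

rng = np.random.default_rng(4)
total = rng.normal(size=400)
group = rng.integers(0, 2, size=400)
p = 1 / (1 + np.exp(-(total + 0.5 * group)))     # uniform DIF built in
correct = (rng.random(400) < p).astype(int)
print(np.round(logistic_dif(correct, total, group), 2))
```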
Diana Muela-Bermejo; Irene Mendoza-Cercadillo; Lucía Hernández-Heras – Journal of Adolescent & Adult Literacy, 2024
This study involves translating, cross-culturally adapting, and validating the "Literary Response Questionnaire" (LRQ) for 413 Spanish adolescents. It explores the evolution of literary education in Spain and its alignment with the Reading Responses paradigm. The LRQ, adapted across various locations, is validated in Spanish through…
Descriptors: Reader Response, Adolescents, Questionnaires, Translation
Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H. J. Lewandowski – Physical Review Physics Education Research, 2024
The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…
Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
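The conventional baseline such a DIF-based approach is weighed against is a chance-corrected agreement index; plain Cohen's kappa for two raters on a polytomous scale, as a point of reference (data invented):

```python
import numpy as np

def cohens_kappa(r1, r2, categories):
    """Chance-corrected agreement between two raters."""
    idx = {c: i for i, c in enumerate(categories)}
    table = np.zeros((len(categories), len(categories)))
    for x, y in zip(r1, r2):
        table[idx[x], idx[y]] += 1
    p = table / table.sum()
    po = np.trace(p)                         # observed agreement
    pe = p.sum(axis=1) @ p.sum(axis=0)       # agreement expected by chance
    return (po - pe) / (1 - pe)

print(round(cohens_kappa([1, 2, 2, 3, 1], [1, 2, 3, 3, 1], [1, 2, 3]), 3))  # 0.706
```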
Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability