ERIC - Search Results

Showing 1 to 15 of 23 results

Reliability and Validity Evidence of Diagnostic Methods: Comparison of Diagnostic Classification Models and Item Response Theory-Based Methods

Yoo Jeong Jang – ProQuest LLC, 2022

Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…

Descriptors: Classification, Accuracy, Item Response Theory, Correlation

IRT Approaches to Modeling Scores on Mixed-Format Tests

Peer reviewed

Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020

This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…

Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests

Diagnosing Diagnostic Models: From Von Neumann's Elephant to Model Equivalencies and Network Psychometrics

Peer reviewed

von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2018

This article critically reviews how diagnostic models have been conceptualized and how they compare to other approaches used in educational measurement. In particular, certain assumptions that have been taken for granted and used as defining characteristics of diagnostic models are reviewed and it is questioned whether these assumptions are the…

Descriptors: Criticism, Psychometrics, Diagnostic Tests, Educational Assessment

Exploring the Impacts of Different Score Resolution Procedures on Person Fit and Estimated Achievement in Rater-Mediated Assessments

Peer reviewed

Wind, Stefanie A.; Walker, A. Adrienne – Language Assessment Quarterly, 2020

Scoring procedures for many rater-mediated performance assessments include score resolution procedures in which a third rater adjudicates discrepancies between two raters' ratings of the same performance. There are numerous approaches for calculating resolved scores that involve different combinations of the original and third ratings. Using data…

Descriptors: Scoring, Evaluators, Goodness of Fit, Content Area Writing

Examining Cognitive Diagnostic Modeling in Classroom Assessment Conditions

Peer reviewed

Paulsen, Justin; Valdivia, Dubravka Svetina – Journal of Experimental Education, 2022

Cognitive diagnostic models (CDMs) are a family of psychometric models designed to provide categorical classifications for multiple latent attributes. CDMs provide more granular evidence than other psychometric models and have potential for guiding teaching and learning decisions in the classroom. However, CDMs have primarily been conducted using…

Descriptors: Psychometrics, Classification, Teaching Methods, Learning Processes

Error Analysis: Past, Present, and Future

Peer reviewed

McCloskey, George – Journal of Psychoeducational Assessment, 2017

This commentary will take an historical perspective on the Kaufman Test of Educational Achievement (KTEA) error analysis, discussing where it started, where it is today, and where it may be headed in the future. In addition, the commentary will compare and contrast the KTEA error analysis procedures that are rooted in psychometric methodology and…

Descriptors: Achievement Tests, Error Patterns, Comparative Analysis, Psychometrics

Reliably Assessing Growth with Longitudinal Diagnostic Classification Models

Peer reviewed

Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019

Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…

Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests

Setting Cut Scores on an EFL Placement Test Using the Prototype Group Method: A Receiver Operating Characteristic (ROC) Analysis

Peer reviewed

Eckes, Thomas – Language Testing, 2017

This paper presents an approach to standard setting that combines the prototype group method (PGM; Eckes, 2012) with a receiver operating characteristic (ROC) analysis. The combined PGM-ROC approach is applied to setting cut scores on a placement test of English as a foreign language (EFL). To implement the PGM, experts first named learners whom…

Descriptors: English (Second Language), Language Tests, Cutting Scores, Standard Setting (Scoring)

In Search of the Optimal Number of Response Categories in a Rating Scale

Peer reviewed

Lee, Jihyun; Paek, Insu – Journal of Psychoeducational Assessment, 2014

Likert-type rating scales are still the most widely used method when measuring psychoeducational constructs. The present study investigates a long-standing issue of identifying the optimal number of response categories. A special emphasis is given to categorical data, which were generated by the Item Response Theory (IRT) Graded-Response Modeling…

Descriptors: Likert Scales, Responses, Item Response Theory, Classification

A Clinical Tool to Measure Trunk Control in Children with Cerebral Palsy: The Trunk Control Measurement Scale

Peer reviewed

Heyrman, Lieve; Molenaers, Guy; Desloovere, Kaat; Verheyden, Geert; De Cat, Jos; Monbaliu, Elegast; Feys, Hilde – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011

In this study the psychometric properties of the Trunk Control Measurement Scale (TCMS) in children with cerebral palsy (CP) were examined. Twenty-six children with spastic CP (mean age 11 years 3 months, range 8-15 years; Gross Motor Function Classification System level I n = 11, level II n = 5, level III n = 10) were included in this study. To…

Descriptors: Construct Validity, Cerebral Palsy, Test Validity, Interrater Reliability

Convergent Validity of the Autism Spectrum Disorder-Diagnostic for Children (ASD-DC) and Childhood Autism Rating Scales (CARS)

Peer reviewed

Matson, Johnny L.; Mahan, Sara; Hess, Julie A.; Fodstad, Jill C.; Neal, Daniene – Research in Autism Spectrum Disorders, 2010

Previous studies analyzed the reliability as well as sensitivity and specificity of the Autism Spectrum Disorder-Diagnostic for Children (ASD-DC). This study further examines the psychometric properties of the ASD-DC by assessing whether the ASD-DC has convergent validity against a psychometrically sound observational instrument for Autistic…

Descriptors: Verbal Communication, Nonverbal Communication, Autism, Validity

The Usefulness of the DBC-ASA as a Screening Instrument for Autism in Children with Intellectual Disabilities: A Pilot Study

Peer reviewed

Deb, Shoumitro; Dhaliwal, Akal-Joat; Roy, Meera – Journal of Applied Research in Intellectual Disabilities, 2009

Aims: To explore the validity of Developmental Behaviour Checklist-Autism Screening Algorithm (DBC-ASA) as a screening instrument for autism among children with intellectual disabilities. Method: Data were collected from the case notes of 109 children with intellectual disabilities attending a specialist clinic in the UK. Results: The mean score…

Descriptors: Mental Retardation, Autism, Screening Tests, Children

Diagnostic Classification Models: Are They Necessary? Commentary on Rupp and Templin (2008)

Peer reviewed

Gorin, Joanna S. – Measurement: Interdisciplinary Research and Perspectives, 2009

In their paper "Unique Characteristics of Diagnostic Classification Models: A Comprehensive Review of the Current State-of-the-Art," Andre Rupp and Jonathan Templin (2008) provide a comparative analysis of selected psychometric models useful for the analysis of multidimensional data for purposes of diagnostic score reporting. Recent assessment…

Descriptors: Psychological Evaluation, Classification, Scores, Psychometrics

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
