Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 19 |
Descriptor
Classification | 23 |
Comparative Analysis | 23 |
Psychometrics | 23 |
Educational Assessment | 7 |
Item Response Theory | 7 |
Evaluation Methods | 6 |
Foreign Countries | 6 |
Accuracy | 4 |
Definitions | 4 |
Educational Testing | 4 |
Equated Scores | 4 |
More ▼ |
Source
Author
Lee, Won-Chan | 2 |
Abedi, Jamal | 1 |
Brennan, Robert L. | 1 |
Choi, Jiwon | 1 |
Cresswell, Mike | 1 |
De Cat, Jos | 1 |
Deb, Shoumitro | 1 |
Desloovere, Kaat | 1 |
Dhaliwal, Akal-Joat | 1 |
Eckes, Thomas | 1 |
Ferrari, Joseph R. | 1 |
More ▼ |
Publication Type
Journal Articles | 21 |
Reports - Research | 12 |
Opinion Papers | 6 |
Reports - Evaluative | 4 |
Dissertations/Theses -… | 1 |
Reports - Descriptive | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 5 |
Higher Education | 1 |
Audience
Researchers | 1 |
Students | 1 |
Location
United Kingdom | 3 |
United States | 3 |
United Kingdom (England) | 2 |
United Kingdom (Wales) | 2 |
Australia | 1 |
Germany | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Advanced Placement… | 2 |
SAT (College Admission Test) | 2 |
Childhood Autism Rating Scale | 1 |
Developmental Behavior… | 1 |
Kaufman Test of Educational… | 1 |
What Works Clearinghouse Rating
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2018
This article critically reviews how diagnostic models have been conceptualized and how they compare to other approaches used in educational measurement. In particular, certain assumptions that have been taken for granted and used as defining characteristics of diagnostic models are reviewed and it is questioned whether these assumptions are the…
Descriptors: Criticism, Psychometrics, Diagnostic Tests, Educational Assessment
Wind, Stefanie A.; Walker, A. Adrienne – Language Assessment Quarterly, 2020
Scoring procedures for many rater-mediated performance assessments include score resolution procedures in which a third rater adjudicates discrepancies between two raters' ratings of the same performance. There are numerous approaches for calculating resolved scores that involve different combinations of the original and third ratings. Using data…
Descriptors: Scoring, Evaluators, Goodness of Fit, Content Area Writing
Paulsen, Justin; Valdivia, Dubravka Svetina – Journal of Experimental Education, 2022
Cognitive diagnostic models (CDMs) are a family of psychometric models designed to provide categorical classifications for multiple latent attributes. CDMs provide more granular evidence than other psychometric models and have potential for guiding teaching and learning decisions in the classroom. However, CDMs have primarily been conducted using…
Descriptors: Psychometrics, Classification, Teaching Methods, Learning Processes
McCloskey, George – Journal of Psychoeducational Assessment, 2017
This commentary will take an historical perspective on the Kaufman Test of Educational Achievement (KTEA) error analysis, discussing where it started, where it is today, and where it may be headed in the future. In addition, the commentary will compare and contrast the KTEA error analysis procedures that are rooted in psychometric methodology and…
Descriptors: Achievement Tests, Error Patterns, Comparative Analysis, Psychometrics
Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019
Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…
Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests
Eckes, Thomas – Language Testing, 2017
This paper presents an approach to standard setting that combines the prototype group method (PGM; Eckes, 2012) with a receiver operating characteristic (ROC) analysis. The combined PGM-ROC approach is applied to setting cut scores on a placement test of English as a foreign language (EFL). To implement the PGM, experts first named learners whom…
Descriptors: English (Second Language), Language Tests, Cutting Scores, Standard Setting (Scoring)
Lee, Jihyun; Paek, Insu – Journal of Psychoeducational Assessment, 2014
Likert-type rating scales are still the most widely used method when measuring psychoeducational constructs. The present study investigates a long-standing issue of identifying the optimal number of response categories. A special emphasis is given to categorical data, which were generated by the Item Response Theory (IRT) Graded-Response Modeling…
Descriptors: Likert Scales, Responses, Item Response Theory, Classification
Heyrman, Lieve; Molenaers, Guy; Desloovere, Kaat; Verheyden, Geert; De Cat, Jos; Monbaliu, Elegast; Feys, Hilde – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
In this study the psychometric properties of the Trunk Control Measurement Scale (TCMS) in children with cerebral palsy (CP) were examined. Twenty-six children with spastic CP (mean age 11 years 3 months, range 8-15 years; Gross Motor Function Classification System level I n = 11, level II n = 5, level III n = 10) were included in this study. To…
Descriptors: Construct Validity, Cerebral Palsy, Test Validity, Interrater Reliability
Matson, Johnny L.; Mahan, Sara; Hess, Julie A.; Fodstad, Jill C.; Neal, Daniene – Research in Autism Spectrum Disorders, 2010
Previous studies analyzed the reliability as well as sensitivity and specificity of the Autism Spectrum Disorder-Diagnostic for Children (ASD-DC). This study further examines the psychometric properties of the ASD-DC by assessing whether the ASD-DC has convergent validity against a psychometrically sound observational instrument for Autistic…
Descriptors: Verbal Communication, Nonverbal Communication, Autism, Validity
Deb, Shoumitro; Dhaliwal, Akal-Joat; Roy, Meera – Journal of Applied Research in Intellectual Disabilities, 2009
Aims: To explore the validity of Developmental Behaviour Checklist-Autism Screening Algorithm (DBC-ASA) as a screening instrument for autism among children with intellectual disabilities. Method: Data were collected from the case notes of 109 children with intellectual disabilities attending a specialist clinic in the UK. Results: The mean score…
Descriptors: Mental Retardation, Autism, Screening Tests, Children
Gorin, Joanna S. – Measurement: Interdisciplinary Research and Perspectives, 2009
In their paper "Unique Characteristics of Diagnostic Classification Models: A Comprehensive Review of the Current State-of-the-Art," Andre Rupp and Jonathan Templin (2008) provide a comparative analysis of selected psychometric models useful for the analysis of multidimensional data for purposes of diagnostic score reporting. Recent assessment…
Descriptors: Psychological Evaluation, Classification, Scores, Psychometrics
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Previous Page | Next Page ยป
Pages: 1 | 2