NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 8 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Yangmeng Xu; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2025
Double-scoring constructed-response items is a common but costly practice in mixed-format assessments. This study explored the impacts of Targeted Double-Scoring (TDS) and random double-scoring procedures on the quality of psychometric outcomes, including student achievement estimates, person fit, and student classifications under various…
Descriptors: Academic Achievement, Psychometrics, Scoring, Evaluation Methods
Peer reviewed Peer reviewed
W. Jake Thompson – Grantee Submission, 2024
Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…
Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit
Peer reviewed Peer reviewed
Direct linkDirect link
Wind, Stefanie A.; Walker, A. Adrienne – Language Assessment Quarterly, 2020
Scoring procedures for many rater-mediated performance assessments include score resolution procedures in which a third rater adjudicates discrepancies between two raters' ratings of the same performance. There are numerous approaches for calculating resolved scores that involve different combinations of the original and third ratings. Using data…
Descriptors: Scoring, Evaluators, Goodness of Fit, Content Area Writing
Peer reviewed Peer reviewed
Direct linkDirect link
Bradshaw, Laine P.; Madison, Matthew J. – International Journal of Testing, 2016
In item response theory (IRT), the invariance property states that item parameter estimates are independent of the examinee sample, and examinee ability estimates are independent of the test items. While this property has long been established and understood by the measurement community for IRT models, the same cannot be said for diagnostic…
Descriptors: Classification, Models, Simulation, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Ardasheva, Yuliya; Tretter, Thomas R. – Modern Language Journal, 2013
As the school-aged English language learner (ELL) population continues to grow in the United States and other English-speaking countries, psychometrically sound instruments to measure their language learning strategies (LLS) become ever more critical. This study adapted and validated an adult-oriented measure of LLS (50-item "Strategy…
Descriptors: Second Language Learning, Second Language Instruction, Learning Strategies, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Ebesutani, Chad; Bernstein, Adam; Nakamura, Brad J.; Chorpita, Bruce F.; Weisz, John R. – Journal of Abnormal Child Psychology, 2010
The Revised Child Anxiety and Depression Scale-Parent Version (RCADS-P) is a 47-item parent-report questionnaire of youth anxiety and depression, with scales corresponding to the DSM-IV categories of Separation Anxiety Disorder, Social Phobia, Generalized Anxiety Disorder (GAD), Panic Disorder, Obsessive-Compulsive Disorder, and Major Depressive…
Descriptors: Validity, Measures (Individuals), Psychometrics, Depression (Psychology)
Peer reviewed Peer reviewed
Direct linkDirect link
Langer, David A.; Wood, Jeffrey J.; Bergman, R. Lindsey; Piacentini, John C. – Child Psychiatry and Human Development, 2010
The present study examines the construct validity of separation anxiety disorder (SAD), social phobia (SoP), panic disorder (PD), and generalized anxiety disorder (GAD) in a clinical sample of children. Participants were 174 children, 6 to 17 years old (94 boys) who had undergone a diagnostic evaluation at a university hospital based clinic.…
Descriptors: Multitrait Multimethod Techniques, Construct Validity, Validity, Classification
Lam, Peter; Foong, Yoke-Yeen – 1996
This study attempts to estimate Structure of Learning Outcome (SOLO) levels in mathematics using the Partial Credit and Rating Scale models. A 30-item test comprising 10 testlets of 3 items each was designed and administered to 674 lower secondary school students. The items were arranged in a hierarchical manner, each testing SOLO levels in this…
Descriptors: Classification, Computer Assisted Testing, Foreign Countries, Goodness of Fit