NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Does not meet standards1
Showing 1 to 15 of 16 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Ziegler, Wolfram; Staiger, Anja; Schölderle, Theresa; Vogel, Mathias – Journal of Speech, Language, and Hearing Research, 2017
Purpose: Standardized clinical assessment of dysarthria is essential for management and research. We present a new, fully standardized dysarthria assessment, the Bogenhausen Dysarthria Scales (BoDyS). The measurement model of the BoDyS is based on auditory evaluations of connected speech using 9 scales (traits) assessed by 4 elicitation methods.…
Descriptors: Auditory Evaluation, Test Reliability, Test Validity, Rating Scales
Peer reviewed Peer reviewed
Direct linkDirect link
Wodka, Ericka L.; Puts, Nicolaas A. J.; Mahone, E. Mark; Edden, Richard A. E.; Tommerdahl, Mark; Mostofsky, Stewart H. – Journal of Autism and Developmental Disorders, 2016
Sensory processing abnormalities in autism have largely been described by parent report. This study used a multi-method (parent-report and measurement), multi-trait (tactile sensitivity and attention) design to evaluate somatosensory processing in ASD. Results showed multiple significant within-method (e.g., parent report of different…
Descriptors: Attention, Multitrait Multimethod Techniques, Autism, Pervasive Developmental Disorders
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Park, Siwon – Journal of Pan-Pacific Association of Applied Linguistics, 2017
This paper examines how different test methods may tap different aspects of second language knowledge. It employs multiple-choice (MC) and constructed response (CR) items which yield distinct or convergent information in the computer delivered testing of English in its presentation of this factor. In order to examine the effects of test method, a…
Descriptors: Evaluation Methods, Second Language Learning, English (Second Language), Computer Assisted Testing
Kern, Justin L.; McBride, Brent A.; Laxman, Daniel J.; Dyer, W. Justin; Santos, Rosa M.; Jeans, Laurie M. – Grantee Submission, 2016
Measurement invariance (MI) is a property of measurement that is often implicitly assumed, but in many cases, not tested. When the assumption of MI is tested, it generally involves determining if the measurement holds longitudinally or cross-culturally. A growing literature shows that other groupings can, and should, be considered as well.…
Descriptors: Psychology, Measurement, Error of Measurement, Measurement Objectives
Peer reviewed Peer reviewed
Direct linkDirect link
Ngo, Federick; Kwon, William W. – Research in Higher Education, 2015
Community college students are often placed in developmental math courses based on the results of a single placement test. However, concerns about accurate placement have recently led states and colleges across the country to consider using other measures to inform placement decisions. While the relationships between college outcomes and such…
Descriptors: Access to Education, Success, Community Colleges, Mathematics Education
Peer reviewed Peer reviewed
Direct linkDirect link
Blagov, Pavel S.; Bi, Wu; Shedler, Jonathan; Westen, Drew – Assessment, 2012
The Shedler-Westen Assessment Procedure (SWAP) is a personality assessment instrument designed for use by expert clinical assessors. Critics have raised questions about its psychometrics, most notably its validity across observers and situations, the impact of its fixed score distribution on research findings, and its test-retest reliability. We…
Descriptors: Personality Measures, Personality Assessment, Psychometrics, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Rojahn, Johannes; Schroeder, Stephen R.; Mayo-Ortega, Liliana; Oyama-Ganiko, Rosao; LeBlanc, Judith; Marquis, Janet; Berke, Elizabeth – Research in Developmental Disabilities: A Multidisciplinary Journal, 2013
Reliable and valid assessment of aberrant behaviors is essential in empirically verifying prevention and intervention for individuals with intellectual or developmental disabilities (IDD). Few instruments exist which assess behavior problems in infants. The current longitudinal study examined the performance of three behavior-rating scales for…
Descriptors: Rating Scales, Behavior Problems, Developmental Disabilities, Infants
Peer reviewed Peer reviewed
Direct linkDirect link
Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2014
The psychometric properties of the Revised Children's Manifest Anxiety Scale-Second Edition (RCMAS-2) were examined in a sample of 1,003 U.S. elementary and secondary students in Grades 2 to 12. Confirmatory factor analyses (CFAs) were performed comparing the five-factor (target) model consisting of three anxiety (Physiological Anxiety, Social…
Descriptors: Psychometrics, Anxiety, Elementary School Students, Secondary School Students
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Gu, Lin; Turkan, Sultan; Gomez, Pablo Garcia – ETS Research Report Series, 2015
ELTeach is an online professional development program developed by Educational Testing Service (ETS) in collaboration with National Geographic Learning. The ELTeach program consists of two courses: English-for-Teaching and Professional Knowledge for English Language Teaching (ELT). Each course includes a coordinated assessment leading to a score…
Descriptors: Item Analysis, Test Items, English (Second Language), Second Language Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Bernard, Larry C.; Mills, Michael; Swenson, Leland; Walsh, R. Patricia – Assessment, 2008
We report the development of the Assessment of Individual Motives-Questionnaire (AIM-Q), a new instrument based on an evolutionary psychology theory of human motivation. It provides multitrait-multimethod (MTMM) assessment of individual differences on 15 motive scales. A total heterogeneous sample of N = 1,251 participated in eight studies that…
Descriptors: Test Construction, Questionnaires, Test Reliability, Multitrait Multimethod Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
Peer reviewed Peer reviewed
Rafaeli, Sheizaf; Tractinsky, Noam – Computers in Human Behavior, 1991
Discussion of time-related measures in computerized ability tests focuses on a study of college students that used two intelligence test item types to develop a multitrait, multimethod assessment of response time measures. Convergent and discriminant validation are discussed, correlations between response time and accuracy are examined, and…
Descriptors: Computer Assisted Testing, Correlation, Higher Education, Intelligence Tests
Van Velsor, Ellen; Leslie, Jean Brittain; Fleenor, John W. – 1997
This book presents a nontechnical, step-by-step process that shows how to evaluate any 360-degree-feedback instrument intended for management or leadership development. The 360-degree-feedback instruments collect information from different sources about a target manager's performance, and they offer multiple perspectives. The 16 steps in…
Descriptors: Administrator Characteristics, Evaluation Methods, Feedback, Interrater Reliability
Chang, Lei – 1993
Equivalence in reliability and validity across 4-point and 6-point scales was assessed by fitting different measurement models through confirmatory factor analysis of a multitrait-multimethod covariance matrix. Responses to nine Likert-type items designed to measure perceived quantitative ability, self-perceived usefulness of quantitative…
Descriptors: Ability, Comparative Testing, Education Majors, Graduate Students
Previous Page | Next Page »
Pages: 1  |  2