NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 41 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed Peer reviewed
PDF on ERIC Download full text
von Davier, Matthias – ETS Research Report Series, 2016
This report presents results on a parallel implementation of the expectation-maximization (EM) algorithm for multidimensional latent variable models. The developments presented here are based on code that parallelizes both the E step and the M step of the parallel-E parallel-M algorithm. Examples presented in this report include item response…
Descriptors: Psychometrics, Mathematics, Models, Statistical Analysis
Kankaraš, Miloš; Feron, Eva; Renbarger, Rachel – OECD Publishing, 2019
Triangulation -- a combined use of different assessment methods or sources to evaluate psychological constructs -- is still a rarely used assessment approach in spite of its potential in overcoming inherent constraints of individual assessment methods. This paper uses field test data from a new OECD Study on Social and Emotional Skills to examine…
Descriptors: Interpersonal Competence, Emotional Intelligence, Evaluation Methods, Student Evaluation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Chaidi, Thirachai; Damrongpanich, Sunthorapot – Educational Research and Reviews, 2016
The purposes of this study were to develop a model to measure the belief in Buddhism of junior high school students at Chiang Rai Buddhist Scripture School, and to determine construct validity of the model for measuring the belief in Buddhism by using Multitrait-Multimethod analysis. The samples were 590 junior high school students at Buddhist…
Descriptors: Buddhism, Beliefs, Attitude Measures, Multitrait Multimethod Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Donovan, Corinne Baron; Mercier, Kevin; Phillips, Sharon R. – Measurement in Physical Education and Exercise Science, 2015
The Centers for Disease Control have suggested that physical education plays a role in promoting healthy lifestyles. Prior research suggests a link between attitudes toward physical education and physical activity outside school. The current study provides additional evidence of construct validity through a validation across two instruments…
Descriptors: Physical Education, Construct Validity, Test Validity, Life Style
Peer reviewed Peer reviewed
Direct linkDirect link
Blagov, Pavel S.; Bi, Wu; Shedler, Jonathan; Westen, Drew – Assessment, 2012
The Shedler-Westen Assessment Procedure (SWAP) is a personality assessment instrument designed for use by expert clinical assessors. Critics have raised questions about its psychometrics, most notably its validity across observers and situations, the impact of its fixed score distribution on research findings, and its test-retest reliability. We…
Descriptors: Personality Measures, Personality Assessment, Psychometrics, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2014
The psychometric properties of the Revised Children's Manifest Anxiety Scale-Second Edition (RCMAS-2) were examined in a sample of 1,003 U.S. elementary and secondary students in Grades 2 to 12. Confirmatory factor analyses (CFAs) were performed comparing the five-factor (target) model consisting of three anxiety (Physiological Anxiety, Social…
Descriptors: Psychometrics, Anxiety, Elementary School Students, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Dik, Bryan J.; Eldridge, Brandy M.; Steger, Michael F.; Duffy, Ryan D. – Journal of Career Assessment, 2012
Research on work as a calling is limited by measurement concerns. In response, the authors introduce the multidimensional Calling and Vocation Questionnaire (CVQ) and the Brief Calling scale (BCS), instruments assessing presence of, and search for, a calling. Study 1 describes CVQ development using exploratory and confirmatory factor analysis…
Descriptors: Multitrait Multimethod Techniques, Construct Validity, Validity, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Rodgers, Joseph Lee; Rodgers, Jacci L. – Journal of Continuing Higher Education, 2011
We propose, develop, and evaluate the black ink-red ink (BIRI) method of testing. This approach uses two different methods within the same test administration setting, one that matches recognition learning and the other that matches recall learning. Students purposively define their own tradeoff between the two approaches. Evaluation of the method…
Descriptors: Testing, Test Anxiety, Recall (Psychology), Recognition (Psychology)
Peer reviewed Peer reviewed
Direct linkDirect link
Quellmalz, Edys S.; Davenport, Jodi L.; Timms, Michael J.; DeBoer, George E.; Jordan, Kevin A.; Huang, Chun-Wei; Buckley, Barbara C. – Journal of Educational Psychology, 2013
How can assessments measure complex science learning? Although traditional, multiple-choice items can effectively measure declarative knowledge such as scientific facts or definitions, they are considered less well suited for providing evidence of science inquiry practices such as making observations or designing and conducting investigations.…
Descriptors: Science Education, Educational Assessment, Psychometrics, Science Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Leite, Walter L.; Svinicki, Marilla; Shi, Yuying – Educational and Psychological Measurement, 2010
The authors examined the dimensionality of the VARK learning styles inventory. The VARK measures four perceptual preferences: visual (V), aural (A), read/write (R), and kinesthetic (K). VARK questions can be viewed as testlets because respondents can select multiple items within a question. The correlations between items within testlets are a type…
Descriptors: Multitrait Multimethod Techniques, Construct Validity, Reliability, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Langer, David A.; Wood, Jeffrey J.; Bergman, R. Lindsey; Piacentini, John C. – Child Psychiatry and Human Development, 2010
The present study examines the construct validity of separation anxiety disorder (SAD), social phobia (SoP), panic disorder (PD), and generalized anxiety disorder (GAD) in a clinical sample of children. Participants were 174 children, 6 to 17 years old (94 boys) who had undergone a diagnostic evaluation at a university hospital based clinic.…
Descriptors: Multitrait Multimethod Techniques, Construct Validity, Validity, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Seifert, Tricia A.; Goodman, Kathleen; King, Patricia M.; Baxter Magolda, Marcia B. – Journal of Mixed Methods Research, 2010
This study details the collection, analysis, and interpretation of data from a national multi-institutional longitudinal mixed methods study of college impact and student development of liberal arts outcomes. The authors found three sets of practices in the quantitative data that corroborated with the themes that emerged from the qualitative data:…
Descriptors: Student Development, Liberal Arts, Content Analysis, Data Interpretation
Peer reviewed Peer reviewed
Direct linkDirect link
Gotwals, John K.; Dunn, John G. H. – Measurement in Physical Education and Exercise Science, 2009
This article presents a chronology of three empirical studies that outline the measurement process by which two new subscales ("Doubts about Actions" and "Organization") were developed and integrated into a revised version of Dunn, Causgrove Dunn, and Syrotuik's (2002) "Sport Multidimensional Perfectionism Scale"…
Descriptors: Construct Validity, Measures (Individuals), Multidimensional Scaling, Multitrait Multimethod Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Doumen, Sarah; Verschueren, Karine; Buyse, Evelien; De Munter, Sofie; Max, Kristel; Moens, Loth – Infant and Child Development, 2009
Two studies extended psychometric research on the Student-Teacher Relationship Scale (STRS) with kindergarten and preschool children (N[subscript 1] = 60-7[subscript 1]; N[subscript 2] = 35) and their teachers. These studies used a multi-method approach to replicate and extend previous findings concerning the convergent validity of the STRS…
Descriptors: Conflict, Validity, Preschool Children, Kindergarten
Previous Page | Next Page »
Pages: 1  |  2  |  3