NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…
Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023
This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015
For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…
Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Martin, Andrew J.; Yu, Kai; Papworth, Brad; Ginns, Paul; Collie, Rebecca J. – Journal of Psychoeducational Assessment, 2015
This study explored motivation and engagement among North American (the United States and Canada; n = 1,540), U.K. (n = 1,558), Australian (n = 2,283), and Chinese (n = 3,753) secondary school students. Motivation and engagement were assessed via students' responses to the Motivation and Engagement Scale-High School (MES-HS). Confirmatory factor…
Descriptors: Foreign Countries, Motivation, Learner Engagement, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Marson, Stephen M.; Wei, Guo; Wasserman, Deborah – American Journal of Evaluation, 2009
Goal attainment scaling (GAS) has been considered to be one of the most versatile and appealing evaluation protocols available for human services. Aspects of the protocol that make the method so appealing to practitioners--that is, collaboratively working with individual clients to identify and assign weights to goals they will work to…
Descriptors: Human Services, Scaling, Test Reliability, Interrater Reliability
Peer reviewed Peer reviewed
Plucker, Jonathan A. – Journal of Secondary Gifted Education, 1997
This study used a sample (n=967) of academically gifted adolescent students attending summer enrichment programs and participating in urban school districts' gifted programs to evaluate the reliability and validity of the Adolescent Coping Scale. Results suggest the instrument is sufficiently reliable for group administration and research purposes…
Descriptors: Academically Gifted, Adolescents, Coping, Elementary Secondary Education
Peer reviewed Peer reviewed
Direct linkDirect link
Brunner, Martin; SuB, Heinz-Martin – Educational and Psychological Measurement, 2005
Two aspects of the reliability of multidimensional measures can be distinguished: the amount of scale score variance that is accounted for by all underlying factors (composite reliability) and the degree to which the scale score reflects one particular factor (construct reliability). Confidence intervals for composite and construct reliabilities…
Descriptors: Measures (Individuals), Intervals, Intelligence Tests, Evaluation Methods
Secolsky, Charles, Ed.; Denison, D. Brian, Ed. – Routledge, Taylor & Francis Group, 2011
Increased demands for colleges and universities to engage in outcomes assessment for accountability purposes have accelerated the need to bridge the gap between higher education practice and the fields of measurement, assessment, and evaluation. The "Handbook on Measurement, Assessment, and Evaluation in Higher Education" provides higher…
Descriptors: Generalizability Theory, Higher Education, Institutional Advancement, Teacher Effectiveness
Younglove, William A. – 1983
In the early twentieth century behaviorist Edward L. Thorndike began the development and use of measurement scales to replace personal judgment to evaluate student compositions in U.S. public schools. In 1912, utilizing the Fullerton and Catell equal difference theorem, Milo B. Hillegas released the first scientifically designed scale to measure…
Descriptors: Behavior Theories, Educational History, Elementary Secondary Education, Evaluation Methods
Obiekwe, Jerry C. – 1999
This study compared college students' responses on their evaluations of the effectiveness of full- and part-time college faculty. A group of 1,101 students completed evaluation instruments for all courses taught by full-time faculty, and 2,067 students completed evaluations for all courses taught by part-time faculty in spring 1998. In fall 1998,…
Descriptors: College Faculty, College Students, Evaluation Methods, Full Time Faculty
Michigan State Dept. of Education, Lansing. Research, Evaluation, and Assessment Services. – 1972
The ninth report of the Michigan Educational Assessment Program contains the technical information needed to evaluate the instruments and techniques used to measure and report the status of student achievement and attitude. This report is intended for people with expertise in psychometrics. The Program is described in the first section of the…
Descriptors: Achievement Tests, Attitude Measures, Cluster Analysis, Correlation
Izard, J. F. – 1977
This material provides a discussion of the construction and analysis of tests prepared for classroom use by teachers. The initial discussion is concerned with the purposes of evaluation and the specification of objectives. This is followed by an examination of theoretical and practical considerations in planning a test. The material on test item…
Descriptors: Criterion Referenced Tests, Difficulty Level, Educational Objectives, Evaluation Criteria