ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	9

Descriptor

Evaluation Methods	15
Test Reliability	15
Test Validity	11
Multidimensional Scaling	8
Measurement Techniques	7
Scaling	7
Psychometrics	6
Test Construction	6
College Students	4
Factor Analysis	4
Item Analysis	4
Item Response Theory	4
Rating Scales	4
Statistical Analysis	4
Correlation	3
Elementary Secondary Education	3
Scores	3
Achievement Tests	2
College Faculty	2
Criterion Referenced Tests	2
Data Analysis	2
Evaluation Criteria	2
Factor Structure	2
Foreign Countries	2
Generalizability Theory	2
More ▼

Source

Journal of Psychoeducational…	2
American Journal of Evaluation	1
Educational Assessment	1
Educational Sciences: Theory…	1
Educational and Psychological…	1
Grantee Submission	1
Journal of Educational…	1
Journal of Secondary Gifted…	1
Routledge, Taylor & Francis…	1
Structural Equation Modeling:…	1

Publication Type

Journal Articles	9
Reports - Research	7
Reports - Descriptive	2
Speeches/Meeting Papers	2
Books	1
Collected Works - General	1
Guides - General	1
Historical Materials	1
Information Analyses	1
Reports - Evaluative	1
Tests/Questionnaires	1
More ▼

Education Level

Secondary Education	4
Higher Education	3
Junior High Schools	2
Middle Schools	2
Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
High Schools	1
Postsecondary Education	1
Two Year Colleges	1

Audience

Researchers

Location

Michigan	2
Australia	1
California	1
Canada	1
China	1
United Kingdom	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	2
Program for International…	1
Progress in International…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Evaluation of Maximal Reliability for Multidimensional Measuring Instruments Using Structural Equation Modeling

Peer reviewed

Direct link

Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…

Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Validation of the Child Observation Record Advantage 1.5 Assessment Tool for Preschool Children: A Multilevel Bifactor Modeling Approach

Peer reviewed

Direct link

Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023

This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…

Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Direct link

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities

Peer reviewed
PDF on ERIC

Download full text

Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015

For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…

Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Motivation and Engagement in the United States, Canada, United Kingdom, Australia, and China: Testing a Multi-Dimensional Framework

Peer reviewed

Direct link

Martin, Andrew J.; Yu, Kai; Papworth, Brad; Ginns, Paul; Collie, Rebecca J. – Journal of Psychoeducational Assessment, 2015

This study explored motivation and engagement among North American (the United States and Canada; n = 1,540), U.K. (n = 1,558), Australian (n = 2,283), and Chinese (n = 3,753) secondary school students. Motivation and engagement were assessed via students' responses to the Motivation and Engagement Scale-High School (MES-HS). Confirmatory factor…

Descriptors: Foreign Countries, Motivation, Learner Engagement, Secondary School Students

A Reliability Analysis of Goal Attainment Scaling (GAS) Weights

Peer reviewed

Direct link

Marson, Stephen M.; Wei, Guo; Wasserman, Deborah – American Journal of Evaluation, 2009

Goal attainment scaling (GAS) has been considered to be one of the most versatile and appealing evaluation protocols available for human services. Aspects of the protocol that make the method so appealing to practitioners--that is, collaboratively working with individual clients to identify and assign weights to goals they will work to…

Descriptors: Human Services, Scaling, Test Reliability, Interrater Reliability

Psychometric Characteristics of the Adolescent Coping Scale with Academically Gifted Adolescents.

Peer reviewed

Plucker, Jonathan A. – Journal of Secondary Gifted Education, 1997

This study used a sample (n=967) of academically gifted adolescent students attending summer enrichment programs and participating in urban school districts' gifted programs to evaluate the reliability and validity of the Adolescent Coping Scale. Results suggest the instrument is sufficiently reliable for group administration and research purposes…

Descriptors: Academically Gifted, Adolescents, Coping, Elementary Secondary Education

Analyzing the Reliability of Multidimensional Measures: An Example from Intelligence Research

Peer reviewed

Direct link

Brunner, Martin; SuB, Heinz-Martin – Educational and Psychological Measurement, 2005

Two aspects of the reliability of multidimensional measures can be distinguished: the amount of scale score variance that is accounted for by all underlying factors (composite reliability) and the degree to which the scale score reflects one particular factor (construct reliability). Confidence intervals for composite and construct reliabilities…

Descriptors: Measures (Individuals), Intervals, Intelligence Tests, Evaluation Methods

Handbook on Measurement, Assessment, and Evaluation in Higher Education

Direct link

Secolsky, Charles, Ed.; Denison, D. Brian, Ed. – Routledge, Taylor & Francis Group, 2011

Increased demands for colleges and universities to engage in outcomes assessment for accountability purposes have accelerated the need to bridge the gap between higher education practice and the fields of measurement, assessment, and evaluation. The "Handbook on Measurement, Assessment, and Evaluation in Higher Education" provides higher…

Descriptors: Generalizability Theory, Higher Education, Institutional Advancement, Teacher Effectiveness

A Look at Behavioristic Measurement of English Composition in United States Public Schools, 1901-1941.

Younglove, William A. – 1983

In the early twentieth century behaviorist Edward L. Thorndike began the development and use of measurement scales to replace personal judgment to evaluate student compositions in U.S. public schools. In 1912, utilizing the Fullerton and Catell equal difference theorem, Milo B. Hillegas released the first scientifically designed scale to measure…

Descriptors: Behavior Theories, Educational History, Elementary Secondary Education, Evaluation Methods

The Multidimensional Character of Teaching Effectiveness: A Comparative Analysis of Student Evaluation Responses of Full and Part-Time Faculty.

Download full text

Obiekwe, Jerry C. – 1999

This study compared college students' responses on their evaluations of the effectiveness of full- and part-time college faculty. A group of 1,101 students completed evaluation instruments for all courses taught by full-time faculty, and 2,067 students completed evaluations for all courses taught by part-time faculty in spring 1998. In fall 1998,…

Descriptors: College Faculty, College Students, Evaluation Methods, Full Time Faculty

Technical Report of the 1970-71 Michigan Educational Assessment Battery. The Ninth Report of the 1970-71 Michigan Educational Assessment Program.

Michigan State Dept. of Education, Lansing. Research, Evaluation, and Assessment Services. – 1972

The ninth report of the Michigan Educational Assessment Program contains the technical information needed to evaluate the instruments and techniques used to measure and report the status of student achievement and attitude. This report is intended for people with expertise in psychometrics. The Program is described in the first section of the…

Descriptors: Achievement Tests, Attitude Measures, Cluster Analysis, Correlation

Construction and Analysis of Classroom Tests.

Izard, J. F. – 1977

This material provides a discussion of the construction and analysis of tests prepared for classroom use by teachers. The initial discussion is concerned with the purposes of evaluation and the specification of objectives. This is followed by an examination of theoretical and practical considerations in planning a test. The material on test item…

Descriptors: Criterion Referenced Tests, Difficulty Level, Educational Objectives, Evaluation Criteria

Akaeze, Hope O.	1
Amery D. Wu	1
Bingsheng Zhang	1
Brunner, Martin	1
Collie, Rebecca J.	1
Denison, D. Brian, Ed.	1
Ginns, Paul	1
Izard, J. F.	1
Jake Stone	1
Kelecioglu, Hülya	1
Lawrence, Frank R.	1
Lee, Minji K.	1
Marson, Stephen M.	1
Martin, Andrew J.	1
Melican, Gerald J.	1
O'Reilly, Tenaha	1
Obiekwe, Jerry C.	1
Papworth, Brad	1
Petscher, Yaacov	1
Plucker, Jonathan A.	1
Sabatini, John	1
Secolsky, Charles, Ed.	1
Shun-Fu Hu	1
SuB, Heinz-Martin	1
Sweeney, Kevin	1
More ▼