ERIC - Search Results

Publication Date

In 2025	2
Since 2024	6
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	24
Since 2006 (last 20 years)	48

Descriptor

Evaluation Methods	99
Test Reliability	99
Models	73
Test Validity	65
Statistical Analysis	23
Test Construction	22
Measurement Techniques	20
Student Evaluation	19
Foreign Countries	18
Higher Education	14
Academic Achievement	12
Mathematical Models	12
Item Analysis	11
Scores	11
Correlation	10
Criterion Referenced Tests	10
Evaluation Criteria	10
Testing	10
College Students	9
Computer Assisted Testing	9
Program Evaluation	9
Teacher Effectiveness	9
Teacher Evaluation	9
Value Added Models	9
Elementary Secondary Education	8
More ▼

Publication Type

Reports - Research	51
Journal Articles	50
Reports - Evaluative	17
Speeches/Meeting Papers	10
Reports - Descriptive	6
Collected Works - Proceedings	4
Information Analyses	4
Books	2
Collected Works - General	2
Dissertations/Theses -…	2
Guides - Classroom - Teacher	2
Guides - Non-Classroom	2
Numerical/Quantitative Data	2
Reports - General	2
Guides - General	1
Opinion Papers	1
Reference Materials -…	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	13
Postsecondary Education	9
Elementary Secondary Education	5
High Schools	4
Secondary Education	3
Early Childhood Education	2
Elementary Education	2
Grade 10	2
Grade 9	2
Kindergarten	2
Middle Schools	2
Primary Education	2
Adult Education	1
Grade 1	1
Grade 11	1
Grade 12	1
Grade 3	1
Preschool Education	1
Two Year Colleges	1
More ▼

Audience

Practitioners	3
Researchers	3
Administrators	1
Policymakers	1
Teachers	1

Location

China	3
Japan	3
United Kingdom	3
Brazil	2
California	2
Florida	2
Germany	2
Ohio	2
Pennsylvania	2
Russia	2
Spain	2
United States	2
Asia	1
Australia	1
China (Beijing)	1
Colorado	1
Colorado (Denver)	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
France	1
Georgia	1
Ghana	1
Greece	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Georgia Criterion Referenced…	1
Hidden Figures Test	1
Motivated Strategies for…	1
NEO Personality Inventory	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 99 results Save | Export

Signal-to-Noise Ratio in Estimating and Testing the Mediation Effect: Structural Equation Modeling versus Path Analysis with Weighted Composites

Peer reviewed

Direct link

Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024

Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…

Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing

Modeling Retest Effects in Developmental Processes Using Latent Change Score Models

Peer reviewed

Direct link

Rohit Batra; Silvia A. Bunge; Emilio Ferrer – Structural Equation Modeling: A Multidisciplinary Journal, 2022

Studying development processes, as they unfold over time, involves collecting repeated measures from individuals and modeling the changes over time. One methodological challenge in this type of longitudinal data is separating retest effects, due to the repeated assessments, from developmental processes such as maturation or age. In this article,…

Descriptors: Children, Adolescents, Longitudinal Studies, Test Reliability

Evaluation of Maximal Reliability for Multidimensional Measuring Instruments Using Structural Equation Modeling

Peer reviewed

Direct link

Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…

Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability

Estimating the Reliability of Skill Transitions in Longitudinal Diagnostic Classification Models

Peer reviewed

Direct link

Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024

Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…

Descriptors: Diagnostic Tests, Classification, Models, Psychometrics

Enhancing Model Fit Evaluation in SEM: Practical Tips for Optimizing Chi-Square Tests

Peer reviewed

Direct link

Bang Quan Zheng; Peter M. Bentler – Structural Equation Modeling: A Multidisciplinary Journal, 2025

This paper aims to advocate for a balanced approach to model fit evaluation in structural equation modeling (SEM). The ongoing debate surrounding chi-square test statistics and fit indices has been characterized by ambiguity and controversy. Despite the acknowledged limitations of relying solely on the chi-square test, its careful application can…

Descriptors: Monte Carlo Methods, Structural Equation Models, Goodness of Fit, Robustness (Statistics)

Application of Rasch Model in Two-Tier Test for Assessing Critical Thinking in Physics Education

Peer reviewed
PDF on ERIC

Download full text

Sujiyani Kassiavera; A. Suparmi; C. Cari; Sukarmin Sukarmin – Journal of Baltic Science Education, 2024

The challenge of accurately assessing critical thinking in physics education, particularly on topics like work and energy, remains a key issue for educators. The current study aims to address this challenge by exploring students' critical thinking abilities using two-tier test data analyzed through the Rasch model. Data were collected from…

Descriptors: Critical Thinking, Physics, Science Instruction, Foreign Countries

An Instrument for Measuring the Improvement Work of Professional Learning Communities

Peer reviewed
PDF on ERIC

Download full text

Aimee Howley; Craig B. Howley; Marged Dudek – Journal of Educational Leadership and Policy Studies, 2025

This article explores the development and evaluation of the Building Leadership Team Assessment Tool (BLT-AT), designed to measure Professional Learning Communities' (PLCs') use of effective school improvement practices. The BLT-AT is grounded in Ohio's inclusive instructional leadership model, which emphasizes the improvement of teaching and…

Descriptors: Test Construction, Communities of Practice, Instructional Leadership, Evaluation Methods

Evaluation Tool for Traditional Chinese Medicine Students in China: A Competency Perspective

Peer reviewed

Direct link

Zheng, Boyang; Sun, Guiping; Wang, Hourong – SAGE Open, 2019

Traditional Chinese medicine (TCM) is an important component of China's medical system. How to educate TCM practitioners in China, therefore, has become a crucial issue. To contribute to this issue, the current research identified the competency model of TCM practitioners in China and developed an evaluation for TCM students. We combined Bloom's…

Descriptors: Medical Students, Correlation, Foreign Countries, Test Reliability

Exploration of Factors Affecting the Added Value of Test Subscores

Peer reviewed

Direct link

Wang, Xiaolin; Svetina, Dubravka; Dai, Shenghai – Journal of Experimental Education, 2019

Recently, interest in test subscore reporting for diagnosis purposes has been growing rapidly. The two simulation studies here examined factors (sample size, number of subscales, correlation between subscales, and three factors affecting subscore reliability: number of items per subscale, item parameter distribution, and data generating model)…

Descriptors: Value Added Models, Scores, Sample Size, Correlation

Approaches for Combining Multiple Measures of Teacher Performance: Reliability, Validity, and Implications for Evaluation Policy

Peer reviewed

Direct link

Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016

A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…

Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy

All Sizzle and No Steak: Value-Added Model Doesn't Add Value in Houston

Direct link

Amrein-Beardsley, Audrey; Geiger, Tray – Phi Delta Kappan, 2017

Houston's experience with the Educational Value-Added Assessment System (R) (EVAAS) raises questions that other districts should consider before buying the software and using it for high-stakes decisions. Researchers found that teachers in Houston, all of whom were under the EVAAS gun, but who taught relatively more racial minority students,…

Descriptors: Value Added Models, School Districts, Computer Software, Educational Technology

Evaluating the Measuring Properties of the Principal Instructional Management Rating Scale in the Chinese Educational System: Implications for Measuring School Leadership

Peer reviewed

Direct link

Antoniou, Panayiotis; Lu, Mohan – Educational Management Administration & Leadership, 2018

During the last 25 years researchers have proposed a number of conceptual frameworks to measure the various functions of instructional leadership. One of the most frequently used frameworks is the Principal Instructional Management Rating Scale (PIMRS). Despite the great number of studies employing the PIMRS, evidence for its reliability and…

Descriptors: Rating Scales, Instructional Leadership, Evaluation Methods, Educational Administration

The State Kindergarten Entry Assessment Digital Technology Landscape. PERC Report and ETS Research Report Series No. RR-20-26

Peer reviewed
PDF on ERIC

Download full text

Ackerman, Debra J. – ETS Research Report Series, 2020

Over the past 8 years, U.S. kindergarten classrooms have been impacted by policies mandating or recommending the administration of a specific kindergarten entry assessment (KEA) in the initial months of school as well as the increasing reliance on digital technology in the form of mobile apps, touchscreen devices, and online data platforms. Using…

Descriptors: Kindergarten, School Readiness, Computer Assisted Testing, Preschool Teachers

Adapting Scale for Children: A Practical Model for Researchers

Download full text

Aydin, Selami; Harputlu, Leyla; Çelik, Seyda Savran; Ustuk, Özgehan; Güzel, Serhat; Genç, Deniz – Online Submission, 2016

Measurement of children's behaviors in an educational and research context is a problematic and complex area. It is also evident that adapting scales to measure children's behaviors in an educational and research context is a complex process due to several reasons. First, cultural elements constitute a considerable problem. Second, it is difficult…

Descriptors: Child Behavior, Models, Test Construction, Test Validity

Between Scylla and Charybdis: Reflections on and Problems Associated with the Evaluation of Teachers in an Era of Metrification

Peer reviewed
PDF on ERIC

Download full text

Berliner, David C. – Education Policy Analysis Archives, 2018

The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…

Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Educational and Psychological…	3
Structural Equation Modeling:…	3
ETS Research Report Series	2
Educational Evaluation and…	2
Grantee Submission	2
International Association for…	2
International Journal of…	2
Multivariate Behavioral…	2
Online Submission	2
ProQuest LLC	2
Alberta Journal of…	1
Annual Review of Applied…	1
Applied Psychological…	1
Assessment & Evaluation in…	1
College ESL	1
Decision Sciences Journal of…	1
Education Policy Analysis…	1
Educational Management…	1
Educational Measurement:…	1
Educational Psychology	1
European Journal of…	1
Evaluation and the Health…	1
Health Education & Behavior	1
Informatics in Education	1
Journal of Baltic Science…	1
More ▼

Amrein-Beardsley, Audrey	2
Cason, Gerald J.	2
A. Suparmi	1
Ackerman, Debra J.	1
Aimee Howley	1
Algina, James	1
Antoniou, Panayiotis	1
Arreola, Raoul A.	1
Aydin, Selami	1
Bachor, Dan G.	1
Bang Quan Zheng	1
Bartram, Dave	1
Bauer, Christopher F.	1
Bejar, Isaac I.	1
Ben Shlomo, Shirley	1
Berliner, David C.	1
Betoret, Fernando Domenech	1
Bingsheng Zhang	1
Black, Erik	1
Borgatto, Adriano F.	1
C. Cari	1
Cahan, Sorel	1
Campbell, Heather E.	1
Campbell, Shanyce L.	1
More ▼