Publication Date
In 2025: 4
Since 2024: 8
Since 2021 (last 5 years): 12
Since 2016 (last 10 years): 16
Since 2006 (last 20 years): 22
Descriptor
Error of Measurement: 37
Evaluation Methods: 37
Test Reliability: 37
Test Validity: 12
Student Evaluation: 11
Interrater Reliability: 6
Evaluation Research: 5
Foreign Countries: 5
Measurement Techniques: 5
Test Theory: 5
Testing Problems: 5
Author
Aksu, Gökhan: 1
Amanda Timmerman: 1
Amit Sevak: 1
Audrey Linden: 1
Bakeman, Roger: 1
Bang Quan Zheng: 1
Bardhoshi, Gerta: 1
Bateman, Andrea: 1
Cason, Gerald J.: 1
Castellano, Katherine E.: 1
Catherine J. Madill: 1
Publication Type
Journal Articles: 24
Reports - Research: 20
Reports - Evaluative: 6
Reports - Descriptive: 5
Speeches/Meeting Papers: 4
Dissertations/Theses -…: 2
Information Analyses: 2
Opinion Papers: 2
Education Level
Elementary Secondary Education: 5
Postsecondary Education: 3
Elementary Education: 2
Higher Education: 2
Secondary Education: 2
Adult Education: 1
Grade 3: 1
Grade 4: 1
Grade 5: 1
Junior High Schools: 1
Middle Schools: 1
Assessments and Surveys
National Assessment of…: 1
Program for International…: 1
Stanford Achievement Tests: 1
Susan K. Johnsen – Gifted Child Today, 2025
The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…
Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement
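As a companion to the entry above, here is a minimal sketch (not drawn from the column) of two of the reliability checks it names, internal consistency and test-retest stability; the data, sample size, and variable names below are hypothetical.

```python
# Minimal sketch: Cronbach's alpha (internal consistency) and a test-retest
# correlation on simulated scores. All numbers are hypothetical.
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """items: examinees x items matrix of scores."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_vars / total_var)

rng = np.random.default_rng(0)
true_score = rng.normal(size=(100, 1))
form = true_score + 0.5 * rng.normal(size=(100, 5))    # 5 items, first occasion
retest = (true_score + 0.5 * rng.normal(size=(100, 5))).sum(axis=1)  # second occasion

print("internal consistency (alpha):", round(cronbach_alpha(form), 2))
print("test-retest r:", round(np.corrcoef(form.sum(axis=1), retest)[0, 1], 2))
```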
Bang Quan Zheng; Peter M. Bentler – Structural Equation Modeling: A Multidisciplinary Journal, 2025
This paper aims to advocate for a balanced approach to model fit evaluation in structural equation modeling (SEM). The ongoing debate surrounding chi-square test statistics and fit indices has been characterized by ambiguity and controversy. Despite the acknowledged limitations of relying solely on the chi-square test, its careful application can…
Descriptors: Monte Carlo Methods, Structural Equation Models, Goodness of Fit, Robustness (Statistics)
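For context on the debate this abstract describes, the relations below are the standard textbook definitions of the SEM test statistic and one widely used fit index (RMSEA); they are general definitions, not results from this paper.

```latex
% Standard definitions, not results from this paper: the ML test statistic T,
% referred to a chi-square distribution with df degrees of freedom, and the
% RMSEA fit index derived from it.
T = (N-1)\,F_{\mathrm{ML}}\bigl(S,\Sigma(\hat{\theta})\bigr),
\qquad
\mathrm{RMSEA} = \sqrt{\max\!\left(\frac{T - df}{df\,(N-1)},\,0\right)} .
```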
Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients
Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022
The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…
Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory
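To make the comparison concrete, below is a small sketch of one of the inter-rater agreement coefficients the study sets against generalizability theory (Cohen's kappa); the two raters' category assignments are invented for illustration.

```python
# Minimal sketch: raw agreement rate versus Cohen's kappa for two raters
# assigning the same ten cases to categories 0/1/2. Data are hypothetical.
import numpy as np

rater_a = np.array([1, 1, 0, 2, 2, 1, 0, 0, 2, 1])
rater_b = np.array([1, 0, 0, 2, 1, 1, 0, 0, 2, 1])

observed = np.mean(rater_a == rater_b)            # proportion of exact agreement
cats = np.union1d(rater_a, rater_b)
p_a = np.array([np.mean(rater_a == c) for c in cats])
p_b = np.array([np.mean(rater_b == c) for c in cats])
expected = np.sum(p_a * p_b)                      # agreement expected by chance
kappa = (observed - expected) / (1 - expected)    # Cohen's kappa

print(f"observed agreement={observed:.2f}, kappa={kappa:.2f}")
```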
Amanda Timmerman; Vasiliki Totsika; Valerie Lye; Laura Crane; Audrey Linden; Elizabeth Pellicano – Autism: The International Journal of Research and Practice, 2025
Autistic people are more likely to have co-occurring mental health conditions compared to the general population, and mental health interventions have been identified as a top research priority by autistic people and the wider autism community. Autistic adults have also communicated that quality of life is the outcome that matters most to them in…
Descriptors: Adults, Autism Spectrum Disorders, Quality of Life, Randomized Controlled Trials
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards
Regional Educational Laboratory Mid-Atlantic, 2024
These are the appendixes for the report, "Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error." This study applied a stabilization model called Bayesian hierarchical modeling to group-level data (with groups assigned according to demographic designations) within schools in New Jersey with the aim…
Descriptors: Institutional Evaluation, Elementary Secondary Education, Bayesian Statistics, Test Reliability
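The stabilization idea behind this study can be illustrated with a simple empirical-Bayes-style shrinkage sketch; this is only an analogy to the report's Bayesian hierarchical model, and every number and function name below is hypothetical.

```python
# Minimal shrinkage sketch: a noisy group proportion is pulled toward the
# overall mean in proportion to its reliability. Not the report's model.
def stabilize(group_mean: float, n: int, overall_mean: float,
              between_var: float, within_var: float) -> float:
    """Weight the noisy group mean toward the overall mean by its reliability."""
    reliability = between_var / (between_var + within_var / n)
    return reliability * group_mean + (1 - reliability) * overall_mean

# A small subgroup (n=15) is pulled strongly toward the state mean;
# a large one (n=400) barely moves.
print(stabilize(0.42, 15, 0.55, between_var=0.01, within_var=0.24))
print(stabilize(0.42, 400, 0.55, between_var=0.01, within_var=0.24))
```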
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2023
Background: This paper develops a new method to estimate quasi-experimental evaluation models when it is necessary to control for measurement error in predictors and individual assignment to the treatment group is based on these same fallible variables. A major methodological finding of the study is that standard methods of estimating models that…
Descriptors: Error of Measurement, Measurement Techniques, Elementary Secondary Education, Report Cards
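The measurement-error problem this abstract refers to is the classical errors-in-variables attenuation; the relation below is a textbook result, not the paper's proposed estimator.

```latex
% Textbook errors-in-variables attenuation: regressing the outcome on a
% fallible predictor x = X + e shrinks the slope toward zero by the
% predictor's reliability \lambda. Not the paper's new method.
\operatorname{plim}\hat{\beta} = \lambda\beta,
\qquad \lambda = \frac{\sigma_X^{2}}{\sigma_X^{2} + \sigma_e^{2}},
\qquad \text{so a consistent correction is } \hat{\beta}/\lambda .
```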
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
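For readers unfamiliar with the setup, this is the standard single-mediator decomposition that path analysis with composite scores estimates; it is textbook notation, not the authors' derivation.

```latex
% Standard single-mediator model: X affects Y directly (c') and indirectly
% through M (path a, then path b).
M = aX + e_M, \qquad Y = c'X + bM + e_Y,
\qquad \text{indirect effect} = ab, \qquad \text{total effect} = c' + ab .
```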
Christopher L. Payten; Kelly A. Weir; Catherine J. Madill – International Journal of Language & Communication Disorders, 2024
Background: Published best-practice guidelines and standardized protocols for voice assessment recommend multidisciplinary evaluation utilizing a comprehensive range of clinical measures. Previous studies report variations in assessment practices when compared with these guidelines. Aims: To provide an up-to-date evaluation of current global…
Descriptors: Voice Disorders, Speech Language Pathology, Allied Health Personnel, Auditory Tests
Wenjing Guo – ProQuest LLC, 2021
Constructed response (CR) items are widely used in large-scale testing programs, including the National Assessment of Educational Progress (NAEP) and many district and state-level assessments in the United States. One unique feature of CR items is that they depend on human raters to assess the quality of examinees' work. The judgment of human…
Descriptors: National Competency Tests, Responses, Interrater Reliability, Error of Measurement
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias
Gulsah Gurkan – ProQuest LLC, 2021
Secondary analyses of international large-scale assessments (ILSA) commonly characterize relationships between variables of interest using correlations. However, the accuracy of correlation estimates is impaired by artefacts such as measurement error and clustering. Despite advancements in methodology, conventional correlation estimates or…
Descriptors: Secondary School Students, Achievement Tests, International Assessment, Foreign Countries
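The measurement-error artefact mentioned in this abstract is usually described with Spearman's classical disattenuation formula; the relation below is that standard correction and is not taken from the dissertation itself.

```latex
% Spearman's correction for attenuation (standard formula): the observed-score
% correlation understates the true-score correlation by the square root of
% the two reliabilities.
r_{xy} = \rho_{T_x T_y}\sqrt{\rho_{xx'}\,\rho_{yy'}}
\quad\Longrightarrow\quad
\hat{\rho}_{T_x T_y} = \frac{r_{xy}}{\sqrt{\rho_{xx'}\,\rho_{yy'}}} .
```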
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
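Two of the estimators listed in the entry above have compact closed forms; these are the standard textbook expressions, not formulas reproduced from the article.

```latex
% Standard forms: KR-20 for k dichotomous items with difficulties p_i
% (q_i = 1 - p_i) and total-score variance \sigma_X^2, and the Spearman-Brown
% correction applied to a split-half correlation r_{hh}.
\mathrm{KR\text{-}20} = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k} p_i q_i}{\sigma_X^{2}}\right),
\qquad
r_{\mathrm{SB}} = \frac{2\,r_{hh}}{1 + r_{hh}} .
```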
Dwyer, Andrew C. – Journal of Educational Measurement, 2016
This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…
Descriptors: Cutting Scores, Equivalency Tests, Test Format, Academic Standards
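For readers unfamiliar with common-item equating, here is a minimal mean-sigma linear equating sketch; the numbers are invented, and the study's comparison of resetting versus rescaling the standard goes well beyond this simplification.

```python
# Minimal mean-sigma linear equating sketch on a common-item set (hypothetical
# numbers). "Rescaling the standard" applies this kind of transformation to
# standard-setting ratings rather than to examinee scores.
import statistics as st

old_common = [12.1, 14.5, 9.8, 16.0, 11.3]   # common-item means on the old form's scale
new_common = [11.0, 13.9, 8.7, 15.2, 10.1]   # the same items as observed with the new form

slope = st.stdev(old_common) / st.stdev(new_common)
intercept = st.mean(old_common) - slope * st.mean(new_common)

new_form_cut = 12.0                              # cut score expressed on the new form
equated_cut = slope * new_form_cut + intercept   # the same standard on the old form's scale
print(round(equated_cut, 2))
```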
Castellano, Katherine E.; McCaffrey, Daniel F. – Educational Measurement: Issues and Practice, 2017
Mean or median student growth percentiles (MGPs) are a popular measure of educator performance, but they lack rigorous evaluation. This study investigates the error in MGP due to test score measurement error (ME). Using analytic derivations, we find that errors in the commonly used MGP are correlated with average prior latent achievement: Teachers…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Value Added Models, Achievement Gains