ERIC - Search Results

Publication Date

In 2025	23
Since 2024	72
Since 2021 (last 5 years)	137
Since 2016 (last 10 years)	249
Since 2006 (last 20 years)	586

Descriptor

Evaluation Methods	1402
Test Reliability	1402
Test Validity	948
Student Evaluation	337
Test Construction	301
Foreign Countries	215
Higher Education	183
Measurement Techniques	170
Psychometrics	168
Elementary Secondary Education	146
Evaluation Criteria	122
Measures (Individuals)	117
Questionnaires	102
Scores	100
College Students	99
Rating Scales	92
Interrater Reliability	90
Testing	90
Factor Analysis	87
Statistical Analysis	87
Program Evaluation	83
Correlation	81
Academic Achievement	79
Standardized Tests	78
Student Attitudes	78

Education Level

Higher Education	173
Postsecondary Education	121
Elementary Secondary Education	81
Elementary Education	75
Secondary Education	55
Early Childhood Education	41
Middle Schools	32
High Schools	30
Junior High Schools	21
Preschool Education	17
Primary Education	16
Adult Education	15
Grade 6	12
Grade 5	10
Grade 8	10
Kindergarten	10
Grade 1	9
Intermediate Grades	9
Grade 3	8
Grade 10	7
Grade 4	7
Grade 7	6
Grade 2	5
Grade 9	4
Grade 12	3

Audience

Researchers	74
Practitioners	72
Teachers	29
Administrators	18
Policymakers	11
Students	4
Counselors	3
Support Staff	3
Community	1
Parents	1

Location

Australia	24
United Kingdom	22
Canada	18
Turkey	16
China	14
United States	14
California	11
Netherlands	10
Florida	9
Texas	8
United Kingdom (England)	8
Israel	7
Taiwan	7
Germany	6
Indonesia	6
Pennsylvania	6
Spain	6
Illinois	5
Iran	5
Japan	5
Minnesota	5
Greece	4
India	4
Kansas	4
Malaysia	4

Laws, Policies, & Programs

Every Student Succeeds Act…	6
Individuals with Disabilities…	5
Elementary and Secondary…	4
No Child Left Behind Act 2001	4
Rehabilitation Act 1973…	3
Elementary and Secondary…	2
American Recovery and…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Race to the Top	1
Womens Educational Equity Act	1

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 1,402 results

Technical Adequacy-Reliability

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2025

The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…

Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management

Peer reviewed

Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023

Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers with a roadmap for gathering evidence to improve the quality of open-ended assessments. Based on statistical…

Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education

Validity and Reliability of Child-Friendly School Policy Evaluation Instruments in Primary Schools: Confirmatory Factor Analysis

Peer reviewed
PDF on ERIC

Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024

Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to prove the reliability and validity of CFS policy evaluation instruments in elementary schools in different locations. This investigation uses the Context Input Process Product (CIPP)…

Descriptors: Validity, Reliability, School Policy, Program Evaluation

Design of a Simple Rubric to Peer-Evaluate the Teamwork Skills of Engineering Students

Peer reviewed

Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024

Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…

Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork

The Value of Expanding Perspectives on Assessment

Peer reviewed

Janice Kinghorn; Katherine McGuire; Bethany L. Miller; Aaron Zimmerman – Assessment Update, 2024

In this article, the authors share their reflections on how different experiences and paradigms have broadened their understanding of the work of assessment in higher education. As they collaborated to create a panel for the 2024 International Conference on Assessing Quality in Higher Education, they recognized that they, as assessment…

Descriptors: Higher Education, Assessment Literacy, Evaluation Criteria, Evaluation Methods

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

Quantifying Multimodality: The Validity and Reliability of the QEMT and QEMR

Paul Alexander Siegel – ProQuest LLC, 2024

While multimodality and multiliteracies have been concepts for 25 years (Kalantzis & Cope, 2023; The New London Group, 1996), research on and application of the concepts within text complexity measures has been limited. Attempts to assess multiliteracies and multimodality (Jacobs, 2013; Schmerbeck & Lucht, 2017; Wyatt-Smith & Kimber,…

Descriptors: Multiple Literacies, Learning Modalities, Test Validity, Test Reliability

"LFK" Index Does Not Reliably Detect Small-Study Effects in Meta-Analysis: A Simulation Study

Peer reviewed

Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024

The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…

Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods

Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021

Peer reviewed

Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023

Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…

Descriptors: Chemistry, Periodicals, Journal Articles, Science Education

A Unified Approach to Estimating the Intraclass Correlation Coefficient and Its Bias: An Exploratory Study

Kelvin Terrell Pompey – ProQuest LLC, 2021

Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…

Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation

Evaluation of Maximal Reliability for Multidimensional Measuring Instruments Using Structural Equation Modeling

Peer reviewed

Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…

Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability

Psychometric Assessment of the Rett Syndrome Caregiver Assessment of Symptom Severity (RCASS)

Peer reviewed

Melissa Raspa; Angela Gwaltney; Carla Bann; Jana von Hehn; Timothy A. Benke; Eric D. Marsh; Sarika U. Peters; Amitha Ananth; Alan K. Percy; Jeffrey L. Neul – Journal of Autism and Developmental Disorders, 2025

Rett syndrome is a severe neurodevelopmental disorder that affects about 1 in 10,000 females. Clinical trials of disease modifying therapies are on the rise, but there are few psychometrically sound caregiver-reported outcome measures available to assess treatment benefit. We report on a new caregiver-reported outcome measure, the Rett Caregiver…

Descriptors: Neurodevelopmental Disorders, Genetic Disorders, Females, Test Validity

Using Simulated Retests to Estimate the Reliability of Diagnostic Assessment Systems

Peer reviewed

Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023

As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…

Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy

Source

Educational and Psychological…	23
Journal of Autism and…	21
ProQuest LLC	21
Measurement and Evaluation in…	13
Online Submission	13
Assessment & Evaluation in…	12
Research in Developmental…	12
Diagnostique	11
Journal of Psychoeducational…	11
Journal of Educational…	10
American Journal on Mental…	9
Child Abuse & Neglect: The…	9
Grantee Submission	9
Psychological Assessment	9
ETS Research Report Series	8
Assessment for Effective…	7
Journal of Chemical Education	7
Psychology in the Schools	7
Reading Teacher	7
Research on Social Work…	7
Academic Medicine	6
Assessment	6
Assessment and Evaluation in…	6
Behavioral Disorders	6
Evaluation Review	6

Author

Epstein, Michael H.	7
Matson, Johnny L.	6
Amrein-Beardsley, Audrey	4
Erford, Bradley T.	4
Gill, Brian	4
Booker, Kevin	3
Brown, James Dean	3
Capie, William	3
Deno, Stanley L.	3
Elliott, Stephen N.	3
Feldt, Leonard S.	3
Fitz-Gibbon, Carol Taylor	3
Fuchs, Lynn S.	3
Halle, Tamara	3
Horner, Robert H.	3
Lembke, Erica S.	3
Morris, Lynn Lyons	3
Thomson, Peter	3
Tindal, Gerald	3
Abedi, Jamal	2
Algozzine, Bob	2
Algozzine, Kate	2
Baglio, Christopher S.	2
Bagnato, Stephen J.	2

Publication Type

Journal Articles	908
Reports - Research	715
Reports - Evaluative	247
Reports - Descriptive	150
Speeches/Meeting Papers	118
Information Analyses	77
Tests/Questionnaires	71
Opinion Papers	57
Guides - Non-Classroom	49
Dissertations/Theses -…	21
Guides - Classroom - Teacher	19
Books	16
Numerical/Quantitative Data	13
ERIC Publications	9
Collected Works - Proceedings	8
Reference Materials -…	8
Collected Works - General	6
ERIC Digests in Full Text	6
Reports - General	6
Collected Works - Serials	4
Guides - General	3
Guides - Classroom - Learner	2
Book/Product Reviews	1
Collected Works - Serial	1
Historical Materials	1

Assessments and Surveys

Wechsler Intelligence Scale…	7
Woodcock Johnson Tests of…	6
ACT Assessment	5
Child Behavior Checklist	5
National Assessment of…	5
Program for International…	5
Aberrant Behavior Checklist	4
Bayley Scales of Infant…	4
Minnesota Multiphasic…	4
Peabody Picture Vocabulary…	4
Praxis Series	4
Teacher Performance…	4
Beck Anxiety Inventory	3
Graduate Management Admission…	3
MacArthur Communicative…	3
Self Directed Learning…	3
Stanford Achievement Tests	3
Advanced Placement…	2
Autism Diagnostic Observation…	2
Battelle Developmental…	2
Beck Depression Inventory	2
Behavioral and Emotional…	2
Brief Symptom Inventory	2
Child Abuse Potential…	2
Clinical Evaluation of…	2