ERIC - Search Results

Publication Date

In 2025	21
Since 2024	58
Since 2021 (last 5 years)	105
Since 2016 (last 10 years)	190
Since 2006 (last 20 years)	362

Descriptor

Evaluation Methods	716
Test Reliability	716
Test Validity	476
Student Evaluation	156
Foreign Countries	155
Test Construction	147
Higher Education	92
Psychometrics	86
Measures (Individuals)	69
Measurement Techniques	68
Factor Analysis	66
College Students	61
Questionnaires	60
Scores	57
Evaluation Criteria	55
Elementary Secondary Education	54
Student Attitudes	54
Interrater Reliability	52
Correlation	47
Rating Scales	46
Statistical Analysis	43
Teacher Attitudes	43
Adults	41
Children	40
Computer Assisted Testing	39

Publication Type

Reports - Research	716
Journal Articles	549
Speeches/Meeting Papers	64
Tests/Questionnaires	43
Information Analyses	17
Numerical/Quantitative Data	11
Guides - Non-Classroom	6
Reports - Evaluative	6
Reports - Descriptive	4
Opinion Papers	3
Reports - General	2
Collected Works - General	1
Collected Works - Proceedings	1
Multilingual/Bilingual…	1

Education Level

Higher Education	115
Postsecondary Education	89
Elementary Education	56
Secondary Education	42
Elementary Secondary Education	36
Early Childhood Education	28
Middle Schools	27
High Schools	21
Junior High Schools	18
Primary Education	12
Grade 6	11
Preschool Education	11
Grade 1	8
Intermediate Grades	8
Kindergarten	8
Grade 5	7
Grade 8	7
Grade 10	5
Grade 2	5
Grade 7	5
Adult Education	4
Grade 3	4
Grade 12	3
Grade 4	3
Grade 9	3

Audience

Researchers	39
Practitioners	15
Administrators	6
Teachers	4
Counselors	2
Policymakers	2

Location

Australia	18
Canada	15
United Kingdom	13
China	12
Turkey	12
United States	10
Netherlands	7
California	6
Indonesia	6
Israel	6
Taiwan	6
Texas	6
Germany	5
Iran	5
Spain	5
Florida	4
Arizona	3
Finland	3
Greece	3
Illinois	3
India	3
Kansas	3
North Carolina	3
Ohio	3
South Korea	3

Laws, Policies, & Programs

Elementary and Secondary…	2
American Recovery and…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1
Race to the Top	1

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 716 results

"LFK" Index Does Not Reliably Detect Small-Study Effects in Meta-Analysis: A Simulation Study

Peer reviewed

Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024

The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…

Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods

Psychometric Assessment of the Rett Syndrome Caregiver Assessment of Symptom Severity (RCASS)

Peer reviewed

Melissa Raspa; Angela Gwaltney; Carla Bann; Jana von Hehn; Timothy A. Benke; Eric D. Marsh; Sarika U. Peters; Amitha Ananth; Alan K. Percy; Jeffrey L. Neul – Journal of Autism and Developmental Disorders, 2025

Rett syndrome is a severe neurodevelopmental disorder that affects about 1 in 10,000 females. Clinical trials of disease modifying therapies are on the rise, but there are few psychometrically sound caregiver-reported outcome measures available to assess treatment benefit. We report on a new caregiver-reported outcome measure, the Rett Caregiver…

Descriptors: Neurodevelopmental Disorders, Genetic Disorders, Females, Test Validity

Using Simulated Retests to Estimate the Reliability of Diagnostic Assessment Systems

Peer reviewed

Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023

As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…

Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy

The Development of Knowledge of Content and Teaching Task Instruments for Pre-Service Mathematics Teacher

Peer reviewed
PDF on ERIC

Siti Suprihatiningsih; Masriyah; Rooselyna Ekawati – Journal of Education and Learning (EduLearn), 2025

The knowledge of the materials to be taught to the students is the basic knowledge that preservice mathematics teachers should possess, as they need to prepare themselves for teaching. In order to research preservice teachers' understanding of the subject matter and teaching skills, valid and reliable test instruments are required. Knowledge of…

Descriptors: Preservice Teachers, Pedagogical Content Knowledge, Preservice Teacher Education, Mathematics Teachers

The Proposed Specifiers for Conduct Disorder (PSCD): External Correlates and Incremental Validity over Alternate Psychopathy Measures

Peer reviewed

Mojtaba Elhami Athar; Randall T. Salekin; Mahdi Hassanabadi; Parnian Rezaei; Golnoush Fakhr; Elham Zamani – Child & Youth Care Forum, 2025

The Proposed Specifiers for Conduct Disorder (PSCD) assesses psychopathy components of grandiose-manipulative (GM), callous-unemotional (CU), daring-impulsive (DI), and conduct disorder (CD). Research on PSCD is still in its infancy, and further research is necessary to examine its psychometric properties. We investigated the correlations between…

Descriptors: Preadolescents, Adolescents, Psychopathology, Behavior Disorders

Between Two Worlds: Locating Climate Literacy between Modern Educational Frameworks and Assessment Needs

Peer reviewed

Dirk Gellermann; Hanno Michel; Ute Harms – Mind, Brain, and Education, 2025

In order for climate literacy assessments to be applicable in large-scale studies, it is essential that they comply with the standards of test administration while maintaining consistency with a comprehensive definition of the concept. In alignment with the different educational frameworks and the Climate Literacy Principles of the U.S. Global…

Descriptors: Climate, Environmental Education, Literacy, Measures (Individuals)

Estimating the Reliability of Skill Transitions in Longitudinal Diagnostic Classification Models

Peer reviewed

Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024

Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…

Descriptors: Diagnostic Tests, Classification, Models, Psychometrics

Empirical Evaluation of a Differentiated Assessment of Data Structures: The Role of Prerequisite Skills

Peer reviewed
PDF on ERIC

Marjahan Begum; Pontus Haglund; Ari Korhonen; Violetta Lonati; Mattia Monga; Filip Strömbäck; Artturi Tilanterä – Informatics in Education, 2024

There can be many reasons why students fail to answer correctly to summative tests in advanced computer science courses: often the cause is a lack of prerequisites or misconceptions about topics presented in previous courses. One of the ITiCSE 2020 working groups investigated the possibility of designing assessments suitable for differentiating…

Descriptors: Foreign Countries, College Students, Prerequisites, Computer Science Education

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Which Scale Short Form Development Method Is Better? A Comparison of ACO, TS, and SCOFA

Peer reviewed
PDF on ERIC

Kogar, Hakan – International Journal of Assessment Tools in Education, 2022

The purpose of this study is to identify which scale short-form development method produces better findings in different factor structures. A simulation study was designed based on this purpose. Three different factor structures and three simulation conditions were selected. As the findings of this simulation study, the model-data fit and…

Descriptors: Test Construction, Measures (Individuals), Factor Structure, Test Reliability

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Factors to Assess Teacher Design Knowledge Competencies: Data Literacies Practice, Design Practice, and Distributed Epistemic Practice (3Ds)

Peer reviewed

Kim, Mi Song – International Journal of Technology and Design Education, 2022

Teacher design work has gained increasing attention by re-conceptualizing teachers as designers rather than curriculum deliverers. However, assessing teacher design work can be challenging given that there are very few research tools to assess teacher design knowledge (TDK) competencies. To fill that gap, this study proposes a survey that assesses…

Descriptors: Design, Teacher Characteristics, Teacher Competencies, Teacher Evaluation

The Motivation to Teach Computer Science (MTCS) Scale: Development, Validation, and Implications for Use

Peer reviewed

Nicole D. Martin; Stephanie N. Baker; Madeline Haynes; Jayce R. Warner – Computer Science Education, 2024

Background and Context: As computer science (CS) education expands and the need for well-prepared CS teachers grows, understanding what motivates teachers to teach CS can help address challenges to recruiting, preparing, and retaining teachers. Objective: The goal of this work was to develop and validate a scale that measures teachers' motivation…

Descriptors: Computer Science Education, Teacher Motivation, Measurement Techniques, Construct Validity

Domain-General Auditory Processing as a Conceptual and Measurement Framework for Second Language Speech Learning Aptitude: A Test-Retest Reliability Study

Peer reviewed

Kazuya Saito; Adam Tierney – Studies in Second Language Acquisition, 2024

This article proposes a conceptual and measurement framework for postpubertal, L2 speech learning aptitude that is centered around domain-general auditory processing (i.e., representing spectral and temporal characteristics of sounds). To this end, we examine the construct and reliability of a battery of auditory processing tests by presenting the…

Descriptors: Second Language Learning, Auditory Tests, Auditory Perception, Listening Comprehension Tests

Enhancing Model Fit Evaluation in SEM: Practical Tips for Optimizing Chi-Square Tests

Peer reviewed

Bang Quan Zheng; Peter M. Bentler – Structural Equation Modeling: A Multidisciplinary Journal, 2025

This paper aims to advocate for a balanced approach to model fit evaluation in structural equation modeling (SEM). The ongoing debate surrounding chi-square test statistics and fit indices has been characterized by ambiguity and controversy. Despite the acknowledged limitations of relying solely on the chi-square test, its careful application can…

Descriptors: Monte Carlo Methods, Structural Equation Models, Goodness of Fit, Robustness (Statistics)

Source

Journal of Autism and…	14
Educational and Psychological…	11
Research in Developmental…	9
Assessment & Evaluation in…	8
Grantee Submission	8
Measurement and Evaluation in…	8
American Journal on Mental…	7
Diagnostique	7
ETS Research Report Series	7
Journal of Educational…	7
Journal of Psychoeducational…	7
Online Submission	7
Research on Social Work…	6
Child Abuse & Neglect: The…	5
Evaluation Review	5
Journal of Communication…	5
Psychology in the Schools	5
Assessment	4
Behavioral Disorders	4
Educational Evaluation and…	4
Evaluation and the Health…	4
Gerontologist	4
International Journal of…	4
Journal of Chemical Education	4
Journal of Consulting and…	4

Author

Epstein, Michael H.	4
Matson, Johnny L.	4
Lembke, Erica S.	3
Algozzine, Bob	2
Algozzine, Kate	2
Baglio, Christopher S.	2
Boyle, Michael H.	2
Bretz, Stacey Lowery	2
Bullis, Michael	2
Capie, William	2
Cason, Gerald J.	2
Christ, Theodore J.	2
Crawford, Angela R.	2
Cunningham, Charles E.	2
Cusumano, Dale	2
Davis, Cheryl	2
Erford, Bradley T.	2
Erica S. Lembke	2
Floyd, Randy G.	2
Gearhart, Maryl	2
Horner, Robert H.	2
Johnson, Evelyn S.	2
Kane, Thomas J.	2
Kartowagiran, Badrun	2

Assessments and Surveys

Child Behavior Checklist	5
Aberrant Behavior Checklist	4
Wechsler Intelligence Scale…	4
Woodcock Johnson Tests of…	4
Beck Anxiety Inventory	3
Praxis Series	3
Program for International…	3
ACT Assessment	2
Bayley Scales of Infant…	2
Brief Symptom Inventory	2
Child Abuse Potential…	2
Computer Attitude Scale	2
Diagnostic Assessment for the…	2
Graduate Record Examinations	2
Hamilton Rating Scale for…	2
Minnesota Multiphasic…	2
Motivated Strategies for…	2
SAT (College Admission Test)	2
Trends in International…	2
Academic Motivation Scale	1
Adaptive Behavior Scale	1
Adjective Check List	1
Adjustment Scales for…	1
Autism Diagnostic Observation…	1
Beck Depression Inventory	1