ERIC - Search Results

Publication Date

In 2025	23
Since 2024	61
Since 2021 (last 5 years)	110
Since 2016 (last 10 years)	203
Since 2006 (last 20 years)	493

Descriptor

Evaluation Methods	908
Test Reliability	908
Test Validity	609
Student Evaluation	209
Foreign Countries	182
Test Construction	167
Psychometrics	137
Higher Education	108
Measures (Individuals)	98
Measurement Techniques	88
College Students	77
Scores	75
Factor Analysis	72
Elementary Secondary Education	70
Interrater Reliability	68
Questionnaires	68
Adults	60
Correlation	56
Children	55
Rating Scales	55
Student Attitudes	54
Evaluation Criteria	53
Evaluation Research	50
Computer Assisted Testing	48
Disabilities	44
More ▼

Publication Type

Journal Articles	908
Reports - Research	548
Reports - Evaluative	194
Reports - Descriptive	107
Information Analyses	52
Opinion Papers	38
Tests/Questionnaires	31
Guides - Non-Classroom	9
Guides - Classroom - Teacher	7
Reports - General	5
Speeches/Meeting Papers	2
Collected Works - Serial	1
Collected Works - Serials	1
ERIC Publications	1
Guides - General	1
Multilingual/Bilingual…	1
Translations	1
More ▼

Education Level

Higher Education	154
Postsecondary Education	110
Elementary Education	57
Elementary Secondary Education	49
Secondary Education	44
Early Childhood Education	29
Middle Schools	26
High Schools	23
Junior High Schools	18
Preschool Education	15
Adult Education	13
Primary Education	12
Grade 6	9
Grade 1	8
Kindergarten	8
Grade 8	7
Intermediate Grades	7
Grade 4	6
Grade 5	6
Grade 10	5
Grade 2	5
Grade 3	5
Grade 7	5
Grade 12	3
Grade 11	2
More ▼

Audience

Researchers	50
Practitioners	31
Teachers	8
Administrators	4
Counselors	2

Location

United Kingdom	19
Australia	17
Canada	16
China	14
Turkey	14
United States	11
Netherlands	8
Taiwan	7
California	6
Indonesia	6
Texas	6
Iran	5
Israel	5
Spain	5
India	4
United Kingdom (England)	4
Finland	3
France	3
Germany	3
Greece	3
Japan	3
Malaysia	3
Massachusetts	3
Minnesota	3
New York	3
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	2
American Recovery and…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Race to the Top	1

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 908 results Save | Export

Technical Adequacy-Reliability

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2025

The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…

Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement

"LFK" Index Does Not Reliably Detect Small-Study Effects in Meta-Analysis: A Simulation Study

Peer reviewed

Direct link

Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024

The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…

Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods

Evaluation of Maximal Reliability for Multidimensional Measuring Instruments Using Structural Equation Modeling

Peer reviewed

Direct link

Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…

Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability

Psychometric Assessment of the Rett Syndrome Caregiver Assessment of Symptom Severity (RCASS)

Peer reviewed

Direct link

Melissa Raspa; Angela Gwaltney; Carla Bann; Jana von Hehn; Timothy A. Benke; Eric D. Marsh; Sarika U. Peters; Amitha Ananth; Alan K. Percy; Jeffrey L. Neul – Journal of Autism and Developmental Disorders, 2025

Rett syndrome is a severe neurodevelopmental disorder that affects about 1 in 10,000 females. Clinical trials of disease modifying therapies are on the rise, but there are few psychometrically sound caregiver-reported outcome measures available to assess treatment benefit. We report on a new caregiver-reported outcome measure, the Rett Caregiver…

Descriptors: Neurodevelopmental Disorders, Genetic Disorders, Females, Test Validity

Using Simulated Retests to Estimate the Reliability of Diagnostic Assessment Systems

Peer reviewed

Direct link

Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023

As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…

Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy

The Development of Knowledge of Content and Teaching Task Instruments for Pre-Service Mathematics Teacher

Peer reviewed
PDF on ERIC

Download full text

Siti Suprihatiningsih; Masriyah; Rooselyna Ekawati – Journal of Education and Learning (EduLearn), 2025

The knowledge of the materials to be taught to the students is the basic knowledge that preservice mathematics teachers should possess, as they need to prepare themselves for teaching. In order to research preservice teachers' understanding of the subject matter and teaching skils, valid and reliable test instruments are required. Knowledge of…

Descriptors: Preservice Teachers, Pedagogical Content Knowledge, Preservice Teacher Education, Mathematics Teachers

The Proposed Specifiers for Conduct Disorder (PSCD): External Correlates and Incremental Validity over Alternate Psychopathy Measures

Peer reviewed

Direct link

Mojtaba Elhami Athar; Randall T. Salekin; Mahdi Hassanabadi; Parnian Rezaei; Golnoush Fakhr; Elham Zamani – Child & Youth Care Forum, 2025

The Proposed Specifiers for Conduct Disorder (PSCD) assesses psychopathy components of grandiose-manipulative (GM), callous-unemotional (CU), daring-impulsive (DI), and conduct disorder (CD). Research on PSCD is still in its infancy, and further research is necessary to examine its psychometric properties. We investigated the correlations between…

Descriptors: Preadolescents, Adolescents, Psychopathology, Behavior Disorders

Between Two Worlds: Locating Climate Literacy between Modern Educational Frameworks and Assessment Needs

Peer reviewed

Direct link

Dirk Gellermann; Hanno Michel; Ute Harms – Mind, Brain, and Education, 2025

In order for climate literacy assessments to be applicable in large-scale studies, it is essential that they comply with the standards of test administration while maintaining consistency with a comprehensive definition of the concept. In alignment with the different educational frameworks and the Climate Literacy Principles of the U.S. Global…

Descriptors: Climate, Environmental Education, Literacy, Measures (Individuals)

A Tutorial on Aggregating Evidence from Conceptual Replication Studies Using the Product Bayes Factor

Peer reviewed

Direct link

Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024

The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random effects meta-analysis fall short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…

Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size

Reliability Generalization Meta-Analysis of Seven Wisdom Self-Rating Scales from 2004 to 2023

Peer reviewed

Direct link

Hongyi Lin; Fengyan Wang – Journal of Psychoeducational Assessment, 2024

Accurate measurement of wisdom is the cornerstone of wisdom research. To provide a representative reference for the reliability level and moderating factors of various wisdom self-rating scales, we carried out a reliability generalization meta-analysis of Chinese and English references retrieved from 2004 to 2023. A total of 149 articles were…

Descriptors: Thinking Skills, Intelligence, Cognitive Psychology, Cognitive Measurement

How Valid and Reliable Are Teachers' Assessments of Gifted Students?

Peer reviewed
PDF on ERIC

Download full text

Sümeyye Arkan; Sema Tan – International Journal of Assessment Tools in Education, 2025

Teachers' perceptions, attitudes, and opinions about students, curricula, or evaluation methods contribute to the development of students' talents. Thus, researchers often collect data from teachers to identify gifted students, determine educational practices to meet the students' needs and assess gifted education programs. Researchers often…

Descriptors: Talent Identification, Academically Gifted, Evaluation Methods, Measurement Techniques

Empirical Evaluation of a Differentiated Assessment of Data Structures: The Role of Prerequisite Skills

Peer reviewed
PDF on ERIC

Download full text

Marjahan Begum; Pontus Haglund; Ari Korhonen; Violetta Lonati; Mattia Monga; Filip Strömbäck; Artturi Tilanterä – Informatics in Education, 2024

There can be many reasons why students fail to answer correctly to summative tests in advanced computer science courses: often the cause is a lack of prerequisites or misconceptions about topics presented in previous courses. One of the ITiCSE 2020 working groups investigated the possibility of designing assessments suitable for differentiating…

Descriptors: Foreign Countries, College Students, Prerequisites, Computer Science Education

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Direct link

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

How Reliable and Valid Are the Evaluations of Digital Competence in Higher Education: A Systematic Mapping Study

Peer reviewed

Direct link

Saltos-Rivas, Rafael; Novoa-Hernández, Pavel; Serrano Rodríguez, Rocío – SAGE Open, 2022

Evaluating digital competencies has become a topic of growing interest in recent years. Although several reviews and studies have summarized the main elements of progress and shortcomings in this area, some issues are yet to be explored. Very little information is available about the ways of ensuring the validity and reliability of the instrument…

Descriptors: Test Reliability, Test Validity, Evaluation Methods, Technological Literacy

Which Scale Short Form Development Method Is Better? A Comparison of ACO, TS, and SCOFA

Peer reviewed
PDF on ERIC

Download full text

Kogar, Hakan – International Journal of Assessment Tools in Education, 2022

The purpose of this study is to identify which scale short-form development method produces better findings in different factor structures. A simulation study was designed based on this purpose. Three different factor structures and three simulation conditions were selected. As the findings of this simulation study, the model-data fit and…

Descriptors: Test Construction, Measures (Individuals), Factor Structure, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 61

Journal of Autism and…	21
Educational and Psychological…	20
Measurement and Evaluation in…	13
Assessment & Evaluation in…	12
Research in Developmental…	12
Diagnostique	11
Journal of Psychoeducational…	11
American Journal on Mental…	9
Child Abuse & Neglect: The…	9
Journal of Educational…	9
Psychological Assessment	9
ETS Research Report Series	8
Assessment for Effective…	7
Journal of Chemical Education	7
Psychology in the Schools	7
Research on Social Work…	7
Academic Medicine	6
Assessment	6
Assessment and Evaluation in…	6
Behavioral Disorders	6
Evaluation Review	6
Journal of Applied Research…	6
Journal of Visual Impairment…	6
Mental Retardation	6
Research in Developmental…	6
More ▼

Epstein, Michael H.	6
Matson, Johnny L.	6
Amrein-Beardsley, Audrey	4
Erford, Bradley T.	4
Deno, Stanley L.	3
Lembke, Erica S.	3
Tindal, Gerald	3
Abedi, Jamal	2
Baglio, Christopher S.	2
Bagnato, Stephen J.	2
Baker, Eva L.	2
Bardhoshi, Gerta	2
Barthelemy, C.	2
Boisjoli, Jessica A.	2
Boyle, Michael H.	2
Bretz, Stacey Lowery	2
Bricker, Diane D.	2
Bullis, Michael	2
Charter, Richard A.	2
Christ, Theodore J.	2
Cullinan, Douglas	2
Cunningham, Charles E.	2
Davis, Cheryl	2
Elliott, Stephen N.	2
More ▼

Wechsler Intelligence Scale…	6
Child Behavior Checklist	5
Aberrant Behavior Checklist	4
Minnesota Multiphasic…	4
Woodcock Johnson Tests of…	4
Bayley Scales of Infant…	3
Beck Anxiety Inventory	3
MacArthur Communicative…	3
Program for International…	3
Teacher Performance…	3
Advanced Placement…	2
Autism Diagnostic Observation…	2
Beck Depression Inventory	2
Brief Symptom Inventory	2
Child Abuse Potential…	2
Clinical Evaluation of…	2
Computer Attitude Scale	2
Conners Rating Scales	2
Diagnostic Assessment for the…	2
Graduate Management Admission…	2
Hamilton Rating Scale for…	2
Peabody Picture Vocabulary…	2
SAT (College Admission Test)	2
Self Directed Learning…	2
Teacher Rating Scale	2
More ▼