ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	27

Descriptor

Robustness (Statistics)	29
Test Reliability	29
Test Validity	29
Item Analysis	8
Evaluation Methods	7
Evaluation Problems	7
Evaluation Research	6
Foreign Countries	6
Standardized Tests	6
Student Evaluation	6
Evaluation Criteria	5
Measures (Individuals)	5
Achievement Gains	4
Achievement Rating	4
Factor Analysis	4
Predictor Variables	4
Research Methodology	4
Correlation	3
Educational Assessment	3
Educational Indicators	3
Educational Research	3
Error of Measurement	3
Factor Structure	3
Item Response Theory	3
Measurement Techniques	3
More ▼

Publication Type

Journal Articles	25
Reports - Research	15
Reports - Evaluative	10
Reports - Descriptive	3
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1
Tests/Questionnaires	1

Education Level

Higher Education	11
Elementary Secondary Education	7
Postsecondary Education	5
Secondary Education	3
Elementary Education	1
Grade 10	1
Grade 3	1
Grade 4	1
Grade 7	1
High Schools	1

Audience

Location

Taiwan	2
Australia	1
California	1
Florida	1
Portugal	1
Tennessee	1
Texas	1
United Kingdom	1
United Kingdom (Leeds)	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Autism Diagnostic Observation…	1
Florida Comprehensive…	1
Program for International…	1
Work Values Inventory	1

What Works Clearinghouse Rating

Showing 1 to 15 of 29 results Save | Export

Are the Signs of Factor Loadings Arbitrary in Confirmatory Factor Analysis? Problems and Solutions

Peer reviewed

Direct link

Dandan Tang; Steven M. Boker; Xin Tong – Structural Equation Modeling: A Multidisciplinary Journal, 2025

The replication crisis in social and behavioral sciences has raised concerns about the reliability and validity of empirical studies. While research in the literature has explored contributing factors to this crisis, the issues related to analytical tools have received less attention. This study focuses on a widely used analytical tool -…

Descriptors: Test Validity, Factor Analysis, Replication (Evaluation), Social Science Research

Is It Actually Reliable? Examining Statistical Methods for Inter-Rater Reliability of a Rubric in Graduate Education

Peer reviewed
PDF on ERIC

Download full text

Brent J. Goertzen; Kaley Klaus – Research & Practice in Assessment, 2023

When evaluating student learning, educators often employ scoring rubrics, for which quality can be determined through evaluating validity and reliability. This article discusses the norming process utilized in a graduate organizational leadership program for a capstone scoring rubric. Concepts of validity and reliability are discussed, as is the…

Descriptors: Graduate Students, Graduate Study, Graduate School Faculty, Scoring Rubrics

The Development and Validation of the Digital Literacy Questionnaire and the Evaluation of Students' Digital Literacy

Peer reviewed

Direct link

Chu-Yang Chang; Hsu-Chan Kuo – Education and Information Technologies, 2025

The rapid advancement of educational technologies in recent decades has underscored the increasing importance of digital literacy (DL) as a core competency for all students, as recognised in various educational policies and programs. Evaluating students' DL is crucial for providing valuable insights to guide future educational initiatives. This…

Descriptors: Digital Literacy, Questionnaires, Test Construction, Test Validity

Identifying Dynamic Shifts to Careless and Insufficient Effort Behavior in Questionnaire Responses; a Novel Approach and Experimental Validation

Peer reviewed

Direct link

Zachary J. Roman; Patrick Schmidt; Jason M. Miller; Holger Brandt – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Careless and insufficient effort responding (C/IER) is a situation where participants respond to survey instruments without considering the item content. This phenomena adds noise to data leading to erroneous inference. There are multiple approaches to identifying and accounting for C/IER in survey settings, of these approaches the best performing…

Descriptors: Structural Equation Models, Bayesian Statistics, Response Style (Tests), Robustness (Statistics)

Psychological Well-Being, Resilience, Self-Determination and Grit: The 'Novelty' Role in Physical Education Classes

Peer reviewed

Direct link

Ruben Trigueros; Alejandro García-Mas – British Journal of Educational Psychology, 2025

Introduction: In recent years, the incorporation of novelty as a psychological need and the study of the frustration of needs have become a recurring theme in the research on psychological needs in the educational environment. Currently, there are two scales available to assess the frustration of basic psychological needs (FBN) in the context of…

Descriptors: Psychological Patterns, Well Being, Resilience (Psychology), Self Determination

Identifying Attrition Risk Based on the First Year Experience

Peer reviewed

Direct link

Naylor, Ryan; Baik, Chi; Arkoudis, Sophia – Higher Education Research and Development, 2018

Using data collected from a recent national survey of Australian first-year students, this paper defines and validates four scales--belonging, feeling supported, intellectual engagement and workload stress--to measure the student experience of university. These scales provide insights into the university experience for both groups and individual…

Descriptors: Student Attrition, At Risk Students, College Freshmen, National Surveys

Hybrid Computerized Adaptive Testing: From Group Sequential Design to Fully Sequential Design

Peer reviewed

Direct link

Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016

Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strength and weakness in recent large-scale implementations, there is no simple answer to the question of which design is better because different…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach

The Screening Accuracy of the Parent and Teacher-Reported Social Responsiveness Scale (SRS): Comparison with the 3Di and ADOS

Peer reviewed

Direct link

Duvekot, Jorieke; van der Ende, Jan; Verhulst, Frank C.; Greaves-Lord, Kirstin – Journal of Autism and Developmental Disorders, 2015

The screening accuracy of the parent and teacher-reported Social Responsiveness Scale (SRS) was compared with an autism spectrum disorder (ASD) classification according to (1) the Developmental, Dimensional, and Diagnostic Interview (3Di), (2) the Autism Diagnostic Observation Schedule (ADOS), (3) both the 3Di and ADOS, in 186 children referred to…

Descriptors: Accuracy, Screening Tests, Parent Teacher Cooperation, Pervasive Developmental Disorders

Evaluating Teacher Preparation Using Graduates' Observational Ratings

Peer reviewed

Direct link

Ronfeldt, Matthew; Campbell, Shanyce L. – Educational Evaluation and Policy Analysis, 2016

Despite growing calls for more accountability of teacher education programs (TEPs), there is little consensus about how to evaluate them. This study investigates the potential for using observational ratings of program completers to evaluate TEPs. Drawing on statewide data on almost 9,500 program completers, representing 44 providers (183…

Descriptors: Teacher Education Programs, Program Effectiveness, Program Evaluation, Observation

Correcting for Sample Problems in PISA and the Improvement in Portuguese Students' Performance

Peer reviewed

Direct link

Freitas, Pedro; Nunes, Luís Catela; Balcão Reis, Ana; Seabra, Carmo; Ferro, Adriana – Assessment in Education: Principles, Policy & Practice, 2016

The results of large-scale international assessments such as Programme for International Student Assessment (PISA) have attracted a considerable attention worldwide and are often used by policy-makers to support educational policies. To ensure that the published results represent the actual population, these surveys go through a thorough scrutiny…

Descriptors: International Assessment, Student Characteristics, Weighted Scores, Evaluation Problems

Stability of Scores on Super's Work Values Inventory-Revised

Peer reviewed

Direct link

Leuty, Melanie E. – Measurement and Evaluation in Counseling and Development, 2013

Test-retest data on Super's Work Values Inventory-Revised for a group of predominantly White ("N" = 995) women (mean age = 23.5 years, SD = 8.07) and men (mean age = 21.5 years, SD = 5.80) showed stability in mean-level scores over a period of 1 year for the sample as a whole. However, low raw score and rank order stability coefficients…

Descriptors: Robustness (Statistics), Scores, Individual Differences, Item Analysis

Replication and Robustness in Developmental Research

Peer reviewed

Direct link

Duncan, Greg J.; Engel, Mimi; Claessens, Amy; Dowsett, Chantelle J. – Developmental Psychology, 2014

Replications and robustness checks are key elements of the scientific method and a staple in many disciplines. However, leading journals in developmental psychology rarely include explicit replications of prior research conducted by different investigators, and few require authors to establish in their articles or online appendices that their key…

Descriptors: Replication (Evaluation), Robustness (Statistics), Developmental Psychology, Educational Research

Variety and Drift in the Functions and Purposes of Assessment in K-12 Education

Peer reviewed

Direct link

Ho, Andrew D. – Teachers College Record, 2014

Background/Context: The target of assessment validation is not an assessment but the use of an assessment for a purpose. Although the validation literature often provides examples of assessment purposes, comprehensive reviews of these purposes are rare. Additionally, assessment purposes posed for validation are generally described as discrete and…

Descriptors: Elementary Secondary Education, Standardized Tests, Measurement Objectives, Educational Change

Validation of the Chinese Version of the Life Orientation Test with a Robust Weighted Least Squares Approach

Peer reviewed

Direct link

Li, Cheng-Hsien – Psychological Assessment, 2012

Of the several measures of optimism presently available in the literature, the Life Orientation Test (LOT; Scheier & Carver, 1985) has been the most widely used in empirical research. This article explores, confirms, and cross-validates the factor structure of the Chinese version of the LOT with ordinal data by using robust weighted least…

Descriptors: Measures (Individuals), Psychological Testing, Chinese, Test Validity

The Public Understanding of Error in Educational Assessment

Peer reviewed

Direct link

Gardner, John – Oxford Review of Education, 2013

Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…

Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries

Previous Page | Next Page »

Pages: 1 | 2

Structural Equation Modeling:…	2
Applied Measurement in…	1
Assessment in Education:…	1
British Journal of…	1
CATESOL Journal	1
Center for Education Data &…	1
College Student Journal	1
College and University	1
Developmental Psychology	1
Economics of Education Review	1
Education and Information…	1
Educational Evaluation and…	1
Educational Research and…	1
Educational and Psychological…	1
Higher Education Research and…	1
International Journal of…	1
Journal of Autism and…	1
Journal of Educational…	1
Journal of Teacher Education	1
Measurement and Evaluation in…	1
National Education Policy…	1
Oxford Review of Education	1
ProQuest LLC	1
Psychological Assessment	1
Research & Practice in…	1
More ▼

Alejandro García-Mas	1
Arkoudis, Sophia	1
Baik, Chi	1
Balcão Reis, Ana	1
Ballou, Dale	1
Booker, Kevin	1
Brent J. Goertzen	1
Camilli, Gregory	1
Campbell, Shanyce L.	1
Chang, Hua-Hua	1
Chaplin, Duncan	1
Chu-Yang Chang	1
Claessens, Amy	1
Dandan Tang	1
Dillon, Amanda	1
Dorans, Neil J.	1
Douglas, Jeff	1
Dowsett, Chantelle J.	1
Duncan, Greg J.	1
Duvekot, Jorieke	1
Engel, Mimi	1
English, Taylor	1
Ferro, Adriana	1
Freitas, Pedro	1
Froman, Terry	1
More ▼