Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 5 |
| Since 2017 (last 10 years) | 6 |
| Since 2007 (last 20 years) | 27 |
Descriptor
| Robustness (Statistics) | 29 |
| Test Reliability | 29 |
| Test Validity | 29 |
| Item Analysis | 8 |
| Evaluation Methods | 7 |
| Evaluation Problems | 7 |
| Evaluation Research | 6 |
| Foreign Countries | 6 |
| Standardized Tests | 6 |
| Student Evaluation | 6 |
| Evaluation Criteria | 5 |
Author
| Alejandro García-Mas | 1 |
| Arkoudis, Sophia | 1 |
| Baik, Chi | 1 |
| Balcão Reis, Ana | 1 |
| Ballou, Dale | 1 |
| Booker, Kevin | 1 |
| Brent J. Goertzen | 1 |
| Camilli, Gregory | 1 |
| Campbell, Shanyce L. | 1 |
| Chang, Hua-Hua | 1 |
| Chaplin, Duncan | 1 |
Publication Type
| Journal Articles | 25 |
| Reports - Research | 15 |
| Reports - Evaluative | 10 |
| Reports - Descriptive | 3 |
| Dissertations/Theses -… | 1 |
| Information Analyses | 1 |
| Numerical/Quantitative Data | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 11 |
| Elementary Secondary Education | 7 |
| Postsecondary Education | 5 |
| Secondary Education | 3 |
| Elementary Education | 1 |
| Grade 10 | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 7 | 1 |
| High Schools | 1 |
Location
| Taiwan | 2 |
| Australia | 1 |
| California | 1 |
| Florida | 1 |
| Portugal | 1 |
| Tennessee | 1 |
| Texas | 1 |
| United Kingdom | 1 |
| United Kingdom (Leeds) | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| Autism Diagnostic Observation… | 1 |
| Florida Comprehensive… | 1 |
| Program for International… | 1 |
| Work Values Inventory | 1 |
Dandan Tang; Steven M. Boker; Xin Tong – Structural Equation Modeling: A Multidisciplinary Journal, 2025
The replication crisis in social and behavioral sciences has raised concerns about the reliability and validity of empirical studies. While research in the literature has explored contributing factors to this crisis, the issues related to analytical tools have received less attention. This study focuses on a widely used analytical tool -…
Descriptors: Test Validity, Factor Analysis, Replication (Evaluation), Social Science Research
Brent J. Goertzen; Kaley Klaus – Research & Practice in Assessment, 2023
When evaluating student learning, educators often employ scoring rubrics, for which quality can be determined through evaluating validity and reliability. This article discusses the norming process utilized in a graduate organizational leadership program for a capstone scoring rubric. Concepts of validity and reliability are discussed, as is the…
Descriptors: Graduate Students, Graduate Study, Graduate School Faculty, Scoring Rubrics
Chu-Yang Chang; Hsu-Chan Kuo – Education and Information Technologies, 2025
The rapid advancement of educational technologies in recent decades has underscored the increasing importance of digital literacy (DL) as a core competency for all students, as recognised in various educational policies and programs. Evaluating students' DL is crucial for providing valuable insights to guide future educational initiatives. This…
Descriptors: Digital Literacy, Questionnaires, Test Construction, Test Validity
Zachary J. Roman; Patrick Schmidt; Jason M. Miller; Holger Brandt – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Careless and insufficient effort responding (C/IER) is a situation where participants respond to survey instruments without considering the item content. This phenomenon adds noise to data, leading to erroneous inference. There are multiple approaches to identifying and accounting for C/IER in survey settings; of these approaches, the best performing…
Descriptors: Structural Equation Models, Bayesian Statistics, Response Style (Tests), Robustness (Statistics)
Ruben Trigueros; Alejandro García-Mas – British Journal of Educational Psychology, 2025
Introduction: In recent years, the incorporation of novelty as a psychological need and the study of the frustration of needs have become a recurring theme in the research on psychological needs in the educational environment. Currently, there are two scales available to assess the frustration of basic psychological needs (FBN) in the context of…
Descriptors: Psychological Patterns, Well Being, Resilience (Psychology), Self Determination
Naylor, Ryan; Baik, Chi; Arkoudis, Sophia – Higher Education Research and Development, 2018
Using data collected from a recent national survey of Australian first-year students, this paper defines and validates four scales--belonging, feeling supported, intellectual engagement and workload stress--to measure the student experience of university. These scales provide insights into the university experience for both groups and individual…
Descriptors: Student Attrition, At Risk Students, College Freshmen, National Surveys
Wang, Shiyu; Lin, Haiyan; Chang, Hua-Hua; Douglas, Jeff – Journal of Educational Measurement, 2016
Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing. Though most designs of CAT and MST exhibit strengths and weaknesses in recent large-scale implementations, there is no simple answer to the question of which design is better because different…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Format, Sequential Approach
Duvekot, Jorieke; van der Ende, Jan; Verhulst, Frank C.; Greaves-Lord, Kirstin – Journal of Autism and Developmental Disorders, 2015
The screening accuracy of the parent and teacher-reported Social Responsiveness Scale (SRS) was compared with an autism spectrum disorder (ASD) classification according to (1) the Developmental, Dimensional, and Diagnostic Interview (3Di), (2) the Autism Diagnostic Observation Schedule (ADOS), (3) both the 3Di and ADOS, in 186 children referred to…
Descriptors: Accuracy, Screening Tests, Parent Teacher Cooperation, Pervasive Developmental Disorders
Ronfeldt, Matthew; Campbell, Shanyce L. – Educational Evaluation and Policy Analysis, 2016
Despite growing calls for more accountability of teacher education programs (TEPs), there is little consensus about how to evaluate them. This study investigates the potential for using observational ratings of program completers to evaluate TEPs. Drawing on statewide data on almost 9,500 program completers, representing 44 providers (183…
Descriptors: Teacher Education Programs, Program Effectiveness, Program Evaluation, Observation
Freitas, Pedro; Nunes, Luís Catela; Balcão Reis, Ana; Seabra, Carmo; Ferro, Adriana – Assessment in Education: Principles, Policy & Practice, 2016
The results of large-scale international assessments such as the Programme for International Student Assessment (PISA) have attracted considerable attention worldwide and are often used by policy-makers to support educational policies. To ensure that the published results represent the actual population, these surveys go through a thorough scrutiny…
Descriptors: International Assessment, Student Characteristics, Weighted Scores, Evaluation Problems
Leuty, Melanie E. – Measurement and Evaluation in Counseling and Development, 2013
Test-retest data on Super's Work Values Inventory-Revised for a group of predominantly White ("N" = 995) women (mean age = 23.5 years, SD = 8.07) and men (mean age = 21.5 years, SD = 5.80) showed stability in mean-level scores over a period of 1 year for the sample as a whole. However, low raw score and rank order stability coefficients…
Descriptors: Robustness (Statistics), Scores, Individual Differences, Item Analysis
Duncan, Greg J.; Engel, Mimi; Claessens, Amy; Dowsett, Chantelle J. – Developmental Psychology, 2014
Replications and robustness checks are key elements of the scientific method and a staple in many disciplines. However, leading journals in developmental psychology rarely include explicit replications of prior research conducted by different investigators, and few require authors to establish in their articles or online appendices that their key…
Descriptors: Replication (Evaluation), Robustness (Statistics), Developmental Psychology, Educational Research
Ho, Andrew D. – Teachers College Record, 2014
Background/Context: The target of assessment validation is not an assessment but the use of an assessment for a purpose. Although the validation literature often provides examples of assessment purposes, comprehensive reviews of these purposes are rare. Additionally, assessment purposes posed for validation are generally described as discrete and…
Descriptors: Elementary Secondary Education, Standardized Tests, Measurement Objectives, Educational Change
Li, Cheng-Hsien – Psychological Assessment, 2012
Of the several measures of optimism presently available in the literature, the Life Orientation Test (LOT; Scheier & Carver, 1985) has been the most widely used in empirical research. This article explores, confirms, and cross-validates the factor structure of the Chinese version of the LOT with ordinal data by using robust weighted least…
Descriptors: Measures (Individuals), Psychological Testing, Chinese, Test Validity
Gardner, John – Oxford Review of Education, 2013
Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…
Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries
