Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 16 |
Descriptor
| Reliability | 61 |
| Test Interpretation | 61 |
| Validity | 29 |
| Test Construction | 18 |
| Statistical Analysis | 14 |
| Psychometrics | 12 |
| Scores | 12 |
| Scoring | 10 |
| Elementary Secondary Education | 9 |
| Evaluation Methods | 9 |
| Error of Measurement | 8 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 3 |
| Elementary Education | 2 |
| Elementary Secondary Education | 2 |
| Postsecondary Education | 2 |
| Grade 7 | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Secondary Education | 1 |
Location
| United Kingdom (Wales) | 2 |
| Australia | 1 |
| California (Los Angeles) | 1 |
| Jordan | 1 |
| Louisiana | 1 |
| Massachusetts | 1 |
| Taiwan | 1 |
| United Kingdom | 1 |
| United Kingdom (England) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2015
This article uses definitions provided by Cronbach in his seminal paper for coefficient a to show the concepts of reliability, dimensionality, and internal consistency are distinct but interrelated. The article begins with a critique of the definition of reliability and then explores mathematical properties of Cronbach's a. Internal consistency…
Descriptors: Reliability, Definitions, Mathematics, Test Interpretation
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Schmidgall, Jonathan – Applied Measurement in Education, 2017
This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…
Descriptors: Scores, Reliability, Validity, Generalizability Theory
de Vaan, Gitta; Vervloed, Mathijs P. J.; Hoevenaars-van den Boom, Marella; Antonissen, Anneke; Knoors, Harry; Verhoeven, Ludo – Journal of Mental Health Research in Intellectual Disabilities, 2016
Instruments that are used for diagnosing of, or screening for, autism spectrum disorder (ASD) may not be applicable to people with sensory disabilities in addition to intellectual disabilities. First, because they do not account for equifinality, the possibility that different conditions may lead to the same outcome. Second, because they do not…
Descriptors: Screening Tests, Clinical Diagnosis, Diagnostic Tests, Autism
Papageorgiou, Spiros; Morgan, Rick; Becker, Valerie – International Journal of Testing, 2015
The purpose of this study was to enhance the meaning of the scores of an English-language test by developing performance levels and descriptors for reporting overall test performance. The levels and descriptors were intended to accompany the total scale scores of TOEFL Junior® Standard, an international test of English as a second/foreign…
Descriptors: Language Proficiency, Language Tests, English (Second Language), Second Language Learning
Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014
In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…
Descriptors: Generalizability Theory, Measurement, Reliability, Correlation
Moses, Tim – ETS Research Report Series, 2013
The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…
Descriptors: Psychometrics, Scores, Correlation, Regression (Statistics)
Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014
In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…
Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Crawford, John R.; Garthwaite, Paul H.; Morrice, Nicola; Duff, Kevin – Psychological Assessment, 2012
Supplementary methods for the analysis of the Repeatable Battery for the Assessment of Neuropsychological Status are made available, including (a) quantifying the number of abnormally low Index scores and abnormally large differences exhibited by a case and accompanying this with estimates of the percentages of the normative population expected to…
Descriptors: Neurological Impairments, Cognitive Tests, Psychological Testing, Adults
Al-Shara'H, Nayel Darweesh – Education, 2013
The study aimed at investigating Jordanian EFL teachers' self-reported frequencies of using the procedures of preparing, correcting, analyzing, interpreting an achievement test, and discussing its results with students. To achieve this, a 31-item questionnaire was used. The questionnaire was administered to 118 basic stage EFL teachers after…
Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Test Construction
Johnson, Sandra – Routledge, Taylor & Francis Group, 2011
"Assessing Learning in the Primary Classroom" is an accessible introduction to the concepts critical to a professional understanding of this vital aspect of a teacher's role. It comprehensively considers the principles underpinning effective assessment, the different forms it can take and the different purposes it serves, both within and beyond…
Descriptors: Student Evaluation, Elementary Education, Educational Assessment, Validity
Tuccitto, Daniel E.; Giacobbi, Peter R., Jr.; Leite, Walter L. – Educational and Psychological Measurement, 2010
This study tested five confirmatory factor analytic (CFA) models of the Positive Affect Negative Affect Schedule (PANAS) to provide validity evidence based on its internal structure. A sample of 223 club sport athletes indicated their emotions during the past week. Results revealed that an orthogonal two-factor CFA model, specifying error…
Descriptors: Factor Analysis, Models, Affective Measures, Validity
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Lang, W. Steve; Wilkerson, Judy R. – Online Submission, 2008
The National Council for Accreditation of Teacher Education (NCATE, 2002) requires teacher education units to develop assessment systems and evaluate both the success of candidates and unit operations. Because of a stated, but misguided, fear of statistics, NCATE fails to use accepted terminology to assure the quality of institutional evaluative…
Descriptors: State Standards, Validity, Resource Materials, Reliability

Peer reviewed
Direct link
