Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023
Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…
Descriptors: Test Interpretation, Scores, Test Use, Test Validity
Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025
Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…
Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment
Li, Xu; Ouyang, Fan; Liu, Jianwen; Wei, Chengkun; Chen, Wenzhi – Journal of Educational Computing Research, 2023
The computer-supported writing assessment (CSWA) has been widely used to reduce instructor workload and provide real-time feedback. Interpretability of CSWA draws extensive attention because it can benefit the validity, transparency, and knowledge-aware feedback of academic writing assessments. This study proposes a novel assessment tool,…
Descriptors: Computer Assisted Testing, Writing Evaluation, Feedback (Response), Natural Language Processing
Bitzenbauer, Philipp – European Journal of Science and Mathematics Education, 2021
This article reports the development and validation of a test instrument to assess secondary school students' declarative quantum optics knowledge. With that, we respond to modern developments from physics education research: Numerous researchers propose quantum optics-based introductory courses in quantum physics, focusing on experiments with…
Descriptors: Test Construction, Test Validity, Measures (Individuals), Secondary School Students
Zou, Shen; Xu, Qian – Language Assessment Quarterly, 2017
Washback and fairness are interrelated in validity research, and thus an investigation into washback inevitably involves fairness. This article reports Phase One of a washback study of "Test for English Majors for Grade Eight" (TEM8). Phase One was a questionnaire survey administered to university program administrators. Two research…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Test Bias
Langan, Anthony Mark; Dunleavy, Peter; Fielding, Alan – Education Sciences, 2013
Many countries use national-level surveys to capture student opinions about their university experiences. It is necessary to interpret survey results in an appropriate context to inform decision-making at many levels. To provide context to national survey outcomes, we describe patterns in the ratings of science and engineering subjects from the…
Descriptors: Models, National Surveys, Undergraduate Students, College Science
Hawley, Peggy – 1977
The 35-item attitude scale was developed to test the hypothesis that male views of appropriate female behavior significantly influence the processes underlying females' career development. The rating scale is a Likert-type instrument and does not provide for a neutral response. The test is group administered, takes approximately 30 minutes, and is…
Descriptors: Answer Keys, Attitude Measures, Sex Role, Test Interpretation

Platten, Marvin R.; Williams, Larry R. – Educational and Psychological Measurement, 1979
The Piers-Harris Children's Self-Concept Scale was administered twice to a sample of elementary school pupils and both sets of data were factor analyzed. Results led the authors to question the factor stability of the instrument. (Items are included). (JKS)
Descriptors: Factor Structure, Intermediate Grades, Orthogonal Rotation, Self Concept Measures
Frary, Robert B.; And Others – 1985
Students in an introductory college course (n=275) responded to equivalent 20-item halves of a test under number-right and formula-scoring instructions. Formula scores of those who omitted items averaged about one point lower than their comparable (formula adjusted) scores on the test half administered under number-right instructions. In contrast,…
Descriptors: Guessing (Tests), Higher Education, Multiple Choice Tests, Questionnaires

Hakstian, A. Ralph; Woolsey, Lorette K. – Educational and Psychological Measurement, 1985
The paper presented evidence of the criterion related validity of several Comprehensive Ability Battery (CAB) tests relative to criterion variables representing first year college achievement. Information regarding the criterion related validity of nontraditional tests of the CAB reflecting divergent production hypothesized to be associated with…
Descriptors: Academic Achievement, Aptitude Tests, College Freshmen, Higher Education
Woodcock, Richard W. – 1997
This boxed set contains a wide-range, comprehensive set of tests for measuring reading achievement and related abilities. The tests are administered individually, and norms are provided from age 4 to age 90. Special college/university norms are also provided. It consists of 10 measures. The scores from different combinations of these tests provide…
Descriptors: Elementary Secondary Education, Higher Education, Reading Achievement, Reading Tests
Gallagher, Robert; And Others – 1983
The manual was intended to provide instructions and technical data for using the Keystone Adaptive Behavior Profile, a measure of personal independence and social responsibility for handicapped students. Information on the selection and development of the scale is followed by detailed suggestions, in separate chapters, on administration and…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Disabilities, Elementary Secondary Education
Dass, Jane; Pine, Charles – 1981
The New Jersey College Basic Skills Placement Test (NJCBSPT) is designed to measure certain basic language and mathematics skills of students entering New Jersey colleges. The primary purpose of the two mathematics sections is to determine whether students are prepared to begin certain college-level work without a handicap in computation or…
Descriptors: Algebra, Basic Skills, College Freshmen, Computation
Lambrecht, Judith J. – 1981
An aptitude test requiring 10 minutes of administration time was given to high school students learning Forkner, Century 21, and Gregg shorthand for the purpose of determining test validity for different shorthand systems. Validity data were obtained from approximately 2000 students. Aptitude test reliability ranged from KR20=0.88 to 0.90.…
Descriptors: Academic Achievement, Aptitude Tests, Correlation, Dropout Rate
Alberta Dept. of Education, Edmonton. Planning Services Branch. – 1983
The School Subjects Attitude Scales is an instrument for measuring students' attitudes toward school subjects for grades 5-12. Twenty-four bipolar word pairs are used with evaluation, usefulness, and difficulty scales. The word pairs were selected on the basis of discussions and analysis of trial forms. Results can be used in program evaluation…
Descriptors: Attitude Measures, Elementary Secondary Education, Foreign Countries, Rating Scales