Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 12 |
Descriptor
Evaluation Methods | 44 |
Test Format | 44 |
Test Validity | 44 |
Test Reliability | 23 |
Test Construction | 18 |
Student Evaluation | 14 |
Higher Education | 11 |
Test Use | 8 |
Test Interpretation | 7 |
Testing | 7 |
Evaluation Criteria | 6 |
More ▼ |
Source
Author
Ory, John C. | 2 |
Salend, Spencer J. | 2 |
Alemi, Minoo | 1 |
Anderson, Colette | 1 |
Baker, Holly | 1 |
Benavidez, Charlotte | 1 |
Bracey, Gerald W. | 1 |
Brady, Michael P. | 1 |
Braun, Carl | 1 |
Brown, William R. | 1 |
Buser, Karen | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 4 |
Higher Education | 3 |
Adult Education | 2 |
Grade 8 | 2 |
Postsecondary Education | 2 |
Elementary Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Practitioners | 7 |
Teachers | 6 |
Administrators | 4 |
Location
Iran | 1 |
Netherlands | 1 |
Sweden | 1 |
United Kingdom (England) | 1 |
United Kingdom (Northern… | 1 |
United Kingdom (Wales) | 1 |
Utah | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
Minnesota Multiphasic… | 1 |
Trends in International… | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Han, Chao – Language Testing, 2022
Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…
Descriptors: Translation, Language Tests, Testing, Evaluation Methods
Cui, Ying; Chen, Fu; Lutsyk, Alina; Leighton, Jacqueline P.; Cutumisu, Maria – Assessment in Education: Principles, Policy & Practice, 2023
With the exponential increase in the volume of data available in the 21st century, data literacy skills have become vitally important in work places and everyday life. This paper provides a systematic review of available data literacy assessments targeted at different audiences and educational levels. The results can help researchers and…
Descriptors: Data, Information Literacy, 21st Century Skills, Competence
Jung Youn, Soo – Language Testing, 2023
As access to smartphones and emerging technologies has become ubiquitous in our daily lives and in language learning, technology-mediated social interaction has become common in teaching and assessing L2 speaking. The changing ecology of L2 spoken interaction provides language educators and testers with opportunities for renewed test design and…
Descriptors: Test Construction, Test Validity, Second Language Learning, Telecommunications
Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022
In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…
Descriptors: Computer Assisted Testing, Tests, Scores, Scoring
Ford, Jeremy W.; Conoyer, Sarah J.; Lembke, Erica S.; Smith, R. Alex; Hosp, John L. – Assessment for Effective Intervention, 2018
In the present study, two types of curriculum-based measurement (CBM) tools in science, Vocabulary Matching (VM) and Statement Verification for Science (SV-S), a modified Sentence Verification Technique, were compared. Specifically, this study aimed to determine whether the format of information presented (i.e., SV-S vs. VM) produces differences…
Descriptors: Curriculum Based Assessment, Evaluation Methods, Measurement Techniques, Comparative Analysis
Dwyer, Andrew C. – Journal of Educational Measurement, 2016
This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…
Descriptors: Cutting Scores, Equivalency Tests, Test Format, Academic Standards
Thomas, Jason E.; Hornsey, Philip E. – Journal of Instructional Research, 2014
Formative Classroom Assessment Techniques (CAT) have been well-established instructional tools in higher education since their exposition in the late 1980s (Angelo & Cross, 1993). A large body of literature exists surrounding the strengths and weaknesses of formative CATs. Simpson-Beck (2011) suggested insufficient quantitative evidence exists…
Descriptors: Classroom Techniques, Nontraditional Education, Adult Education, Formative Evaluation
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Salend, Spencer J. – Educational Leadership, 2011
Creating a fair, reliable, teacher-made test is a challenge. Every year poorly designed tests fail to accurately measure many students' learning--and negatively affect their academic futures. Salend, a well-known writer on assessment for at-risk students who consults with schools on assessment procedures, offers guidelines for creating tests that…
Descriptors: At Risk Students, Test Construction, Student Evaluation, Evaluation Methods
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Alemi, Minoo; Miraghaee, Apama – Journal on English Language Teaching, 2011
The present study was carried out to find out whether regular administration of cloze test improved the students' knowledge of grammar more than the multiple choice one. Subjects participating in this study were 84 Iranian pre-university students of Allameh-Gotb-e Ravandi University, aged between 18 and 35 and enrolled in a grammar course. To…
Descriptors: Foreign Countries, Comparative Analysis, Grammar, Knowledge Level
Caldwell, Robert M.; Marcel, Marvin – Training, 1985
Examines Southwestern Bell's Interdepartmental Training Center's program of providing objective evaluations of trainers and the training process. Elements that are discussed include the evaluation format, the form of the evaluation instrument and its emphasis, the validation process, and refinements to the system. (CT)
Descriptors: Evaluation Methods, Guidelines, Teacher Evaluation, Test Construction

Silverstein, A. B. – Perceptual and Motor Skills, 1982
Estimates of the validity of random short forms can serve as benchmarks against which to appraise the validity of particular short forms. Formulas are presented for estimating the validity of random short forms and illustrated with Wechsler Adult Intelligence Scale-Revised (WAIS-R) and Minnesota Multiphasic Personality Inventory data. (Author/CM)
Descriptors: Evaluation Methods, Intelligence Tests, Mathematical Formulas, Personality Measures

Kirkley, Karen N.; Fisher, Anne G. – Journal of Outcome Measurement, 1999
The alternate-forms reliability of the Assessment of Motor and Process Skills (AMPS) (A. Fisher, 1997), where alternate forms means different pairs of AMPS tasks, was studied with 91 people who had performed four AMPS tasks. Results support use of the AMPS activities of daily-living motor and process scales. (SLD)
Descriptors: Adults, Daily Living Skills, Diagnostic Tests, Disabilities

Rabiner, Donna J.; And Others – Evaluation and Program Planning, 1994
A 14-item instrument, the Dentist Satisfaction Survey-14, a form of a previously validated instrument, is described. Use with 522 dentists, and 29 in a follow-up, indicates that the short form is a parsimonious tool for general evaluation of dentists' job satisfaction. (SLD)
Descriptors: Attitude Measures, Dentists, Evaluation Methods, Followup Studies