Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 8 |
Descriptor
Scores | 60 |
Test Reliability | 60 |
Testing Problems | 60 |
Test Validity | 25 |
Test Interpretation | 20 |
Standardized Tests | 18 |
Elementary Secondary Education | 15 |
Achievement Tests | 12 |
Test Bias | 12 |
Error of Measurement | 10 |
Higher Education | 10 |
Author
Airasian, Peter W. | 1 |
Anderson, Paul S. | 1 |
Attali, Yigal | 1 |
Avery, Richard O. | 1 |
Baig, Basim | 1 |
Barker, Pierce | 1 |
Bergquist, Constance | 1 |
Bormuth, John R. | 1 |
Brown, Jonathan R. | 1 |
Burns, Edward | 1 |
Burns, Marilyn | 1 |
Education Level
Early Childhood Education | 1 |
Elementary Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Preschool Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 4 |
Practitioners | 2 |
Parents | 1 |
Location
China | 2 |
Texas | 1 |
United Kingdom | 1 |
United States | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025
Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…
Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables
LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022
Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…
Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering
Fu, Jianbin; Qu, Yanxuan – ETS Research Report Series, 2018
Various subscore estimation methods that use auxiliary information to improve subscore accuracy and stability have been developed. This report provides a review of various subscore estimation methods described in the literature. The methodology of each method is described, then research studies on these subscore estimation methods are summarized.…
Descriptors: Scores, Evaluation Methods, Item Response Theory, Test Items
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning
Min, Shangchao; He, Lianzhen; Zhang, Jie – Language Teaching, 2020
This article reviews a selected sample of 70 empirical studies in journal articles and doctoral dissertations on language assessment in China between 2011 and 2018. Following a brief introduction to the history and current state of language assessment in China, the article presents a critical review of language assessment research on six themes…
Descriptors: Language Tests, Test Reliability, Test Validity, Journal Articles
Zhao, Hulin; Gu, Xiangdong – Language Testing, 2016
Test Purpose: The CATTI aims to measure competence in translation and interpreting (including simultaneous and consecutive interpreting) between Chinese and seven foreign languages: English, Japanese, French, Arabic, Russian, German, or Spanish. The test is intended to cover a wide range of domains including business, government, academia, and…
Descriptors: Accreditation (Institutions), Foreign Countries, Translation, Chinese
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
Dixon, Rebecca R. – New Directions for Testing and Measurement, 1981
The emphasis placed on test scores in college admissions is discussed. The need for colleges to periodically analyze and define their admissions policy is recommended. (Author/AL)
Descriptors: Admission Criteria, College Entrance Examinations, Predictive Validity, Scores

Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984
Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)
Descriptors: Correlation, Intelligence Tests, Profiles, Scores

Hale, Gordon A.; And Others – Language Learning, 1983
Addresses the issue of whether test scores are affected by the prior availability of the items on a test. Concludes that, while disclosing items significantly affects test scores, the magnitude of the disclosure effect decreases with an increase in the size of the disclosed pool. (EKN)
Descriptors: English (Second Language), Language Tests, Scores, Second Language Learning

Griswold, Philip A. – NASSP Bulletin, 1990
Outlines some practical procedures for assessing test quality. Tests are relevant when learning outcomes have been correctly defined, when test content is aligned with instructional objectives, and when test and instructional formats are similar. Reliable tests follow administration, scoring, and interpretation procedures and consider difficulty…
Descriptors: Elementary Secondary Education, Scores, Teacher Made Tests, Test Reliability

Wakefield, John F. – Psychology: A Quarterly Journal of Human Behavior, 1983
Examined whether lengthy responses to the blank card reflect a contaminating factor such as glibness in creativity research. Two groups of college students completed the Remote Associates Test, Thematic Apperception Test, or Hand Test. Results suggested that blank cards among ambiguous stimuli evoke not glibness but economy of expression. (JAC)
Descriptors: College Students, Creativity Research, Higher Education, Response Style (Tests)

Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1981
Reliability coefficients of linear combinations of observed scores have anomalous properties which have led to difficulties in the investigation of difference scores and gain scores in test theory. Discrepancies between classical results and correct results obtained from more general formulas, which allow for correlated errors, are examined…
Descriptors: Error of Measurement, Mathematical Formulas, Mathematical Models, Scores
Neill, D. Monty; Medina, Noe J. – Phi Delta Kappan, 1989
Standardized, multiple-choice tests have become the major criterion for a wide range of school decisions affecting student placement, curriculum format, and teaching style. Improved assessment will not reform education. The more insightful and powerful the assessment tool, the more damage is caused by its misuse. Includes 70 references. (MLH)
Descriptors: Elementary Secondary Education, School Readiness, Scores, Standardized Tests
Hoover, Randy L.; Kadunc, Nancy – 1983
The purpose of this paper is to examine the nature of discrepancy score phenomena of the Myers-Briggs Type Indicator (MBTI), as related to internal consistency and construct validity of the instrument. Data were collected from 140 university research managers. The data suggest internal consistency problems: only 37.3 percent of the subjects…
Descriptors: Adults, Personality Measures, Personality Traits, Sampling