ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	8

Descriptor

Scores	60
Test Reliability	60
Testing Problems	60
Test Validity	25
Test Interpretation	20
Standardized Tests	18
Elementary Secondary Education	15
Achievement Tests	12
Test Bias	12
Error of Measurement	10
Higher Education	10
Scoring	9
Statistical Analysis	9
College Entrance Examinations	8
Test Construction	8
Testing	8
Comparative Analysis	7
Measurement Techniques	7
Mathematical Models	6
Norm Referenced Tests	6
Test Results	6
Computer Assisted Testing	5
Correlation	5
Educational Testing	5
Intelligence Tests	5
More ▼

Publication Type

Reports - Research	29
Journal Articles	28
Reports - Evaluative	12
Speeches/Meeting Papers	9
Guides - Non-Classroom	5
Opinion Papers	5
Information Analyses	4
Reports - Descriptive	3
Books	2
Collected Works - Proceedings	2
Collected Works - Serials	2
More ▼

Education Level

Early Childhood Education	1
Elementary Education	1
Higher Education	1
Postsecondary Education	1
Preschool Education	1
Secondary Education	1

Audience

Researchers	4
Practitioners	2
Parents	1

Location

China	2
Texas	1
United Kingdom	1
United States	1

Laws, Policies, & Programs

Elementary and Secondary…	1
Individuals with Disabilities…	1

Assessments and Surveys

SAT (College Admission Test)	5
California Achievement Tests	2
Wechsler Adult Intelligence…	2
ACT Assessment	1
ACTFL Oral Proficiency…	1
Childrens Depression Inventory	1
General Aptitude Test Battery	1
Law School Admission Test	1
Metropolitan Achievement Tests	1
Myers Briggs Type Indicator	1
National Assessment of…	1
Peabody Picture Vocabulary…	1
Remote Associates Test	1
Slosson Intelligence Test	1
Test of English as a Foreign…	1
Thematic Apperception Test	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 60 results Save | Export

Measurement Invariance of Scores on the Teacher Stress Scale: International Sample of PreK-12 Teachers

Peer reviewed

Direct link

Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025

Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…

Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables

Digital-First Assessments: A Security Framework

Peer reviewed

Direct link

LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022

Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…

Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering

A Review of Subscore Estimation Methods. ETS RR-18-17

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Qu, Yanxuan – ETS Research Report Series, 2018

Various subscore estimation methods that use auxiliary information to improve subscore accuracy and stability have been developed. This report provides a review of various subscore estimation methods described in the literature. The methodology of each method is described, then research studies on these subscore estimation methods are summarized.…

Descriptors: Scores, Evaluation Methods, Item Response Theory, Test Items

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Review of Recent Empirical Research (2011-2018) on Language Assessment in China

Peer reviewed

Direct link

Min, Shangchao; He, Lianzhen; Zhang, Jie – Language Teaching, 2020

This article reviews a selected sample of 70 empirical studies in journal articles and doctoral dissertations on language assessment in China between 2011 and 2018. Following a brief introduction to the history and current state of language assessment in China, the article presents a critical review of language assessment research on six themes…

Descriptors: Language Tests, Test Reliability, Test Validity, Journal Articles

China Accreditation Test for Translators and Interpreters (CATTI): Test Review Based on the Language Pairing of English and Chinese

Peer reviewed

Direct link

Zhao, Hulin; Gu, Xiangdong – Language Testing, 2016

Test Purpose: The CATTI aims to measure competence in translation and interpreting (including simultaneous and consecutive interpreting) between Chinese and seven foreign languages: English, Japanese, French, Arabic, Russian, German, or Spanish. The test is intended to cover a wide range of domains including business, government, academia, and…

Descriptors: Accreditation (Institutions), Foreign Countries, Translation, Chinese

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Direct link

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

After the Test: There Should Be No Mystery.

Dixon, Rebecca R. – New Directions for Testing and Measurement, 1981

The emphasis placed on test scores in college admissions is discussed. The need for colleges to periodically analyze and define their admissions policy is recommended. (Author/AL)

Descriptors: Admission Criteria, College Entrance Examinations, Predictive Validity, Scores

Regression Effects on Part Scores Based on Whole-Score Selected Samples.

Peer reviewed

Willson, Victor L.; Reynolds, Cecil R. – Educational and Psychological Measurement, 1984

Samples in research on individual and group differences may be selected based on whole scores which differ from the population mean. Children are diagnosed in clinical practice with a whole score. These procedures produce regression to the population mean which can affect accuracy and adequacy of part score interpretations. (Author/DWH)

Descriptors: Correlation, Intelligence Tests, Profiles, Scores

Effects of Test Disclosure on Performance on the Test of English as a Foreign Language.

Peer reviewed

Hale, Gordon, A.; And Others – Language Learning, 1983

Addresses the issues of whether test scores are affected by the prior availability of the items on a test. Concludes that, while disclosing items significantly affects test scores, the magnitude of the disclosure effect drecreases with an increase in the size of the disclosed pool. (EKN)

Descriptors: English (Second Language), Language Tests, Scores, Second Language Learning

Assessing Relevance and Reliability to Improve the Quality of Teacher-Made Tests.

Peer reviewed

Griswold, Philip A. – NASSP Bulletin, 1990

Outlines some practical procedures for assessing test quality. Tests are relevant when learning outcomes have been correctly defined, when test content is aligned with instructional objectives, and when test and instructional formats are similar. Reliable tests follow administration, scoring, and interpretation procedures and consider difficulty…

Descriptors: Elementary Secondary Education, Scores, Teacher Made Tests, Test Reliability

Controlling for Economy of Expression in Creativity Research.

Peer reviewed

Wakefield, John F. – Psychology: A Quarterly Journal of Human Behavior, 1983

Examined whether lengthy responses to the blank card reflect a contaminating factor such as glibness in creativity research. Two groups of college students completed the Remote Associates Test, Thematic Apperception Test, or Hand Test. Results suggested that blank cards among ambiguous stimuli evoke not glibness but economy of expression. (JAC)

Descriptors: College Students, Creativity Research, Higher Education, Response Style (Tests)

The Reliability of Sums and Differences of Test Scores: Some New Results and Anomalies.

Peer reviewed

Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1981

Reliability coefficients of linear combinations of observed scores have anomalous properties which have led to difficulties in the investigation of difference scores and gain scores in test theory. Discrepancies between classical results and correct results obtained from more general formulas, which allow for correlated errors, are examined…

Descriptors: Error of Measurement, Mathematical Formulas, Mathematical Models, Scores

Standardized Testing: Harmful to Educational Health.

Neill, D. Monty; Medina, Noe J. – Phi Delta Kappan, 1989

Standarized, multiple-choice tests have become the major criterion for a wide range of school decisions affecting student placement, curriculum format, and teaching style. Improved assessment will not reform education. The more insightful and powerful the assessment tool, the more damage is caused by its misuse. Includes 70 references. (MLH)

Descriptors: Elementary Secondary Education, School Readiness, Scores, Standardized Tests

The Myers-Briggs Type Indicator: Analysis of Discrepancy Score Phenomenon in a Real World Sample.

Download full text

Hoover, Randy L.; Kadunc, Nancy – 1983

The purpose of this paper is to examine the nature of discrepancy score phenomena of the Myers-Briggs Type Indicator (MBTI), as related to internal consistency and construct validity of the instrument. Data were collected from 140 university research managers. The data suggest internal consistency problems: only 37.3 percent of the subjects…

Descriptors: Adults, Personality Measures, Personality Traits, Sampling

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Language Testing	2
NCME Measurement in Education	2
Phi Delta Kappan	2
Psychology in the Schools	2
AEDS Monitor	1
Applied Measurement in…	1
Applied Psychological…	1
ETS Research Report Series	1
Education and Training in…	1
Educational Leadership	1
Educational Measurement:…	1
Educational and Psychological…	1
Freshman English News	1
Illinois School Research and…	1
International Journal of…	1
Journal of Computer Assisted…	1
Journal of Consulting and…	1
Journal of Experimental…	1
Journal of Reading Behavior	1
Language Learning	1
Language Teaching	1
Language, Speech, and Hearing…	1
NASSP Bulletin	1
NJEA Review	1
New Directions for Testing…	1
More ▼

Airasian, Peter W.	1
Anderson, Paul S.	1
Attali, Yigal	1
Avery, Richard O.	1
Baig, Basim	1
Barker, Pierce	1
Bergquist, Constance	1
Bormuth, John R.	1
Brown, Jonathan R.	1
Burns, Edward	1
Burns, Marilyn	1
Coleman, Marilyn	1
Craig, Robert	1
Crowley, Susan L.	1
Dixon, Rebecca R.	1
Dunlap, William P.	1
Evans, Franklin R.	1
Ferguson, Richard L.	1
Foster, Jeff L.	1
Fowler, R. Clarke	1
Fruen, Mary	1
Fu, Jianbin	1
Gallas, Edwin J.	1
Gilmer, Jerry S.	1
More ▼