ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	4

Descriptor

Test Interpretation	32
Test Reliability	32
Test Theory	32
Test Validity	17
Test Construction	14
Criterion Referenced Tests	9
Item Analysis	8
Higher Education	7
Psychometrics	7
Testing Problems	7
Career Development	6
Measurement Techniques	6
Norm Referenced Tests	6
Standardized Tests	6
Statistical Analysis	6
Testing	6
Achievement Tests	5
Comparative Analysis	5
Elementary Secondary Education	5
Error of Measurement	5
Scores	5
Scoring	5
Test Items	5
Evaluation Methods	4
Item Sampling	4
More ▼

Source

Educational Measurement:…	2
Journal of School Psychology	2
Alberta Journal of…	1
American Psychologist	1
Annual Review of Applied…	1
Applied Psychological…	1
Executive Review	1
International Journal of…	1
Journal of Educational…	1
Performance and Instruction	1
Psychology in the Schools	1
Society for Research on…	1
More ▼

Publication Type

Reports - Research	12
Journal Articles	11
Information Analyses	5
Reports - Descriptive	5
Speeches/Meeting Papers	5
Reports - Evaluative	3
Books	2
Collected Works - Serials	2
Guides - Non-Classroom	2
Numerical/Quantitative Data	2
Opinion Papers	2
Collected Works - Proceedings	1
Guides - Classroom - Learner	1
Reference Materials - General	1
More ▼

Education Level

Elementary Secondary Education	1
Higher Education	1

Audience

Practitioners	3
Teachers	3
Researchers	1
Students	1

Location

Australia	1
New York	1
New York (New York)	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
California Achievement Tests	1
Graduate Record Examinations	1
Kaufman Assessment Battery…	1
Preliminary Scholastic…	1
SAT (College Admission Test)	1
Woodcock Johnson Tests of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

A General Method for Adjusting Test Score Distributions to Account for Rescoring and Retesting

Peer reviewed

Direct link

Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021

Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…

Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas

The Contestant Perspective on Taking Tests: Emanations from the Statue within

Peer reviewed

Direct link

Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012

Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…

Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Direct link

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

The Attenuation Paradox of Traditional Test Theory as a Breakdown of Local Independence in Person-Item Response Theory.

Andrich, David – 1984

Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…

Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability

A Primer of Testing.

Peer reviewed

Green, Bert F. – American Psychologist, 1981

Discusses classical test theory, including test construction, administration, and use. Covers basic statistical concepts in measurement, reliability, and validity; principles of sound test construction and item analysis; test administration and scoring; procedures for transforming raw test data into scaled scores; and future prospects in test…

Descriptors: Scores, Statistics, Test Construction, Test Interpretation

Influences on and Limitations of Classical Test Theory Reliability Estimates.

Download full text

Arnold, Margery E. – 1996

It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…

Descriptors: Estimation (Mathematics), Generalizability Theory, Heuristics, Interrater Reliability

A Tribute to Robert L. Ebel: Scholar, Teacher, Mentor, and Statesman

Peer reviewed

Direct link

Cizek, Gregory J.; Crocker, Linda; Frisbie, David A.; Mehrens, William A.; Stiggins, Richard J. – Educational Measurement: Issues and Practice, 2006

The authors describe the significant contributions of Robert Ebel to educational measurement theory and its applications. A biographical sketch details Ebel's roots and professional resume. His influence on classroom assessment views and procedures are explored. Classic publications associated with validity, reliability, and score interpretation…

Descriptors: Test Theory, Educational Assessment, Psychometrics, Test Reliability

Stability of the Kaufman Assessment Battery for Children for a Sample of At-Risk Preschool Children.

Peer reviewed

Lyon, Mark A.; Smith, Douglas K. – Psychology in the Schools, 1987

Examined stability of the Kaufman Assessment Battery for 53 at-risk preschool children. Over 9 months the stability coefficients for the global scales ranged from .78 to .88, and for the subtests from .65 to .79. Concluded that scores display adequate stability, but the Simultaneous scale is less stable than the Sequential or Achievement scales.…

Descriptors: Cognitive Measurement, High Risk Students, Preschool Children, Preschool Education

A Review of Estimation Procedures for the Rasch Model with an Eye toward Longish Tests.

Peer reviewed

Morgan, Anne; Wainer, Howard – Journal of Educational Statistics, 1980

Two estimation procedures for the Rasch Model of test analysis are reviewed in detail, particularly with respect to new developments that make the more statistically rigorous conditional maximum likelihood estimation practical for use with longish tests. (Author/JKS)

Descriptors: Error of Measurement, Latent Trait Theory, Maximum Likelihood Statistics, Psychometrics

Uniqueness and General Factor Characteristics of the Woodcock-Johnson Tests of Cognitive Ability-Revised.

Peer reviewed

McGrew, Kevin; Murphy, Suzanne – Journal of School Psychology, 1995

Investigates the general factor and uniqueness characteristics of the individual tests of the Woodcock-Johnson Test of Cognitive Ability-Revised (WJTCA-R). Only 2 of the 19 WJTCA-R tests examined had low general factor loadings, while 2 had low uniqueness. All other tests had medium or high uniqueness. Discusses implications for clinical…

Descriptors: Academic Ability, Cognitive Ability, Intelligence, Intelligence Tests

Ten Psychometric Reasons Why Similar Tests Produce Dissimilar Results.

Peer reviewed

Bracken, Bruce A. – Journal of School Psychology, 1988

Notes that significantly different results frequently exist between tests that purport to measure the same skill when the same child is tested on both instruments. Considers discrepancies related to examinee, examiner, examinee-examiner interactions, environment, and psychometric characteristics of the tests employed. Cites 10 major psychometric…

Descriptors: Educational Diagnosis, Individual Differences, Psychological Evaluation, Psychological Testing

Criterion-Referenced Test Interpretations of "Classical" Measurement Theory.

Download full text

Epstein, Kenneth I.; Knerr, Claramae S. – 1976

The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…

Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling

Developments in Language Testing.

Peer reviewed

Douglas, Dan – Annual Review of Applied Linguistics, 1995

Reviews recent theoretical, methodological, and analytical developments in language testing, focusing on more refined models of language ability, reliability and validity, performance testing, innovative test formats, new applications of Item Response Theory and Generalizability Theory to test performance. An annotated bibliography discusses seven…

Descriptors: Annotated Bibliographies, Evaluation Methods, Language Proficiency, Language Tests

Introduction to Classical and Modern Test Theory.

Crocker, Linda; Algina, James – 1986

This text was written to help the reader acquire a base of knowledge about classical psychometrics and to integrate new ideas into that framework of knowledge. The material is organized into five units: (1) introduction to measurement theory; (2) reliability; (3) validity; (4) item analysis in test development; and (5) test scoring and…

Descriptors: Item Analysis, Measurement Techniques, Psychometrics, Scoring

Basic Concepts in Generalizability Theory: A More Powerful Approach to Evaluating Reliability.

Download full text

Naizer, Gilbert – 1992

A measurement approach called generalizability theory (G-theory) is an important alternative to the more familiar classical measurement theory that yields less useful coefficients such as alpha or the KR-20 coefficient. G-theory is a theory about the dependability of behavioral measurements that allows the simultaneous estimation of multiple…

Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Higher Education

Previous Page | Next Page »

Pages: 1 | 2 | 3

Crocker, Linda	2
Haladyna, Tom	2
Algina, James	1
Andrich, David	1
Arnold, Margery E.	1
Beard, John D., Ed.	1
Bormuth, John R.	1
Bracken, Bruce A.	1
Bullock, Lyndal M.	1
Chase, Clinton I.	1
Cizek, Gregory J.	1
Coffman, William E.	1
Dorans, Neil J.	1
Douglas, Dan	1
Elosua, Paula	1
Epstein, Kenneth I.	1
Frisbie, David A.	1
Goodstein, H. A.	1
Green, Bert F.	1
Iliescu, Dragos	1
Jacobs, Lucy Cheser	1
Knerr, Claramae S.	1
Lord, Frederic M.	1
Lyon, Mark A.	1
More ▼