Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 4 |
Descriptor
Test Interpretation | 32 |
Test Reliability | 32 |
Test Theory | 32 |
Test Validity | 17 |
Test Construction | 14 |
Criterion Referenced Tests | 9 |
Item Analysis | 8 |
Higher Education | 7 |
Psychometrics | 7 |
Testing Problems | 7 |
Career Development | 6 |
More ▼ |
Source
Author
Crocker, Linda | 2 |
Haladyna, Tom | 2 |
Algina, James | 1 |
Andrich, David | 1 |
Arnold, Margery E. | 1 |
Beard, John D., Ed. | 1 |
Bormuth, John R. | 1 |
Bracken, Bruce A. | 1 |
Bullock, Lyndal M. | 1 |
Chase, Clinton I. | 1 |
Cizek, Gregory J. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 1 |
Higher Education | 1 |
Audience
Practitioners | 3 |
Teachers | 3 |
Researchers | 1 |
Students | 1 |
Location
Australia | 1 |
New York | 1 |
New York (New York) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
ACT Assessment | 1 |
California Achievement Tests | 1 |
Graduate Record Examinations | 1 |
Kaufman Assessment Battery… | 1 |
Preliminary Scholastic… | 1 |
SAT (College Admission Test) | 1 |
Woodcock Johnson Tests of… | 1 |
What Works Clearinghouse Rating
Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021
Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…
Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas
Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012
Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…
Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Andrich, David – 1984
Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…
Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability

Green, Bert F. – American Psychologist, 1981
Discusses classical test theory, including test construction, administration, and use. Covers basic statistical concepts in measurement, reliability, and validity; principles of sound test construction and item analysis; test administration and scoring; procedures for transforming raw test data into scaled scores; and future prospects in test…
Descriptors: Scores, Statistics, Test Construction, Test Interpretation
Arnold, Margery E. – 1996
It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…
Descriptors: Estimation (Mathematics), Generalizability Theory, Heuristics, Interrater Reliability
Cizek, Gregory J.; Crocker, Linda; Frisbie, David A.; Mehrens, William A.; Stiggins, Richard J. – Educational Measurement: Issues and Practice, 2006
The authors describe the significant contributions of Robert Ebel to educational measurement theory and its applications. A biographical sketch details Ebel's roots and professional resume. His influence on classroom assessment views and procedures are explored. Classic publications associated with validity, reliability, and score interpretation…
Descriptors: Test Theory, Educational Assessment, Psychometrics, Test Reliability
Stability of the Kaufman Assessment Battery for Children for a Sample of At-Risk Preschool Children.

Lyon, Mark A.; Smith, Douglas K. – Psychology in the Schools, 1987
Examined stability of the Kaufman Assessment Battery for 53 at-risk preschool children. Over 9 months the stability coefficients for the global scales ranged from .78 to .88, and for the subtests from .65 to .79. Concluded that scores display adequate stability, but the Simultaneous scale is less stable than the Sequential or Achievement scales.…
Descriptors: Cognitive Measurement, High Risk Students, Preschool Children, Preschool Education

Morgan, Anne; Wainer, Howard – Journal of Educational Statistics, 1980
Two estimation procedures for the Rasch Model of test analysis are reviewed in detail, particularly with respect to new developments that make the more statistically rigorous conditional maximum likelihood estimation practical for use with longish tests. (Author/JKS)
Descriptors: Error of Measurement, Latent Trait Theory, Maximum Likelihood Statistics, Psychometrics

McGrew, Kevin; Murphy, Suzanne – Journal of School Psychology, 1995
Investigates the general factor and uniqueness characteristics of the individual tests of the Woodcock-Johnson Test of Cognitive Ability-Revised (WJTCA-R). Only 2 of the 19 WJTCA-R tests examined had low general factor loadings, while 2 had low uniqueness. All other tests had medium or high uniqueness. Discusses implications for clinical…
Descriptors: Academic Ability, Cognitive Ability, Intelligence, Intelligence Tests

Bracken, Bruce A. – Journal of School Psychology, 1988
Notes that significantly different results frequently exist between tests that purport to measure the same skill when the same child is tested on both instruments. Considers discrepancies related to examinee, examiner, examinee-examiner interactions, environment, and psychometric characteristics of the tests employed. Cites 10 major psychometric…
Descriptors: Educational Diagnosis, Individual Differences, Psychological Evaluation, Psychological Testing
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling

Douglas, Dan – Annual Review of Applied Linguistics, 1995
Reviews recent theoretical, methodological, and analytical developments in language testing, focusing on more refined models of language ability, reliability and validity, performance testing, innovative test formats, new applications of Item Response Theory and Generalizability Theory to test performance. An annotated bibliography discusses seven…
Descriptors: Annotated Bibliographies, Evaluation Methods, Language Proficiency, Language Tests
Crocker, Linda; Algina, James – 1986
This text was written to help the reader acquire a base of knowledge about classical psychometrics and to integrate new ideas into that framework of knowledge. The material is organized into five units: (1) introduction to measurement theory; (2) reliability; (3) validity; (4) item analysis in test development; and (5) test scoring and…
Descriptors: Item Analysis, Measurement Techniques, Psychometrics, Scoring
Naizer, Gilbert – 1992
A measurement approach called generalizability theory (G-theory) is an important alternative to the more familiar classical measurement theory that yields less useful coefficients such as alpha or the KR-20 coefficient. G-theory is a theory about the dependability of behavioral measurements that allows the simultaneous estimation of multiple…
Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Higher Education