Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 36 |
| Since 2017 (last 10 years) | 115 |
| Since 2007 (last 20 years) | 378 |
Descriptor
| Test Theory | 1166 |
| Test Items | 262 |
| Test Reliability | 252 |
| Test Construction | 246 |
| Test Validity | 245 |
| Psychometrics | 183 |
| Scores | 176 |
| Item Response Theory | 168 |
| Foreign Countries | 160 |
| Item Analysis | 141 |
| Statistical Analysis | 134 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Location
| United States | 17 |
| United Kingdom (England) | 15 |
| Canada | 14 |
| Australia | 13 |
| Turkey | 12 |
| Sweden | 8 |
| United Kingdom | 8 |
| Netherlands | 7 |
| Texas | 7 |
| New York | 6 |
| Taiwan | 6 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 3 |
| Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedStenner, A. Jackson; And Others – Journal of Educational Measurement, 1983
In an attempt to restore the symmetry and balance between the study of person and item variation, this paper presents a novel methodology construct specification equations, which allows one to ascertain from the lawful behavior of items what an instrument is measuring. (Author/PN)
Descriptors: Measurement Objectives, Measurement Techniques, Research Methodology, Test Construction
Peer reviewedThayer, Dorothy T. – Psychometrika, 1983
Estimation techniques for generating the covariance matrix for two new tests and an existing test without the necessity of any examinee having to take two complete tests is presented. An application of these techniques to linear, observed-score, test equating is presented. (Author/JKS)
Descriptors: Correlation, Equated Scores, Estimation (Mathematics), Matrices
van den Brink, Wulfert – Evaluation in Education: International Progress, 1982
Binomial models for domain-referenced testing are compared, emphasizing the assumptions underlying the beta-binomial model. Advantages and disadvantages are discussed. A proposed item sampling model is presented which takes the effect of guessing into account. (Author/CM)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Sampling, Measurement Techniques
Peer reviewedMurphy, R. J. L. – British Journal of Educational Psychology, 1982
To study sex differences in test performance, the performance of males and females on 16 General Certificate of Education exams was analyzed in England. Results show that males perform better on objective tests than females. (Author/JJD)
Descriptors: Achievement, Foreign Countries, Objective Tests, Prediction
Duncan, R. Eric – Measurement and Evaluation in Guidance, 1983
Reanalyzes data provided by Swanson (1976) and Straton and Catts (1980) to test claims of superiority for the three-alternative multiple-choice item test and to present possible oversights made by these researchers. Results suggest it is doubtful that three-alternative test items are better than four-alternative items. (PAS)
Descriptors: Achievement Tests, Adults, Guidance Personnel, Multiple Choice Tests
Peer reviewedWinne, Philip H.; Belfry, M. Joan – Journal of Educational Measurement, 1982
This review of issues about correcting for attenuation concludes that the basic difficulty lies in being able to identify and equate sources of variance in estimates of validity and reliability. Recommendations are proposed for cautious use of correction for attenuation. (Author/CM)
Descriptors: Correlation, Error of Measurement, Research Methodology, Statistical Analysis
Juni, Samuel; Koenig, Esther J. – Measurement and Evaluation in Guidance, 1982
Critiques the Jackson Vocational Interest Survey. Found the randomized technique of matching statements into forced-choice item pairs is a major flaw in the instrument. Explores conceptual issues arising when item clustering is used with forced-choice statement matching. Introduces contingency validity in item construction. (R)
Descriptors: Forced Choice Technique, Interest Inventories, Item Analysis, Psychometrics
Peer reviewedHamilton, Lawrence C. – Journal of Educational Measurement, 1981
Errors in self-reports of three academic performance measures are analyzed. Empirical errors are shown to depart radically from both no-error and random-error assumptions. Self-reports by females depart farther from the no-error and random-error models for all three performance measures. (Author/BW)
Descriptors: Academic Achievement, Error Patterns, Grade Point Average, Models
Peer reviewedReynolds, Thomas J. – Educational and Psychological Measurement, 1981
Cliff's Index "c" derived from an item dominance matrix is utilized in a clustering approach, termed extracting Reliable Guttman Orders (ERGO), to isolate Guttman-type item hierarchies. A comparison of factor analysis to the ERGO is made on social distance data involving multiple ethnic groups. (Author/BW)
Descriptors: Cluster Analysis, Difficulty Level, Factor Analysis, Item Analysis
Peer reviewedWilcox, Rand R. – Educational and Psychological Measurement, 1981
The paper considers the problem of selecting the t best of k normal populations and simultaneously determining whether the selected populations have a mean larger than a known standard. Illustrations are given for selecting the t best of k examinees when the binomial error model applies. (Author)
Descriptors: Competitive Selection, Criterion Referenced Tests, Decision Making, Mathematical Models
Peer reviewedSchulman, Robert S. – Psychometrika, 1979
An alternative to the uniform probability distribution model for ordinal data is considered. Implications for statistics and for test theory are discussed. (JKS)
Descriptors: Career Development, Correlation, Mathematical Models, Nonparametric Statistics
Peer reviewedHogan, Thomas P.; Brezinski, Kristen L. – Mathematical Thinking and Learning, 2003
Examines relationships among tests for numerosity, measurement, and computational estimation, and recognizes tests for numerical facility and quantitative reasoning using principal components analysis. Identifies two components. The first component aligned computational estimation with numerical facility and general quantitative reasoning while…
Descriptors: Elementary Secondary Education, Factor Analysis, Mathematics Instruction, Mathematics Tests
Peer reviewedReckase, Mark D. – Psychological Assessment, 1996
Summarizes the current state of the art in test construction and contrasts it with previous conceptual models, some of which are wrong or misleading. New methodologies for item selection and review are presented, with current thinking on the specification of technical characteristics of tests. (Author/SLD)
Descriptors: Mathematical Models, Psychological Testing, Selection, State of the Art Reviews
Peer reviewedGlas, Cees A. W. – International Journal of Testing, 2002
"Test Scoring" provides insight into psychometric procedures as used by a professional testing company or in large-scale projects. The book contains an overview of standard test theory, a discussion of factor analytic theory, and an exploration of special applications and problems. (SLD)
Descriptors: Educational Testing, Factor Analysis, Measurement Techniques, Psychometrics
Peer reviewedBachman, Lyle F. – Annual Review of Applied Linguistics, 1988
Discusses three research/testing interfaces in second-language (L2) testing: the covariance structure analysis of ex post facto correlational data, the qualitative investigation of test-taking processes, and the development of L2 assessment instruments based on developmental sequences in L2 acquisition. (61 references) (GLR)
Descriptors: Language Proficiency, Language Research, Language Tests, Multivariate Analysis


