Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 36 |
| Since 2017 (last 10 years) | 115 |
| Since 2007 (last 20 years) | 378 |
Descriptor
| Test Theory | 1166 |
| Test Items | 262 |
| Test Reliability | 252 |
| Test Construction | 246 |
| Test Validity | 245 |
| Psychometrics | 183 |
| Scores | 176 |
| Item Response Theory | 168 |
| Foreign Countries | 160 |
| Item Analysis | 141 |
| Statistical Analysis | 134 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Location
| United States | 17 |
| United Kingdom (England) | 15 |
| Canada | 14 |
| Australia | 13 |
| Turkey | 12 |
| Sweden | 8 |
| United Kingdom | 8 |
| Netherlands | 7 |
| Texas | 7 |
| New York | 6 |
| Taiwan | 6 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 3 |
| Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedSeddon, G. M. – British Educational Research Journal, 1988
Demonstrates that some commonly used indices can be misleading in their quantification of reliability. The effects are most pronounced on gain or difference scores. Proposals are made to avoid sources of invalidity by using a procedure to assess reliability in terms of upper and lower limits for the true scores of each examinee. (Author/JDH)
Descriptors: Foreign Countries, Higher Education, Research Problems, Statistical Studies
Peer reviewedJarjoura, David – Psychometrika, 1983
The problem of predicting universe scores for samples of examinees based on their responses to samples of items is treated. The measurement model categorizes items according to the cells of a table of test specifications, and the linear function derived for minimizing error variance in prediction uses responses to these categories. (Author/JKS)
Descriptors: Error of Measurement, Generalizability Theory, Item Sampling, Prediction
Peer reviewedVan De Vijver, Fon J. R.; Poortinga, Ype H. – Journal of Educational Measurement, 1985
McCauley and Colberg described a theory of transportability and provided data to demonstrate the feasibility of their approach. It is argued that the transportability notion resembles earlier cross cultural work and does not add new insights into cross cultural comparison. Their statistical checks do not preclude the possibility of bias.…
Descriptors: Cross Cultural Studies, Cultural Interrelationships, Culture Fair Tests, Research Problems
Peer reviewedMasters, Geofferey N. – Psychometrika, 1985
Latent trait and latent class analyses of Likert-type data are compared. Key similarities and differences between these methods are described and illustrated by applying a latent trait model and a latent class model to the analysis of a set of "life satisfaction" data. (Author/NSF)
Descriptors: Attitude Measures, Goodness of Fit, Latent Trait Theory, Mathematical Models
Peer reviewedDavison, Mark L. – Psychological Bulletin, 1985
Considers the relationship between coordinate estimates in components analysis and multidimensional scaling. Reports three small Monte Carlo studies comparing nonmetric scaling solutions to components analysis. Results are related to other methodological issues surrounding research on the general ability factor, response tendencies in…
Descriptors: Ability, Monte Carlo Methods, Personnel Evaluation, Scaling
Reese, Lynda M. – 1999
This study extended prior Law School Admission Council (LSAC) research related to the item response theory (IRT) local item independence assumption into the realm of classical test theory. Initially, results from the Law School Admission Test (LSAT) and two other tests were investigated to determine the approximate state of local item independence…
Descriptors: College Entrance Examinations, Item Response Theory, Law Schools, Test Construction
Peer reviewedFischer, Gerhard H. – Psychometrika, 1983
Two linearly constrained models based on the Rasch model are discussed. Necessary and sufficient conditions for the existence of unique conditional maximum likelihood estimators are derived. Methods for hypothesis testing within this framework are proposed. (Author/JKS)
Descriptors: Estimation (Mathematics), Hypothesis Testing, Latent Trait Theory, Mathematical Models
Barron, Frank – New Directions for Testing and Measurement, 1982
The need to use images and imagery in language in schools, and to recognize and measure such aptitudes are discussed. Tests of imagination are presented, and the San Francisco Art Institute Study of predictors of ability in art illustrates methods and implications for measurement of creativity in educational programs. (CM)
Descriptors: Aptitude Tests, Art Expression, Creative Development, Imagination
Peer reviewedZimmerman, Donald W.; And Others – Journal of Experimental Education, 1981
Reliability coefficients of linear combinations of observed scores have anomalous properties which have led to difficulties in the investigation of difference scores and gain scores in test theory. Discrepancies between classical results and correct results obtained from more general formulas, which allow for correlated errors, are examined…
Descriptors: Error of Measurement, Mathematical Formulas, Mathematical Models, Scores
Peer reviewedChatterji, Madhabi – Journal of Applied Measurement, 2002
Examined the validity of data generated by the School Readiness for Reforms: Leader Questionnaire using an iterative procedure that combined classical and Rasch rating scale analysis for the analysis of responses from 167 leaders. The combined approach yielded comprehensive diagnostic information on the quality of the instrument's five subscales.…
Descriptors: Administrator Attitudes, Administrators, Educational Change, Elementary Secondary Education
Peer reviewedStrong, Shawn; Smith, Roger – Engineering Design Graphics Journal, 2002
Describes the development of a test designed to allow meaningful and widespread computerized testing of various spatial factors. Examines the differences between traditional paper and pencil and computerized versions of the same test. Compares an interactive test designed to measure a working memory factor to the computerized version of…
Descriptors: Computer Uses in Education, Engineering Education, Higher Education, Spatial Ability
Peer reviewedStudy, Nancy E. – Engineering Design Graphics Journal, 2002
Compares results of Successive Perception Test I (SPT) for the study population of freshman engineering students to their results on the group-administered Purdue Spatial Visualization Test: Visualization of Rotations (PSVT) and the individually administered Haptic Visual Discrimination Test (HVDT). Concludes that either visual and haptic…
Descriptors: Computer Uses in Education, Engineering Education, Higher Education, Spatial Ability
Peer reviewedSinar, Evan F.; Zickar, Michael J. – Applied Psychological Measurement, 2002
Examined the influence of deviant scale items on item parameter estimates of focal scale items and person parameter estimates through a comparison of item response theory (IRT) and classical test theory (CTT) models. Used Monte Carlo methods to explore results from a pilot investigation of job attitude data. Discusses implications for researchers…
Descriptors: Attitudes, Estimation (Mathematics), Monte Carlo Methods, Robustness (Statistics)
Peer reviewedDrasgow, Fritz; And Others – Applied Psychological Measurement, 1989
Multilinear formula scoring (MFS) is reviewed, with emphasis on estimating option characteristic curves (OCSs). MFS was used to estimate OCSs for the arithmetic reasoning subtest of the Armed Services Vocational Aptitude Battery for 2,978 examinees. A second analysis obtained OCSs for simulated data. The use of MFS is discussed. (SLD)
Descriptors: Estimation (Mathematics), Mathematical Models, Multiple Choice Tests, Scores
Peer reviewedChronbach, Lee J. – Educational Measurement: Issues and Practice, 1989
The book reviewed is a compendium of current thinking about measurement theory and test use. It includes content by 26 authors at 3 levels: (1) accessible to educators, policy makers, and graduate students; (2) suited for technical students; and (3) written for qualified measurement specialists. Strengths and weaknesses are noted. (SLD)
Descriptors: Book Reviews, Educational Assessment, Evaluation Methods, Measurement Techniques


