Publication Date
In 2025 | 3 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 41 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 395 |
Descriptor
Test Theory | 1161 |
Test Items | 261 |
Test Reliability | 252 |
Test Construction | 245 |
Test Validity | 245 |
Psychometrics | 181 |
Scores | 176 |
Item Response Theory | 165 |
Foreign Countries | 159 |
Item Analysis | 141 |
Statistical Analysis | 134 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
United States | 17 |
United Kingdom (England) | 15 |
Canada | 14 |
Australia | 13 |
Turkey | 12 |
Sweden | 8 |
United Kingdom | 8 |
Netherlands | 7 |
Texas | 7 |
New York | 6 |
Taiwan | 6 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Elementary and Secondary… | 3 |
Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Strasler, Gregg M. – 1980
The relationship between classical discrimination indices (CDI) and criterion-referenced discrimination indices (CRDI) and the appropriateness of each for use on criterion-referenced tests are investigated. A CRDI is proposed that attempts to separate those who master material from those who do not master material. A 26 item multiple-choice…
Descriptors: Criterion Referenced Tests, Discriminant Analysis, Higher Education, Mastery Learning
Ellett, Frederick S., Jr. – 1981
Basic issues in criterion-referenced measurement are addressed. In section II, issues involved in determining what a person does and can do are considered. A preliminary analysis of "can" is given which shows that there are several important senses of "can". In section III, results of an analysis of "ability" are…
Descriptors: Academic Ability, Behavior Theories, Criterion Referenced Tests, Induction

Samejima, Fumiko – Applied Psychological Measurement, 1977
Several important implications in latent trait theory, with implications for individualized or tailored testing, are pointed out. A way of using the information function in tailored testing in connection with the standard error estimation of the ability level using maximum likelihood estimation is suggested. (Author/JKS)
Descriptors: Adaptive Testing, Career Development, Error of Measurement, Item Analysis

Budescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests

Tallmadge, G. Kasten – Journal of Educational Measurement, 1985
Support for the validity of the equipercentile assumption is presented in contrast with the conclusion of Powers, Slaughter, and Helmick (EJ 289 091). Observed "gains" from pre- to posttests are better attributed to stakeholder bias, posttests that match curriculum content too closely, or a combination of these factors. (Author/DWH)
Descriptors: Data Interpretation, Evaluation Methods, Norm Referenced Tests, Predictive Measurement

Tuman, Myron C.; Miles, Thomas H. – Teaching English in the Two-Year College, 1987
Indicates that if the cloze scores of students in a small group are well distributed, then it is possible to identify which essays would be selected as best and worst by an English professor. Shows that cloze testing constitutes a relatively effective placement instrument when the readers are unschooled. Includes statistical tables. (JD)
Descriptors: Cloze Procedure, Reading Tests, Student Placement, Test Reliability

Divgi, D. R. – Journal of Educational Measurement, 1986
This paper discusses various issues involved in using the Rasch Model with multiple-choice tests and questions the suitability of this model for multiple-choice items. Results of some past studies supporting the model are shown to be irrelevant. The effects of the model's misfit on test equating are demonstrated. (Author JAZ)
Descriptors: Equated Scores, Goodness of Fit, Latent Trait Theory, Mathematical Models

Angoff, William H.; Cowell, William R. – Journal of Educational Measurement, 1986
Linear conversions were developed relating scores on recent forms of the Graduate Record Examinations. Conversions based on specially selected subpopulations were compared with total-group conversions and evaluated. Conclusions indicated that the data clearly support the assumption of population independence for homogenoeous tests, but not quite…
Descriptors: College Entrance Examinations, Equated Scores, Groups, Higher Education

Movshovitz-Hadar, Nitsa; And Others – Journal for Research in Mathematics Education, 1987
A content-oriented analysis of written solutions to test items in Israeli high school graduation examinations in mathematics yielded six error categories: misused data; misinterpreted language; logically invalid inference; distorted theorem or definition; unverified solution; and technical error. (Authors/MNS)
Descriptors: Educational Research, Error Patterns, Mathematics Instruction, Research Reports

Huynh, Huynh – Journal of Educational Statistics, 1986
Under the assumptions of classical measurement theory and the condition of normality, a formula is derived for the reliability of composite scores. The formula represents an extension of the Spearman-Brown formula to the case of truncated data. (Author/JAZ)
Descriptors: Computer Simulation, Error of Measurement, Expectancy Tables, Scoring Formulas

Beck, Michael D. – Educational Measurement: Issues and Practice, 1986
Tracing the development of the Otis test series, the author argues that there will be a continuing demand for group-administered general mental ability tests in education. He foresees a need for better ways of relating ability test scores with skills and achievements to make them more educationally useful. (Author/JAZ)
Descriptors: Cognitive Ability, Cognitive Measurement, Cognitive Tests, Educational History

Collis, Kevin F.; And Others – Journal for Research in Mathematics Education, 1986
Described are procedures followed in developing, administering, and scoring a set of mathematical problem-solving superitems and examining their construct validity through a recently developed evaluation technique associated with a taxonomy of the structure of learned outcomes. Data strongly support the validity of the underlying theoretical…
Descriptors: Educational Research, Elementary Secondary Education, Mathematics Education, Problem Solving

Feldt, Leonard S. – Educational and Psychological Measurement, 1984
The binomial error model includes form-to-form difficulty differences as error variance and leads to Ruder-Richardson formula 21 as an estimate of reliability. If the form-to-form component is removed from the estimate of error variance, the binomial model leads to KR 20 as the reliability estimate. (Author/BW)
Descriptors: Achievement Tests, Difficulty Level, Error of Measurement, Mathematical Formulas

Lord, Frederic M. – Psychometrika, 1985
Given a loss function, an asymptotic method for optimal test design for a specified target population of examinees is presented. Also, of more practical use, given an existing unidimensional test and target population, a way is presented to find the loss function for which the test is optimal. (NSF)
Descriptors: Error of Measurement, Higher Education, Item Sampling, Latent Trait Theory

Neeper, Ronald; Lahey, Benjamin B. – School Psychology Review, 1986
A factor analysis of a revised and expanded 102-item version of the Children's Behavior Scale (CBRS) was conducted to complete development of this teacher rating scale. Results confirmed previous research on the CBRS that indicated that two independent dimensions of cognitive deficits can be identified in teacher ratings. (Author/LMO)
Descriptors: Behavior Rating Scales, Correlation, Elementary Secondary Education, Factor Analysis