Publication Date
In 2025 | 3 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 41 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 395 |
Descriptor
Test Theory | 1161 |
Test Items | 261 |
Test Reliability | 252 |
Test Construction | 245 |
Test Validity | 245 |
Psychometrics | 181 |
Scores | 176 |
Item Response Theory | 165 |
Foreign Countries | 159 |
Item Analysis | 141 |
Statistical Analysis | 134 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
United States | 17 |
United Kingdom (England) | 15 |
Canada | 14 |
Australia | 13 |
Turkey | 12 |
Sweden | 8 |
United Kingdom | 8 |
Netherlands | 7 |
Texas | 7 |
New York | 6 |
Taiwan | 6 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Elementary and Secondary… | 3 |
Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Wilcox, Rand R. – 1978
Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…
Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models
Office of Personnel Management, Washington, DC. – 1979
The stimulus for this colloquium was the convergence of several significant developments bearing on the construct validation of standardized tests and other assessment methods. Of these developments, some were fundamental to psychology as a science; others reflected socio-political pressures on measurement in education and employment. The ten…
Descriptors: Aptitude Tests, Educational Practices, Educational Testing, Employment Practices
Garcia-Quintana, Roan A. – 1981
Person fit in the Rasch one parameter model is investigated. The first set of data deals with grade 3 students responding to the mathematics computation and reading vocabulary subscales of the Comprehensive Test of Basic Skills Forms (CTBS/S). The second set of data deals with grade 3 students responding to the Basic Skills Assessment Program…
Descriptors: Basic Skills, Criterion Referenced Tests, Goodness of Fit, Grade 3
Stetson, Elton G. – 1973
After employees of private firms completed several rapid reading classes and achieved remarkable gains on the Nelson-Denny Reading Test, the question was raised as to whether the increases in scores were due to the increased number of items attempted on the posttest. A preliminary analysis indicated that students attempted an average of 14.6 and…
Descriptors: Adults, Reading Achievement, Reading Comprehension, Reading Research
Kane, Michael T. – 1980
The reliability and validity of measurement is analyzed by a sampling model based on generalizability theory. A model for the relationship between a measurement procedure and an attribute is developed from an analysis of how measurements are used and interpreted in science. The model provides a basis for analyzing the concept of an error of…
Descriptors: Attribution Theory, Behavioral Sciences, Error of Measurement, Mathematical Models
Haladyna, Tom; And Others – 1980
A theory was conceived to explain student, teacher, and classroom environment characteristics or constructs, which may influence student attitudes toward school and various subjects. A questionnaire representing the constructs, the Inventory of Affective Aspects of Schooling (IAAS), was developed and administered to 601 students in grade 4. Factor…
Descriptors: Affective Measures, Classroom Environment, Factor Structure, Intermediate Grades
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Powell, J. C. – 1976
The results of five studies into the characteristics of wrong answers as a class of divergent behavior are presented. The evidence from these studies, when taken in combination, suggests that the tendency of researchers to ignore wrong answers has been a fundamental procedural error of broad scope and serious consequences. Instead of the straight…
Descriptors: Behavior Change, Career Development, Developmental Stages, Divergent Thinking

Werts, C. E.; And Others – Educational and Psychological Measurement, 1977
The psychometric application of Joreskog's procedure for simultaneous factor analysis in several populations is illustrated. Using Scholastic Aptitude Test data from two samples, procedures are shown for checking test construction assumptions about units of measurement and error variance, within and between samples. (Author)
Descriptors: Career Development, Factor Analysis, Goodness of Fit, High School Students

Tindal, Gerald; And Others – Remedial and Special Education (RASE), 1987
The study examined the hypothesis that different evaluative interpretations of studies of special education effectiveness may be a function of the manner in which data are summarized and reported. Four metrics are compared including raw score, grade-equivalent score, z-score, and discrepancy index. Criteria for selecting metrics for program…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Grade Equivalent Scores

Jaradat, Derar; Sawaged, Sari – Journal of Educational Measurement, 1986
The impact of the Subset Selection Technique (SST) for multiple-choice items on certain properties of a test was compared with that of two other methods, the Number Right and the Correction for Guessing Formula. Results indicated that SST outperformed the other two, producing higher reliability and validity without favoring high risk takers.…
Descriptors: Foreign Countries, Grade 9, Guessing (Tests), Measurement Techniques

Hawk, Jane W.; And Others – Educational and Psychological Measurement, 1984
The Mikulecky Behavioral Reading Attitude Measure (MBRAM) was designed to measure secondary and postsecondary respondents' attitudes toward reading based on Krathwohl's affective development model. This study investigated the factorial validity of the MBRAM using the responses of 411 gifted junior high school students. (Author/BS)
Descriptors: Attitude Measures, Developmental Stages, Factor Structure, Gifted
O'Neil, Harold F., Jr.; Schacter, John – 1997
This document reviews several theoretical frameworks of problem-solving, provides a definition of the construct, suggests ways of measuring the construct, focuses on issues for assessment, and provides specifications for the computer-based assessment of problem solving. As defined in the model of the Center for Research on Evaluation, Standards,…
Descriptors: Computer Assisted Testing, Computer Software, Criteria, Educational Assessment
van der Linden, Wim J. – Evaluation in Education: International Progress, 1982
In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)
Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models

Lennon, Roger T. – Educational Measurement: Issues and Practice, 1982
Continuing attention to test theory, test development, test interpretation and use, test monitoring and control, test consumer education, and the social and political consequences of testing is suggested as the primary concern of the National Council on Measurement in Education (NCME). (CM)
Descriptors: Consumer Education, Educational Testing, Elementary Secondary Education, Measurement Objectives