Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 36 |
| Since 2017 (last 10 years) | 115 |
| Since 2007 (last 20 years) | 378 |
Descriptor
| Test Theory | 1166 |
| Test Items | 262 |
| Test Reliability | 252 |
| Test Construction | 246 |
| Test Validity | 245 |
| Psychometrics | 183 |
| Scores | 176 |
| Item Response Theory | 168 |
| Foreign Countries | 160 |
| Item Analysis | 141 |
| Statistical Analysis | 134 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Location
| United States | 17 |
| United Kingdom (England) | 15 |
| Canada | 14 |
| Australia | 13 |
| Turkey | 12 |
| Sweden | 8 |
| United Kingdom | 8 |
| Netherlands | 7 |
| Texas | 7 |
| New York | 6 |
| Taiwan | 6 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 3 |
| Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Stetson, Elton G. – 1973
After employees of private firms completed several rapid reading classes and achieved remarkable gains on the Nelson-Denny Reading Test, the question was raised as to whether the increases in scores were due to the increased number of items attempted on the posttest. A preliminary analysis indicated that students attempted an average of 14.6 and…
Descriptors: Adults, Reading Achievement, Reading Comprehension, Reading Research
Kane, Michael T. – 1980
The reliability and validity of measurement is analyzed by a sampling model based on generalizability theory. A model for the relationship between a measurement procedure and an attribute is developed from an analysis of how measurements are used and interpreted in science. The model provides a basis for analyzing the concept of an error of…
Descriptors: Attribution Theory, Behavioral Sciences, Error of Measurement, Mathematical Models
Haladyna, Tom; And Others – 1980
A theory was conceived to explain student, teacher, and classroom environment characteristics or constructs, which may influence student attitudes toward school and various subjects. A questionnaire representing the constructs, the Inventory of Affective Aspects of Schooling (IAAS), was developed and administered to 601 students in grade 4. Factor…
Descriptors: Affective Measures, Classroom Environment, Factor Structure, Intermediate Grades
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Powell, J. C. – 1976
The results of five studies into the characteristics of wrong answers as a class of divergent behavior are presented. The evidence from these studies, when taken in combination, suggests that the tendency of researchers to ignore wrong answers has been a fundamental procedural error of broad scope and serious consequences. Instead of the straight…
Descriptors: Behavior Change, Career Development, Developmental Stages, Divergent Thinking
Peer reviewedWerts, C. E.; And Others – Educational and Psychological Measurement, 1977
The psychometric application of Joreskog's procedure for simultaneous factor analysis in several populations is illustrated. Using Scholastic Aptitude Test data from two samples, procedures are shown for checking test construction assumptions about units of measurement and error variance, within and between samples. (Author)
Descriptors: Career Development, Factor Analysis, Goodness of Fit, High School Students
Peer reviewedTindal, Gerald; And Others – Remedial and Special Education (RASE), 1987
The study examined the hypothesis that different evaluative interpretations of studies of special education effectiveness may be a function of the manner in which data are summarized and reported. Four metrics are compared including raw score, grade-equivalent score, z-score, and discrepancy index. Criteria for selecting metrics for program…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Grade Equivalent Scores
Peer reviewedJaradat, Derar; Sawaged, Sari – Journal of Educational Measurement, 1986
The impact of the Subset Selection Technique (SST) for multiple-choice items on certain properties of a test was compared with that of two other methods, the Number Right and the Correction for Guessing Formula. Results indicated that SST outperformed the other two, producing higher reliability and validity without favoring high risk takers.…
Descriptors: Foreign Countries, Grade 9, Guessing (Tests), Measurement Techniques
Peer reviewedHawk, Jane W.; And Others – Educational and Psychological Measurement, 1984
The Mikulecky Behavioral Reading Attitude Measure (MBRAM) was designed to measure secondary and postsecondary respondents' attitudes toward reading based on Krathwohl's affective development model. This study investigated the factorial validity of the MBRAM using the responses of 411 gifted junior high school students. (Author/BS)
Descriptors: Attitude Measures, Developmental Stages, Factor Structure, Gifted
O'Neil, Harold F., Jr.; Schacter, John – 1997
This document reviews several theoretical frameworks of problem-solving, provides a definition of the construct, suggests ways of measuring the construct, focuses on issues for assessment, and provides specifications for the computer-based assessment of problem solving. As defined in the model of the Center for Research on Evaluation, Standards,…
Descriptors: Computer Assisted Testing, Computer Software, Criteria, Educational Assessment
van der Linden, Wim J. – Evaluation in Education: International Progress, 1982
In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)
Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models
Peer reviewedLennon, Roger T. – Educational Measurement: Issues and Practice, 1982
Continuing attention to test theory, test development, test interpretation and use, test monitoring and control, test consumer education, and the social and political consequences of testing is suggested as the primary concern of the National Council on Measurement in Education (NCME). (CM)
Descriptors: Consumer Education, Educational Testing, Elementary Secondary Education, Measurement Objectives
Peer reviewedBlackburn, John D. – American Business Law Journal, 1980
Since 1970 the CPA Law Exam has been heavily weighted in a few of the 14 content areas, raising the question of whether or not there are too many legal areas for which the student is held responsible. (Journal availability: Fred B. Rothman & Co., 10368 W. Centennial Road, Littleton, CO 80123, $4.00.) (MSE)
Descriptors: Certification, Certified Public Accountants, Content Analysis, Evaluation Criteria
Peer reviewedMoulthrop, Robert – Peabody Journal of Education, 1981
The inclusion of a position on standardized testing in the 1980 Democratic Party platform established testing as a political and legislative issue. Testing regulations have been introduced in 20 states and in Congess, involving critics and supporters such as: the National Education Association, Nader's Public Interest Research Groups, and the…
Descriptors: College Entrance Examinations, Educational Legislation, Educational Testing, Educationally Disadvantaged
Peer reviewedGordon, Robert A. – Intelligence, 1997
Shows why the role of intelligence in everyday life is often underestimated, drawing an analogy that examines outcomes of life as analogs of items within classical test theory. In addition, a population-IQ model is explained that tests for the pooled effects of intelligence at individual, individual context, and population levels. (SLD)
Descriptors: Context Effect, Daily Living Skills, Individual Differences, Intelligence


