Peer reviewed: Tindal, Gerald; And Others – Remedial and Special Education (RASE), 1987
The study examined the hypothesis that different evaluative interpretations of studies of special education effectiveness may be a function of the manner in which data are summarized and reported. Four metrics are compared including raw score, grade-equivalent score, z-score, and discrepancy index. Criteria for selecting metrics for program…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Grade Equivalent Scores
Peer reviewed: Jaradat, Derar; Sawaged, Sari – Journal of Educational Measurement, 1986
The impact of the Subset Selection Technique (SST) for multiple-choice items on certain properties of a test was compared with that of two other methods, the Number Right and the Correction for Guessing Formula. Results indicated that SST outperformed the other two, producing higher reliability and validity without favoring high risk takers.…
Descriptors: Foreign Countries, Grade 9, Guessing (Tests), Measurement Techniques
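As an illustration of the two baseline scoring rules named in this abstract, here is a minimal sketch of Number Right scoring and the conventional correction-for-guessing formula, S = R - W/(k - 1). The function names and the example figures are hypothetical, and the abstract does not describe how the Subset Selection Technique itself is scored, so it is not sketched here.

```python
def number_right(right: int) -> float:
    """Number Right score: one point per correct answer, no penalty."""
    return float(right)


def corrected_for_guessing(right: int, wrong: int, options: int) -> float:
    """Conventional formula score S = R - W / (k - 1).

    right   -- number of items answered correctly (R)
    wrong   -- number of items answered incorrectly (W); omitted items
               are neither rewarded nor penalized
    options -- number of answer choices per item (k), assumed constant
    """
    if options < 2:
        raise ValueError("each item needs at least two answer options")
    return right - wrong / (options - 1)


# Hypothetical examinee: 30 right, 8 wrong, 2 omitted on a 40-item,
# five-option test.
raw = number_right(30)
corrected = corrected_for_guessing(right=30, wrong=8, options=5)
```

Under the formula score, an examinee who guesses blindly on a five-option item gains nothing on average, which is why the rule is usually contrasted with Number Right when risk-taking is a concern.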
Peer reviewed: Hawk, Jane W.; And Others – Educational and Psychological Measurement, 1984
The Mikulecky Behavioral Reading Attitude Measure (MBRAM) was designed to measure secondary and postsecondary respondents' attitudes toward reading based on Krathwohl's affective development model. This study investigated the factorial validity of the MBRAM using the responses of 411 gifted junior high school students. (Author/BS)
Descriptors: Attitude Measures, Developmental Stages, Factor Structure, Gifted
Ozsevgec, Tuncay; Cepni, Salih – Online Submission, 2006
In order to determine students' achievement, science teachers have to develop their own assessment tools. This study attempts to find out the relationship between the teachers' assessment tools and students' cognitive development according to the teachers' teaching experiences. Six open-ended survey questions were developed and delivered to 59…
Descriptors: Foreign Countries, Correlation, Science Teachers, Evaluation Methods
O'Neil, Harold F., Jr.; Schacter, John – 1997
This document reviews several theoretical frameworks of problem-solving, provides a definition of the construct, suggests ways of measuring the construct, focuses on issues for assessment, and provides specifications for the computer-based assessment of problem solving. As defined in the model of the Center for Research on Evaluation, Standards,…
Descriptors: Computer Assisted Testing, Computer Software, Criteria, Educational Assessment
van der Linden, Wim J. – Evaluation in Education: International Progress, 1982
In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)
Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models
Peer reviewed: Lennon, Roger T. – Educational Measurement: Issues and Practice, 1982
Continuing attention to test theory, test development, test interpretation and use, test monitoring and control, test consumer education, and the social and political consequences of testing is suggested as the primary concern of the National Council on Measurement in Education (NCME). (CM)
Descriptors: Consumer Education, Educational Testing, Elementary Secondary Education, Measurement Objectives
Peer reviewed: Blackburn, John D. – American Business Law Journal, 1980
Since 1970 the CPA Law Exam has been heavily weighted in a few of the 14 content areas, raising the question of whether or not there are too many legal areas for which the student is held responsible. (Journal availability: Fred B. Rothman & Co., 10368 W. Centennial Road, Littleton, CO 80123, $4.00.) (MSE)
Descriptors: Certification, Certified Public Accountants, Content Analysis, Evaluation Criteria
Peer reviewed: Moulthrop, Robert – Peabody Journal of Education, 1981
The inclusion of a position on standardized testing in the 1980 Democratic Party platform established testing as a political and legislative issue. Testing regulations have been introduced in 20 states and in Congress, involving critics and supporters such as: the National Education Association, Nader's Public Interest Research Groups, and the…
Descriptors: College Entrance Examinations, Educational Legislation, Educational Testing, Educationally Disadvantaged
Peer reviewed: Gordon, Robert A. – Intelligence, 1997
Shows why the role of intelligence in everyday life is often underestimated, drawing an analogy that examines outcomes of life as analogs of items within classical test theory. In addition, a population-IQ model is explained that tests for the pooled effects of intelligence at individual, individual context, and population levels. (SLD)
Descriptors: Context Effect, Daily Living Skills, Individual Differences, Intelligence
Peer reviewed: Strein, William – Journal of School Psychology, 1990
Compared the Woodcock-Johnson Tests of Cognitive Ability (WJTCA) score profiles of different cultural groups, using 442 White and 435 non-White subjects drawn from the kindergarten through grade 12 subset of WJTCA standardization data. Determined that data allowed for classification of the subtests by both curve and cultural effects criteria.…
Descriptors: Classification, Cognitive Ability, Cognitive Measurement, Elementary School Students
Jacobson, Linda – American School Board Journal, 1996
Education standards are left to the discretion of individual states. However, efforts to help states and local school districts define world-class standards are intensifying. The U.S. Department of Education, the National Education Goals Panel, and New Standards, a partnership of 17 states and 6 school districts, are among those involved. (MLF)
Descriptors: Academic Standards, Benchmarking, Comparative Analysis, Educational Assessment
Peer reviewed: Mislevy, Robert J. – Psychometrika, 1994
Educational assessment concerns inference about student knowledge, skills, and accomplishments. Test theory has evolved in part to address questions of weight, coverage, and import of data. Resulting concepts and techniques can be viewed as applications of more general principles for inference in the presence of uncertainty. (SLD)
Descriptors: Bayesian Statistics, Cognitive Psychology, Educational Assessment, Inferences
Peer reviewed: Ornstein, Allan C.; Gilman, David A. – Contemporary Education, 1991
Explains and contrasts the techniques and philosophies of norm-referenced (NRT) and criterion-referenced tests (CRT). NRTs are criticized for lack of useful information and control. CRTs are usually teacher-made and customized to fit the classroom needs, offering more control over test content, but few teachers are prepared to develop them. (SM)
Descriptors: Criterion Referenced Tests, Elementary Secondary Education, Norm Referenced Tests, Standardized Tests
Peer reviewed: Cook, Linda L.; Eignor, Daniel R. – Educational Measurement: Issues and Practice, 1991
This paper provides the basis for understanding score equating through item response theory (IRT). Theoretical justifications and practical advantages of IRT true-score test procedures are discussed. Three steps in the equating process are specified, and a self-test is included. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Item Response Theory, Mathematical Models
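The abstract does not list the paper's three steps, so the following is only a sketch of one common formulation of IRT true-score equating, assuming a three-parameter logistic (3PL) model and item parameters already placed on a common scale. All names and parameter values are hypothetical. The idea: find the ability at which the expected score on form X equals the observed score, then report the expected score on form Y at that ability.

```python
import math


def p_correct(theta: float, a: float, b: float, c: float = 0.0) -> float:
    """3PL probability of a correct response (assumed model)."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def true_score(theta: float, items) -> float:
    """Expected number-right score: the sum of item probabilities."""
    return sum(p_correct(theta, a, b, c) for a, b, c in items)


def equate(score_x: float, form_x, form_y, lo: float = -6.0, hi: float = 6.0) -> float:
    """Map a true score on form X to the equivalent true score on form Y.

    Uses bisection, which works because true_score is increasing in
    theta when every discrimination a is positive. score_x must lie
    between the form X true scores at lo and hi.
    """
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if true_score(mid, form_x) < score_x:
            lo = mid
        else:
            hi = mid
    return true_score((lo + hi) / 2.0, form_y)


# Hypothetical (a, b, c) item parameters for two short forms.
form_x = [(1.0, -0.5, 0.2), (1.2, 0.0, 0.2), (0.8, 0.7, 0.2)]
form_y = [(1.1, -0.2, 0.2), (0.9, 0.3, 0.2), (1.0, 1.0, 0.2)]
equivalent = equate(2.0, form_x, form_y)
```

Scores at or below the sum of the guessing parameters have no corresponding ability under the 3PL model, so an operational procedure needs a separate rule for that range; this sketch leaves it out.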


