Publication Date
In 2025 | 3 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 41 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 395 |
Descriptor
Test Theory | 1161 |
Test Items | 261 |
Test Reliability | 252 |
Test Construction | 245 |
Test Validity | 245 |
Psychometrics | 181 |
Scores | 176 |
Item Response Theory | 165 |
Foreign Countries | 159 |
Item Analysis | 141 |
Statistical Analysis | 134 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
United States | 17 |
United Kingdom (England) | 15 |
Canada | 14 |
Australia | 13 |
Turkey | 12 |
Sweden | 8 |
United Kingdom | 8 |
Netherlands | 7 |
Texas | 7 |
New York | 6 |
Taiwan | 6 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Elementary and Secondary… | 3 |
Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
MacCann, Robert G. – Psychometrika, 2004
For (0, 1) scored multiple-choice tests, a formula giving test reliability as a function of the number of item options is derived, assuming the "knowledge or random guessing model," the parallelism of the new and old tests (apart from the guessing probability), and the assumptions of classical test theory. It is shown that the formula is a more…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Test Theory
Gump, Steven E. – Educational Research Quarterly, 2007
This review presents an overview of selected articles on the leniency hypothesis: the idea that students give higher evaluations to instructors who grade more leniently. Such articles comprise a small subset of the voluminous research on student evaluations of teaching (SETs). In this diverse literature, research methods and aims have frequently…
Descriptors: Student Evaluation of Teacher Performance, Research Methodology, Meta Analysis, Research Problems
Farr, Roger – 1993
To examine the development of the philosophy of education, educators, and educationists, this parody fable tells the story of a king in a faraway land who asked the old tradesmen to teach the teachers of the kingdom how to teach young people to build houses. The children learned well from this instruction and built many fine houses for the…
Descriptors: Educational Objectives, Educational Philosophy, Elementary Education, Evaluation Methods
Svinicki, Marilla; Koch, Bill – Innovation Abstracts, 1984
The decision of whether to use essay tests or multiple choice tests depends on several qualifiers related to the different characteristics of the tests and the needs of the situation. The most important qualifier involves matching the type of test to the instructional objectives being tested, with multiple choice tests being used to measure a…
Descriptors: Comparative Analysis, Essay Tests, Multiple Choice Tests, Test Format
Mumford, Michael D.; Mendoza, Jorge L. – 1983
The present paper reviews the techniques commonly used to correct an observed correlation coefficient for the simultaneous influence of attenuation and range restriction effects. It is noted that the procedure which is currently in use may be somewhat biased because it treats range restriction and attenuation as independent restrictive influences.…
Descriptors: Correlation, Measurement Techniques, Psychometrics, Research Problems
Gonzalez-Tamayo, Eulogio – 1984
The findings and conclusions from research on predictive validity among markedly different groups are discussed. Empirical findings using the regression line neither support nor contradict the existence of bias or the hypothesis of differential validity. Conclusions drawn from the research are questionable. Elimination of bias in a test, contrary…
Descriptors: Educational Research, Groups, Predictive Validity, Regression (Statistics)
DeVito, Anthony J.; And Others – 1983
To assist the clinician or researcher in scale selection, four symposium papers discussed instruments available to measure test anxiety (TA), with special attention given to the newly-developed Test Anxiety Inventory (TAI). Following an integrative summary delivered by the chairperson (DeVito), the first paper (Conetta and Tryon) reviewed the two…
Descriptors: Affective Measures, Higher Education, Psychological Testing, Test Anxiety
Wilcox, Rand R. – 1979
In the past, several latent structure models have been proposed for handling problems associated with measuring the achievement of examinees. Typically, however, these models describe a specific examinee in terms of an item domain or they describe a few items in terms of a population of examinees. In this paper, a model is proposed which allows a…
Descriptors: Achievement Tests, Guessing (Tests), Mathematical Models, Multiple Choice Tests
Hayford, Paul D.; Salter, Ruth – 1978
Reading comprehension involves a number of distinctly different intellectual skills that can be assessed if the proper techniques are employed. As part of a reading assessment system, two measures of literal comprehension were developed: the Literal Comprehension Details Test (LCDT) and the Paraphrase Reading Test (PRT). Both the LCDT and the PRT…
Descriptors: Measurement Techniques, Reading Comprehension, Reading Tests, Test Construction
Levine, Michael V. – 1976
The relatively hard problem of transforming a given set of curves to curves with the same shape can sometimes be reduced to the easier problem of rendering curves parallel. In this paper a group is associated with the given curves, and it is shown that the reduction from the hard problem to the easy problem is valid whenever the group is…
Descriptors: Career Development, Latent Trait Theory, Mathematical Applications, Mathematical Models

Hunyh, Hunyh; Saunders, Joseph C. – 1979
Comparisons were made among various methods of estimating the reliability of pass-fail decisions based on mastery tests. The reliability indices that are considered are p, the proportion of agreements between two estimates, and kappa, the proportion of agreements corrected for chance. Estimates of these two indices were made on the basis of…
Descriptors: Cutting Scores, Error of Measurement, Mastery Tests, Reliability
Kearns, Jack – 1974
Empirical Bayes point estimates of true score may be obtained if the distribution of observed score for a fixed examinee is approximated in one of several ways by a well-known compound binomial model. The Bayes estimates of true score may be expressed in terms of the observed score distribution and the distribution of a hypothetical binomial test.…
Descriptors: Career Development, Error Patterns, Expectation, Mathematical Models

Braun, Carl; And Others – Journal of Educational Research, 1987
Eight fifth-grade children were tested in three contexts to determine the effects of adult support and regulation on their ability to read words in isolation and to read connected discourse. Findings are discussed in terms of Vygotsky's zone of proximal development. (Author/MT)
Descriptors: Context Clues, Educational Environment, Examiners, Grade 5

Aronson, Edith; Farr, Roger – Journal of Reading, 1988
Discusses issues in reading assessment, including the concern that tests can not measure the reading process. Concludes that tests will never be more than useful indicators; calls for more research and examination of existing and new types of reading comprehension tests. (MM)
Descriptors: Reading Comprehension, Reading Research, Reading Tests, Test Bias

Glutting, Joseph J.; And Others – Educational and Psychological Measurement, 1987
This paper discusses the basic theory underlying confidence limits and presents reasons why psychologists should incorporate confidence ranges in their psychodiagnostic reports. Four methods for establishing confidence limits are compared. Three of the methods involve estimated true scores, and the fourth is the standard error of measurement…
Descriptors: Error of Measurement, Mathematical Formulas, Psychological Evaluation, Scores