Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 5 |
Descriptor
Evaluation Methods | 11 |
Test Theory | 11 |
Item Response Theory | 4 |
Student Evaluation | 4 |
Statistical Analysis | 3 |
Test Items | 3 |
Test Reliability | 3 |
Computation | 2 |
Evaluation Research | 2 |
Higher Education | 2 |
Item Analysis | 2 |
More ▼ |
Source
Author
Publication Type
Reports - Descriptive | 11 |
Journal Articles | 10 |
Information Analyses | 1 |
Opinion Papers | 1 |
Education Level
Higher Education | 2 |
Secondary Education | 2 |
High Schools | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Audience
Practitioners | 2 |
Administrators | 1 |
Teachers | 1 |
Location
Netherlands | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Graduate Record Examinations | 1 |
What Works Clearinghouse Rating
Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities
Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015
For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…
Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests
Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013
In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…
Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory
Calmettes, Guillaume; Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012
A jack knife is a pocket knife that is put to many tasks, because it's ready to hand. Often there could be a better tool for the job, such as a screwdriver, a scraper, or a can-opener, but these are not usually pocket items. In statistical terms, the expression implies making do with what's available. Another simile, of an extreme situation, is…
Descriptors: Statistical Analysis, Computation, Population Distribution, Evaluation Methods
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
Ding, Lin; Beichner, Robert – Physical Review Special Topics - Physics Education Research, 2009
This paper introduces five commonly used approaches to analyzing multiple-choice test data. They are classical test theory, factor analysis, cluster analysis, item response theory, and model analysis. Brief descriptions of the goals and algorithms of these approaches are provided, together with examples illustrating their applications in physics…
Descriptors: Multiple Choice Tests, Factor Analysis, Data Interpretation, Item Response Theory
Zimmerman, Donald W.; Williams, Richard H.; Zumbo, Bruno D.; Ross, Donald – International Journal of Testing, 2005
This article focuses on Louis Guttman's contributions to the classical theory of educational and psychological tests, one of the lesser known of his many contributions to quantitative methods in the social sciences. Guttman's work in this field provided a rigorous mathematical basis for ideas that, for many decades after Spearman's initial work,…
Descriptors: Evaluation Methods, Test Theory, Social Sciences, Psychological Testing
Raju, Nambury S.; Oshima, T.C. – Educational and Psychological Measurement, 2005
Two new prophecy formulas for estimating item response theory (IRT)-based reliability of a shortened or lengthened test are proposed. Some of the relationships between the two formulas, one of which is identical to the well-known Spearman-Brown prophecy formula, are examined and illustrated. The major assumptions underlying these formulas are…
Descriptors: Item Response Theory, Test Reliability, Evaluation Methods, Computation
Allen, Nancy L.; Holland, Paul W.; Thayer, Dorothy T. – Journal of Educational Measurement, 2005
Allowing students to choose the question(s) that they will answer from among several possible alternatives is often viewed as a mechanism for increasing fairness in certain types of assessments. The fairness of optional topic choice is not a universally accepted fact, however, and various studies have been done to assess this question. We examine…
Descriptors: Test Theory, Test Items, Student Evaluation, Evaluation Methods
Rorvig, Mark – Proceedings of the ASIS Annual Meeting, 2000
Proposes a new technique for the evaluation of question difficulty. Suggests that question dispersion by multidimensional scaling models the question-response pattern required by test theory, but without the population density requirements of the traditional methods. Considers the effect on knowledge management functions, including library…
Descriptors: Difficulty Level, Evaluation Methods, Library Services, Multidimensional Scaling

Schnucker, Robert V. – History Teacher, 1991
Describes the development of a history assessment test by Northeast Missouri State University (Kirksville) faculty. Discusses content, problems with design and cooperation, and theories of assessment testing. Includes sample questions demonstrating the cube plan, a technique that involves using a single question to measure three learning…
Descriptors: Achievement Tests, Educational Assessment, Evaluation Methods, Higher Education

Everett, Kenneth G.; DeLoach, Will S. – Journal of Chemical Education, 1986
Analyzes an old chemistry examination that was given to a college chemistry class in 1984. Contrasts several of the examination's features against most modern ones. Notes the emphasis on observable properties and the practical applications of substances. Argues that the newer examinations may stress too much theory and don't stress communication…
Descriptors: Chemistry, College Science, Evaluation Methods, Higher Education