Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 5 |
Descriptor
Classification | 9 |
Test Theory | 9 |
Measurement Techniques | 3 |
Models | 3 |
Construct Validity | 2 |
Educational Assessment | 2 |
Error of Measurement | 2 |
Foreign Countries | 2 |
Mathematics Tests | 2 |
Psychometrics | 2 |
Reliability | 2 |
More ▼ |
Source
International Journal of… | 2 |
Journal of Educational… | 2 |
Applied Measurement in… | 1 |
Educational Measurement:… | 1 |
Measurement:… | 1 |
Research Papers in Education | 1 |
Author
Chen, Yi-Hsin | 1 |
Dennings, Bruce | 1 |
Downing, Steven M. | 1 |
Gorin, Joanna S. | 1 |
Haladyna, Thomas M. | 1 |
Hayes, Malcolm | 1 |
He, Qingping | 1 |
Jiao, Hong | 1 |
Kupermintz, Haggai | 1 |
Sijtsma, Klaas | 1 |
Sinharay, Sandip | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 9 |
Journal Articles | 8 |
Speeches/Meeting Papers | 2 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Audience
Practitioners | 1 |
Location
Taiwan | 1 |
United Kingdom (England) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Sinharay, Sandip – Journal of Educational Measurement, 2014
Brennan noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. One way to interpret the method is that a subscore has added value…
Descriptors: Scores, Test Theory, Classification, Cutting Scores
He, Qingping; Hayes, Malcolm; Wiliam, Dylan – Research Papers in Education, 2013
The accuracy of the results of the national tests in English, mathematics and science taken by 11-year olds in England has been a matter of much debate since their introduction in 1994, with estimates of the proportion of students incorrectly classified varying from 10 to 30%. Using live data from the 2009 and 2010 administration of the national…
Descriptors: Foreign Countries, National Curriculum, Accuracy, Classification
Sijtsma, Klaas – International Journal of Testing, 2009
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Descriptors: Construct Validity, Reliability, Classification, Test Theory
Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009
Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…
Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability
Chen, Yi-Hsin; Gorin, Joanna S.; Thompson, Marilyn S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2008
As with any test administered across linguistically and culturally diverse groups, evidence suggesting the equivalence of score meaning across countries is needed for valid comparisons. The current study examines the cross-cultural equivalence of score interpretations from the Trends in International Mathematics and Science Study (TIMSS)-1999 from…
Descriptors: Construct Validity, Mathematics Tests, Foreign Countries, Equated Scores
Kupermintz, Haggai – Journal of Educational Measurement, 2004
A decision-theoretic approach to the question of reliability in categorically scored examinations is explored. The concepts of true scores and errors are discussed as they deviate from conventional psychometric definitions and measurement error in categorical scores is cast in terms of misclassifications. A reliability measure based on…
Descriptors: Test Reliability, Error of Measurement, Psychometrics, Test Theory

Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)
Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests
Thompson, Bruce; Dennings, Bruce – 1993
Q-technique factor analysis identifies clusters or factors of people, rather than of variables, and has proven very popular, especially with regard to testing typology theories. The present study investigated the utility of three different protocols for obtaining data for Q-technique studies. These three protocols were: (1) a conventional ipsative…
Descriptors: Classification, Comparative Analysis, Data Collection, Factor Analysis
Assessment Theory and Research for Classrooms: From "Taxonomies" to Constructing Meaning in Context.

Tittle, Carol Kehr; And Others – Educational Measurement: Issues and Practice, 1993
Major changes in educational and psychological theories that have come about since the cognitive and affective taxonomies of educational objectives were published in 1956 and 1964 are traced. The changes emphasize the need to understand thinking in the context of students' beliefs and self-directed cognitions. (SLD)
Descriptors: Achievement Tests, Affective Behavior, Classification, Classroom Research