ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	5

Descriptor

Classification	9
Test Theory	9
Measurement Techniques	3
Models	3
Construct Validity	2
Educational Assessment	2
Error of Measurement	2
Foreign Countries	2
Mathematics Tests	2
Psychometrics	2
Reliability	2
Scores	2
Structural Equation Models	2
Test Construction	2
Test Items	2
Test Reliability	2
Accuracy	1
Achievement Tests	1
Affective Behavior	1
Classroom Research	1
Cognitive Processes	1
Comparative Analysis	1
Constructivism (Learning)	1
Context Effect	1
Correlation	1
More ▼

Source

International Journal of…	2
Journal of Educational…	2
Applied Measurement in…	1
Educational Measurement:…	1
Measurement:…	1
Research Papers in Education	1

Author

Chen, Yi-Hsin	1
Dennings, Bruce	1
Downing, Steven M.	1
Gorin, Joanna S.	1
Haladyna, Thomas M.	1
Hayes, Malcolm	1
He, Qingping	1
Jiao, Hong	1
Kupermintz, Haggai	1
Sijtsma, Klaas	1
Sinharay, Sandip	1
Tatsuoka, Kikumi K.	1
Thompson, Bruce	1
Thompson, Marilyn S.	1
Tittle, Carol Kehr	1
Wiliam, Dylan	1
More ▼

Publication Type

Reports - Evaluative	9
Journal Articles	8
Speeches/Meeting Papers	2

Education Level

Elementary Education	1
Elementary Secondary Education	1

Audience

Practitioners

Location

Taiwan	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Analysis of Added Value of Subscores with Respect to Classification

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2014

Brennan noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. One way to interpret the method is that a subscore has added value…

Descriptors: Scores, Test Theory, Classification, Cutting Scores

Classification Accuracy in Key Stage 2 National Curriculum Tests in England

Peer reviewed

Direct link

He, Qingping; Hayes, Malcolm; Wiliam, Dylan – Research Papers in Education, 2013

The accuracy of the results of the national tests in English, mathematics and science taken by 11-year olds in England has been a matter of much debate since their introduction in 1994, with estimates of the proportion of students incorrectly classified varying from 10 to 30%. Using live data from the 2009 and 2010 administration of the national…

Descriptors: Foreign Countries, National Curriculum, Accuracy, Classification

Correcting Fallacies in Validity, Reliability, and Classification

Peer reviewed

Direct link

Sijtsma, Klaas – International Journal of Testing, 2009

This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…

Descriptors: Construct Validity, Reliability, Classification, Test Theory

Diagnostic Classification Models: Which One Should I Use?

Peer reviewed

Direct link

Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009

Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…

Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability

Cross-Cultural Validity of the TIMSS-1999 Mathematics Test: Verification of a Cognitive Model

Peer reviewed

Direct link

Chen, Yi-Hsin; Gorin, Joanna S.; Thompson, Marilyn S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2008

As with any test administered across linguistically and culturally diverse groups, evidence suggesting the equivalence of score meaning across countries is needed for valid comparisons. The current study examines the cross-cultural equivalence of score interpretations from the Trends in International Mathematics and Science Study (TIMSS)-1999 from…

Descriptors: Construct Validity, Mathematics Tests, Foreign Countries, Equated Scores

On the Reliability of Categorically Scored Examinations

Peer reviewed

Direct link

Kupermintz, Haggai – Journal of Educational Measurement, 2004

A decision-theoretic approach to the question of reliability in categorically scored examinations is explored. The concepts of true scores and errors are discussed as they deviate from conventional psychometric definitions and measurement error in categorical scores is cast in terms of misclassifications. A reliability measure based on…

Descriptors: Test Reliability, Error of Measurement, Psychometrics, Test Theory

A Taxonomy of Multiple-Choice Item-Writing Rules.

Peer reviewed

Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989

A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)

Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests

The Unnumbered Graphic Scale as a Data-Collection Method: An Investigation Comparing Three Measurement Strategies in the Context of Q-Technique Factor Analysis.

Download full text

Thompson, Bruce; Dennings, Bruce – 1993

Q-technique factor analysis identifies clusters or factors of people, rather than of variables, and has proven very popular, especially with regard to testing typology theories. The present study investigated the utility of three different protocols for obtaining data for Q-technique studies. These three protocols were: (1) a conventional ipsative…

Descriptors: Classification, Comparative Analysis, Data Collection, Factor Analysis

Assessment Theory and Research for Classrooms: From "Taxonomies" to Constructing Meaning in Context.

Peer reviewed

Tittle, Carol Kehr; And Others – Educational Measurement: Issues and Practice, 1993

Major changes in educational and psychological theories that have come about since the cognitive and affective taxonomies of educational objectives were published in 1956 and 1964 are traced. The changes emphasize the need to understand thinking in the context of students' beliefs and self-directed cognitions. (SLD)

Descriptors: Achievement Tests, Affective Behavior, Classification, Classroom Research