Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Comparative Analysis | 7 |
Test Format | 7 |
Equated Scores | 3 |
Scores | 3 |
Test Items | 3 |
Educational Assessment | 2 |
Foreign Countries | 2 |
International Education | 2 |
Test Construction | 2 |
Test Use | 2 |
Testing Problems | 2 |
More ▼ |
Source
Educational Measurement:… | 7 |
Author
Downing, Steven M. | 1 |
Eignor, Daniel R. | 1 |
Green, Bert F. | 1 |
Kolen, Michael J. | 1 |
Lee, Won-Chan | 1 |
O'Leary, Michael | 1 |
Sireci, Stephen G. | 1 |
Wainer, Howard | 1 |
Publication Type
Journal Articles | 7 |
Reports - Descriptive | 3 |
Reports - Evaluative | 2 |
Information Analyses | 1 |
Reports - Research | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Audience
Laws, Policies, & Programs
Assessments and Surveys
Trends in International… | 1 |
What Works Clearinghouse Rating
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Eignor, Daniel R. – Educational Measurement: Issues and Practice, 2008
This article discusses a particular type of concordance table and the potential for test score misuse that may result from employing such a table. The concordance that is discussed is typically created between scores on different, nonequatable versions of a test that share the same or close to the same test title. These concordance tables often…
Descriptors: Scores, Tables (Data), Comparative Analysis, Equated Scores

Wainer, Howard – Educational Measurement: Issues and Practice, 1999
Discusses the comparison of groups of individuals who were administered different forms of a test. Focuses on the situation in which there is little overlap in content between the test forms. Reviews equating problems in national tests in Canada and Israel. (SLD)
Descriptors: Comparative Analysis, Equated Scores, Foreign Countries, National Competency Tests

Downing, Steven M. – Educational Measurement: Issues and Practice, 1992
Research on true-false (TF), multiple-choice, and alternate-choice (AC) tests is reviewed, discussing strengths, weaknesses, and the usefulness in classroom and large-scale testing of each. Recommendations are made for improving use of AC items to overcome some of the problems associated with TF items. (SLD)
Descriptors: Comparative Analysis, Educational Research, Multiple Choice Tests, Objective Tests

O'Leary, Michael – Educational Measurement: Issues and Practice, 2002
Examined the performance of Irish students on multiple-choice, short-answer, and extended-response item sets from the Third International Mathematics and Science Study to determine whether Ireland's relative rank among the more than 40 countries involved remained stable. Findings provide additional evidence that comparing student achievement…
Descriptors: Comparative Analysis, Foreign Countries, International Education, Mathematics Achievement

Green, Bert F. – Educational Measurement: Issues and Practice, 1995
If annual performance assessments are to yield results that can be compared from year to year, many technical problems must be addressed. It is essential that tests to be equated measure the same construct. Methods of equating performance assessment scores, ways of equating system assessments, and standard setting are discussed. (SLD)
Descriptors: Comparative Analysis, Educational Assessment, Educational Change, Equated Scores

Sireci, Stephen G. – Educational Measurement: Issues and Practice, 1997
Different methodologies for linking tests across languages are reviewed and evaluated, focusing on monolingual item response theory, bilingual group designs, and matched monolingual group designs. These methods, although not without weaknesses, are superior for promoting score comparability than methods that rely on translation or expert judgment…
Descriptors: Bilingualism, Comparative Analysis, Cross Cultural Studies, Educational Assessment