Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 9 |
Descriptor
Comparative Analysis | 13 |
Comparative Testing | 13 |
Item Analysis | 13 |
Test Items | 6 |
Academic Achievement | 3 |
Achievement Tests | 3 |
Evaluation Methods | 3 |
Foreign Countries | 3 |
Measurement Techniques | 3 |
Rating Scales | 3 |
Test Validity | 3 |
More ▼ |
Source
Author
Hughes, Carolyn | 2 |
Little, Todd D. | 2 |
Palmer, Susan B. | 2 |
Seo, Hyojeong | 2 |
Shogren, Karrie A. | 2 |
Thompson, James R. | 2 |
Wehmeyer, Michael L. | 2 |
Aldhafri, Said | 1 |
Bauer, Daniel | 1 |
Bejar, Isaac I. | 1 |
Cantrell, Pamela | 1 |
More ▼ |
Publication Type
Reports - Research | 10 |
Journal Articles | 9 |
Dissertations/Theses -… | 1 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Secondary Education | 3 |
Higher Education | 3 |
Elementary Education | 1 |
Grade 4 | 1 |
Postsecondary Education | 1 |
Audience
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
California Achievement Tests | 1 |
California Test of Mental… | 1 |
Stanford Binet Intelligence… | 1 |
What Works Clearinghouse Rating
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Seo, Hyojeong; Shogren, Karrie A.; Wehmeyer, Michael L.; Hughes, Carolyn; Thompson, James R.; Little, Todd D.; Palmer, Susan B. – Career Development and Transition for Exceptional Individuals, 2016
This study examined similarities and differences in measurement properties and score comparability of the "Supports Intensity Scale-Adult Version" (16-64 years) and the "Supports Intensity Scale-Children's Version" (5-16 years). Data were collected from 142 adolescents with intellectual disability with both versions of the…
Descriptors: Adolescents, Intellectual Disability, Special Needs Students, Transitional Programs
Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016
Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis
Seo, Hyojeong; Shogren, Karrie A.; Wehmeyer, Michael L.; Hughes, Carolyn; Thompson, James R.; Little, Todd D.; Palmer, Susan B. – Grantee Submission, 2016
This study examined similarities and differences in measurement properties and score comparability of the "Supports Intensity Scale-Adult Version" (16-64 years) and the "Supports Intensity Scale-Children's Version" (5-16 years). Data were collected from 142 adolescents with intellectual disability with both versions of the…
Descriptors: Adolescents, Intellectual Disability, Special Needs Students, Transitional Programs
Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N[subscript 1] = 230, N[subscript 2] = 340, N[subscript 3] = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Zhang, Bin – ProQuest LLC, 2012
Social scientists usually are more interested in consumers' dichotomous choice, such as purchase a product or not, adopt a technology or not, etc. However, up to date, there is nearly no model can help us solve the problem of multi-network effects comparison with a dichotomous dependent variable. Furthermore, the study of multi-network…
Descriptors: Social Networks, Network Analysis, Comparative Analysis, Population Groups
Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011
When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education
Klassen, Robert M.; Aldhafri, Said; Mansfield, Caroline F.; Purwanto, Edy; Siu, Angela F. Y.; Wong, Marina W.; Woods-McConney, Amanda – Journal of Experimental Education, 2012
This study explored the validity of the Utrecht Work Engagement Scale in a sample of 853 practicing teachers from Australia, Canada, China (Hong Kong), Indonesia, and Oman. The authors used multigroup confirmatory factor analysis to test the factor structure and measurement invariance across settings, after which they examined the relationships…
Descriptors: Job Satisfaction, Factor Structure, Measures (Individuals), Factor Analysis
McGlynn, Angela Provitera – Education Digest: Essential Readings Condensed for Quick Review, 2008
A new report, "The Proficiency Illusion," released last year by the Thomas B. Fordham Institute states that the tests that states use to measure academic progress under the No Child Left Behind Act (NCLB) are creating a false impression of success, especially in reading and especially in the early grades. The report is a collaboration…
Descriptors: Federal Legislation, Academic Achievement, Rating Scales, Achievement Tests

Garfinkel, Robin; Thorndike, Robert L. – Child Development, 1976
This study was conducted to determine how items of the Stanford-Binet Intelligence Scale, Form L-M, had performed in the 1930's standardization sample in comparison with the 1972 standardization sample. (SB)
Descriptors: Comparative Analysis, Comparative Testing, Group Testing, Intelligence Tests
Cantrell, Pamela – School Science and Mathematics, 2003
The difference in gain scores produced by traditional pretests and those produced by retrospective pretests when compared to posttest scores on the Science Teaching Efficacy Belief Instrument for preservice teachers was investigated in this study. Results indicated that gain scores using the traditional pretest produced significant improvement in…
Descriptors: Pretests Posttests, Validity, Scores, Preservice Teachers
Bejar, Isaac I.; And Others – 1977
Information provided by typical and improved conventional classroom achievement tests was compared with information provided by an adaptive test covering the same subject matter. Both tests were administered to over 700 college students in a general biology course. Using the same scoring method, adaptive testing was found to yield substantially…
Descriptors: Academic Achievement, Achievement Tests, Adaptive Testing, Biology
Clarke, S. C. T.; And Others – 1978
The Edmonton Grade III Achievement: 1956-1977 study is a comparison of achievement in reading, arithmetic, and language involving all of the third grade students in a large school system. Six basic skills tests which were administered to all of the Edmonton third grade students in 1956 were reprinted and administered to all of the third grade…
Descriptors: Academic Achievement, Achievement Tests, Aptitude Tests, Basic Skills