Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 16 |
| Since 2017 (last 10 years) | 20 |
| Since 2007 (last 20 years) | 46 |
Descriptor
Source
Author
| Bracken, Bruce A. | 3 |
| Byrne, Barbara M. | 3 |
| Silverstein, A. B. | 3 |
| Smith, Douglas K. | 3 |
| Thompson, Bruce | 3 |
| DROEGE, ROBERT C. | 2 |
| Gaa, John P. | 2 |
| Gayton, William F. | 2 |
| Green, Kathy | 2 |
| Liberman, Dov | 2 |
| Lunz, Mary E. | 2 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 13 |
| Practitioners | 5 |
| Counselors | 2 |
| Teachers | 2 |
Location
| Canada | 9 |
| Australia | 6 |
| Israel | 4 |
| United Kingdom (England) | 4 |
| Germany | 3 |
| United States | 3 |
| Texas | 2 |
| Alabama | 1 |
| Argentina | 1 |
| Austria | 1 |
| Canada (Edmonton) | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Elementary and Secondary… | 1 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Feuer, Michael J. – Educational Testing Service, 2011
Few arguments about education are as effective at galvanizing public attention and motivating political action as those that compare the performance of students with their counterparts in other countries and that connect academic achievement to economic performance. Because data from international large-scale assessments (ILSA) have a powerful…
Descriptors: International Assessment, Test Interpretation, Testing Problems, Comparative Testing
Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N[subscript 1] = 230, N[subscript 2] = 340, N[subscript 3] = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Steedle, Jeffrey; Kugelmass, Heather; Nemeth, Alex – Change: The Magazine of Higher Learning, 2010
Many postsecondary institutions currently administer standardized tests of general college outcomes; more than a quarter of Association of American Colleges and Universities (AAC&U) member institutions do so. Using standardized tests for accountability purposes has been contentious mainly because these tests do not measure every important…
Descriptors: Test Results, Standardized Tests, Test Validity, Educational Testing
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
Bradbury, Alice – Journal of Education Policy, 2011
Despite decades of research and debate, the issue of unequal outcomes continues to be a concern in educational systems worldwide. In England, published data relating to pupils' attainment across ethnic groups and by class indicators has been used to demonstrate continued inequalities in schools. This article attempts to deconstruct the…
Descriptors: Ethnic Groups, Urban Areas, Foreign Countries, Educational Policy
Young, John W.; Holtzman, Steven; Steinberg, Jonathan – Educational Testing Service, 2011
In this research investigation of score comparability for language minority students (English language learners [ELLs] and former English language learners), we examined 3 indicators of score comparability (reliability, internal test structure, and differential item functioning) for 4th and 8th grade students who took the NCLB-mandated content…
Descriptors: Language Minorities, Second Language Learning, Grade 8, Minority Group Students
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Visone, Jeremy D. – American Secondary Education, 2009
This study explored the relationship between reading and achievement on a science standardized test. A nonfiction reading subtest and the science section of the Connecticut Academic Performance Test were compared for Grade 10 students at 3 Connecticut high schools. Results showed a moderate-to-strong positive relationship between the variables.…
Descriptors: Standardized Tests, Correlation, Test Validity, Science Achievement
Foley-Peres, Kathleen; Poirier, Dawn – Educational Research Quarterly, 2008
Many colleges and university's use SAT math scores or math placement tests to place students in the appropriate math course. This study compares the use of math placement scores and SAT scores for 188 freshman students. The student's grades and faculty observations were analyzed to determine if the SAT scores and/or college math assessment scores…
Descriptors: Educational Indicators, Student Placement, Achievement Tests, Standardized Tests
Pell, Godfrey; Homer, Matthew S.; Roberts, Trudie E. – International Journal of Research & Method in Education, 2008
Increasingly, academic institutions are being required to improve the validity of the assessment process; unfortunately, often this is at the expense of reliability. In medical schools (such as Leeds), standardized tests of clinical skills, such as "Objective Structured Clinical Examinations" (OSCEs) are widely used to assess clinical…
Descriptors: Medical Education, Standardized Tests, Clinical Experience, Criterion Referenced Tests
Liow, Jong-Leng – European Journal of Engineering Education, 2008
Peer assessment has been studied in various situations and actively pursued as a means by which students are given more control over their learning and assessment achievement. This study investigated the reliability of staff and student assessments in two oral presentations with limited feedback for a school-based thesis course in engineering…
Descriptors: Feedback (Response), Student Evaluation, Grade Point Average, Peer Evaluation
Peer reviewedLukens, John – Journal of School Psychology, 1988
Administered the Stanford-Binet, Fourth Edition, to 31 mentally retarded adolescents who had previously been tested with the Stanford-Binet, L-M, with a mean interval between testings of 17.3 months. Found an intertest correlation of .86 and a median intelligence quotient change of three points in either direction. Compatability of scores supports…
Descriptors: Adolescents, Comparative Testing, Intelligence Tests, Mental Retardation
Peer reviewedSandoval, Jonathan; And Others – Psychology in the Schools, 1988
Examined similarity of scores of 30 learning disabled students (aged 16 and 17) on the Wechsler Intelligence Scale for Children-Revised (WISC-R) and the Wechsler Adult Intelligence Scale-Revised (WAIS-R). Results documented similarity between WISC-R and WAIS-R for 16 year-olds who were learning disabled and had average intellectual ability.…
Descriptors: Adolescents, Comparative Testing, Learning Disabilities, Special Education
Peer reviewedSpitz, Herman H. – Intelligence, 1989
Studies involving groups administered the Wechsler Adult Intelligence Scale (WAIS) and the WAIS-Revised were examined to determine the validity of J. R. Flynn's (1987) findings of massive intelligence quotient gains in a single generation in many nations. Results for sampled adults support Flynn for the average intelligence range only. (TJH)
Descriptors: Adults, Comparative Testing, Intelligence Quotient, Test Validity
Peer reviewedRiviere, Michael S. – Educational and Psychological Measurement, 1973
Descriptors: Comparative Testing, Intelligence Tests, Mental Retardation, Test Reliability

Direct link
