Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 14 |
| Since 2017 (last 10 years) | 17 |
| Since 2007 (last 20 years) | 36 |
Descriptor
| Comparative Testing | 203 |
| Test Reliability | 203 |
| Test Validity | 95 |
| Higher Education | 47 |
| Test Construction | 47 |
| Foreign Countries | 31 |
| College Students | 28 |
| Test Format | 28 |
| Intelligence Tests | 22 |
| Test Items | 22 |
| Psychometrics | 20 |
Author
| Bracken, Bruce A. | 3 |
| Gallas, Edwin J. | 3 |
| Smith, Douglas K. | 3 |
| Trevisan, Michael S. | 3 |
| Anderson, Paul S. | 2 |
| Breland, Hunter M. | 2 |
| Costantino, Giuseppe | 2 |
| Green, Kathy | 2 |
| Hyers, Albert D. | 2 |
| Karma, Kai | 2 |
| Marsh, Herbert W. | 2 |
Education Level
| Higher Education | 16 |
| Postsecondary Education | 11 |
| Elementary Education | 5 |
| Elementary Secondary Education | 4 |
| Secondary Education | 4 |
| Early Childhood Education | 2 |
| Grade 2 | 2 |
| Grade 4 | 2 |
| High Schools | 2 |
| Grade 10 | 1 |
| Grade 7 | 1 |
Audience
| Researchers | 9 |
| Practitioners | 3 |
| Teachers | 2 |
| Counselors | 1 |
Location
| United States | 5 |
| Australia | 4 |
| Canada | 4 |
| China | 4 |
| Ireland | 2 |
| Israel | 2 |
| Singapore | 2 |
| United Kingdom | 2 |
| United Kingdom (England) | 2 |
| Alabama | 1 |
| Argentina | 1 |
Laws, Policies, & Programs
| Elementary and Secondary… | 2 |
| No Child Left Behind Act 2001 | 1 |
| Race to the Top | 1 |
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Bijsterbosch, Erik – Geographical Education, 2018
Geography teachers' school-based (internal) examinations in pre-vocational geography education in the Netherlands appear to be in line with the findings in the literature, namely that teachers' assessment practices tend to focus on the recall of knowledge. These practices are strongly influenced by national (external) examinations. This paper…
Descriptors: Foreign Countries, Instructional Effectiveness, National Competency Tests, Geography Instruction
Ward, Samantha L.; Sullivan, Karen A.; Gilmore, Linda – Educational and Developmental Psychologist, 2016
Objective: Limited time and resources necessitate the availability of accurate, inexpensive and rapid diagnostic aids for Autism Spectrum Disorder (ASD). The Autistic Behavioural Indicators Instrument (ABII) was developed for this purpose, but its psychometric properties have not yet been fully established. Method: The clinician-rated ABII, the…
Descriptors: Autism, Pervasive Developmental Disorders, Psychometrics, Diagnostic Tests
Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016
Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis
Rutkowski, David; Rutkowski, Leslie; Plucker, Jonathan A. – Phi Delta Kappan, 2015
The OECD and its U.S. administrator, McGraw-Hill Education CTB, have recently concluded the first cycle of the OECD-Test for Schools in the U.S. This test is being marketed to local schools and is designed to compare 15-year-olds from individual participating schools against peers nationally and internationally using the OECD's PISA test as its…
Descriptors: Participation, International Education, Comparative Testing, Comparative Education
Slepkov, Aaron D.; Shiell, Ralph C. – Physical Review Special Topics - Physics Education Research, 2014
Constructed-response (CR) questions are a mainstay of introductory physics textbooks and exams. However, because of the time, cost, and scoring reliability constraints associated with this format, CR questions are being increasingly replaced by multiple-choice (MC) questions in formal exams. The integrated testlet (IT) is a recently developed…
Descriptors: Science Tests, Physics, Responses, Multiple Choice Tests
Totten, Jeff W. – Journal of Learning in Higher Education, 2014
The original SOCO Scale was reduced to 10 items by Thomas, Soutar, and Ryan (2001). The author conducted a pretest and a posttest in his Personal Selling class during the Fall 2009 semester. Significant differences by gender, student sales experience and family member in the sales field were identified. The author once again pretested the…
Descriptors: Test Construction, Program Validation, Pretests Posttests, Questionnaires
Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015
In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…
Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading
Turgut, Guliz – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2013
The ranking of the United States in major international tests such as the Progress in International Reading Literacy Study (PIRLS), Trends in International Mathematics and Science Study (TIMSS), and Program for International Student Assessment (PISA) is used as the driving force and rationale for the current educational reforms in the United…
Descriptors: Educational Change, Success, Educational Strategies, Educational Indicators
Jones, Ian; Alcock, Lara – Studies in Higher Education, 2014
Peer assessment typically requires students to judge peers' work against assessment criteria. We tested an alternative approach in which students judged pairs of scripts against one another in the absence of assessment criteria. First year mathematics undergraduates (N = 194) sat a written test on conceptual understanding of multivariable…
Descriptors: Peer Evaluation, Evaluation Criteria, Alternative Assessment, Undergraduate Students
Morrison, Keith – Educational Research and Evaluation, 2013
This paper reviews the literature on comparing online and paper course evaluations in higher education and provides a case study of a very large randomised trial on the topic. It presents a mixed but generally optimistic picture of online course evaluations with respect to response rates, what they indicate, and how to increase them. The paper…
Descriptors: Literature Reviews, Course Evaluation, Case Studies, Higher Education
Lew, Magdeleine D. N.; Alwis, W. A. M.; Schmidt, Henk G. – Assessment & Evaluation in Higher Education, 2010
The purpose of the two studies presented here was to evaluate the accuracy of students' self-assessment ability, to examine whether this ability improves over time and to investigate whether self-assessment is more accurate if students believe that it contributes to improving learning. To that end, the accuracy of the self-assessments of 3588…
Descriptors: Self Evaluation (Individuals), Beliefs, Learning Processes, Correlation
Ricketts, Chris; Brice, Julie; Coombes, Lee – Advances in Health Sciences Education, 2010
The purpose of multiple choice tests of medical knowledge is to estimate as accurately as possible a candidate's level of knowledge. However, concern is sometimes expressed that multiple choice tests may also discriminate in undesirable and irrelevant ways, such as between minority ethnic groups or by sex of candidates. There is little literature…
Descriptors: Medical Students, Testing Accommodations, Ethnic Groups, Learning Disabilities
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
Bradbury, Alice – Journal of Education Policy, 2011
Despite decades of research and debate, the issue of unequal outcomes continues to be a concern in educational systems worldwide. In England, published data relating to pupils' attainment across ethnic groups and by class indicators has been used to demonstrate continued inequalities in schools. This article attempts to deconstruct the…
Descriptors: Ethnic Groups, Urban Areas, Foreign Countries, Educational Policy