Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 11 |
Descriptor
Test Items | 56 |
Test Use | 56 |
Test Validity | 56 |
Test Construction | 38 |
Test Reliability | 27 |
Foreign Countries | 14 |
Higher Education | 11 |
Scoring | 10 |
Achievement Tests | 9 |
Psychometrics | 9 |
Scores | 9 |
More ▼ |
Source
Author
Vispoel, Walter P. | 2 |
Ackerman, Terry A. | 1 |
Alderson, J. Charles | 1 |
Ayar, Zülal | 1 |
Baker, Eva L. | 1 |
Barrow, Lloyd | 1 |
Basset, Katherine | 1 |
Bennett, Randy Elliot | 1 |
Biancarosa, Gina | 1 |
Bishop, Laurence A. | 1 |
Black, Paul | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 4 |
Postsecondary Education | 4 |
Secondary Education | 2 |
Elementary Education | 1 |
High Schools | 1 |
Location
Georgia | 2 |
New Jersey | 2 |
Tennessee | 2 |
Australia | 1 |
Canada | 1 |
Colorado | 1 |
District of Columbia | 1 |
Idaho | 1 |
Illinois | 1 |
Indonesia | 1 |
Ireland | 1 |
More ▼ |
Laws, Policies, & Programs
Comprehensive Education… | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024
The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…
Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability
Hae In Park – English Teaching, 2024
The present study aimed to validate a 70-item Korean bilingual version of the Vocabulary Size Test (VST) using Rasch modeling. The goal was to assess the applicability of this Korean version of the VST for Korean learners of English in an English as a foreign language (EFL) context by examining validity evidence based on Messick's framework.…
Descriptors: Korean, Bilingualism, English (Second Language), Second Language Learning
Sutiarso, Sugeng; Rosidin, Undang; Sulistiawan, Aan – European Journal of Educational Research, 2022
This research is a developmental research aiming at developing a good mathematical test instrument using polytomous responses based on classical and modern theories. This research design uses the Plomp model, which consists of five stages, (1) preliminary investigation, (2) design, (3) realization/construction, (4) revision, and (5) implementation…
Descriptors: Mathematics Instruction, Mathematics Tests, Item Response Theory, Test Items
College Board, 2023
Over the past several years, content experts, psychometricians, and researchers have been hard at work developing, refining, and studying the digital SAT. The work is grounded in foundational best practices and advances in measurement and assessment design, with fairness for students informing all of the work done. This paper shares learnings from…
Descriptors: College Entrance Examinations, Psychometrics, Computer Assisted Testing, Best Practices
Lehane, Paula; Scully, Darina; O'Leary, Michael – Irish Educational Studies, 2022
In line with the widespread proliferation of digital technology in everyday life, many countries are now beginning to use computer-based exams (CBEs) in their post-primary education systems. To ensure that these CBEs are delivered in a manner that preserves their fairness, validity, utility and credibility, several factors pertaining to their…
Descriptors: Computer Assisted Testing, Secondary School Students, Culture Fair Tests, Test Validity
McClellan, Catherine; Snyder, Rebecca; Woods-Murphy, Maryann; Basset, Katherine – National Network of State Teachers of the Year, 2018
Great teachers recognize great assessments. As policy and education leaders work to make sure state tests are measuring the problem-solving, writing, and critical-thinking skills students need for success, they should convene and rely on teachers to review test quality and help answer the question: Do the questions on our state test reflect…
Descriptors: Student Evaluation, Educational Quality, Standardized Tests, Test Items
Carlson, Sarah E.; Seipel, Ben; Biancarosa, Gina; Davison, Mark L.; Clinton, Virginia – Grantee Submission, 2019
This demonstration introduces and presents an innovative online cognitive diagnostic assessment, developed to identify the types of cognitive processes that readers use during comprehension; specifically, processes that distinguish between subtypes of struggling comprehenders. Cognitive diagnostic assessments are designed to provide valuable…
Descriptors: Reading Comprehension, Standardized Tests, Diagnostic Tests, Computer Assisted Testing
Ayar, Zülal – Novitas-ROYAL (Research on Youth and Language), 2021
As the most prestigious and popular standardized achievement test to certify examinees' proficiency of the English language at the national level, Foreign Language Examination (YDS) has been mostly taken by academic staff, undergraduate and graduate students, state employees, and military personnel for years in Turkey. The current study set out to…
Descriptors: Second Language Learning, Second Language Instruction, Language Tests, Language Proficiency
Romine, William L.; Schaffer, Dane L.; Barrow, Lloyd – International Journal of Science Education, 2015
We describe the development and validation of a three-tiered diagnostic test of the water cycle (DTWC) and use it to evaluate the impact of prior learning experiences on undergraduates' misconceptions. While most approaches to instrument validation take a positivist perspective using singular criteria such as reliability and fit with a measurement…
Descriptors: Undergraduate Students, Diagnostic Tests, Water, Item Response Theory
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items
Zimmerman, Irla L.; Woo-Sam, James M. – 1982
Two kinds of WISC-R short forms, item reduction and subtest reduction, are reviewed in terms of their ability to meet these criteria of adequacy: a significant correlation between the full scale IQ and the short form IQ, a non-significant difference between the full and short form mean IQ, a low percentage of IQ classification changes resulting…
Descriptors: Intelligence Tests, Test Interpretation, Test Items, Test Reliability

Ebel, Robert L. – Journal of Educational Measurement, 1982
Reasonable and practical solutions to two major problems confronting the developer of any test of educational achievement (what to measure and how to measure it) are proposed, defended, and defined. (Author/PN)
Descriptors: Measurement Techniques, Objective Tests, Test Construction, Test Items
Ackerman, Terry A. – 1991
Many researchers have suggested that the main cause of item bias is the misspecification of the latent ability space. That is, items that measure multiple abilities are scored as though they are measuring a single ability. If two different groups of examinees have different underlying multidimensional ability distributions and the test items are…
Descriptors: Equations (Mathematics), Item Bias, Item Response Theory, Mathematical Models

Hastings, Jean; Stewart, James – Journal of Research in Science Teaching, 1983
The current status of "homemade" achievement tests reported in "Journal of Research in Science Teaching" and "Science Education" (January 1975 to January 1980) is examined using Anderson's (EJ 062 750) eight categories of information that a high quality research report should include. Findings from 142 references…
Descriptors: Achievement Tests, Literature Reviews, Science Education, Science Tests