Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown thanks to advances in technology and the flexibility it offers. Online examinations measure students' knowledge and skills. Traditional question papers suffer from inconsistent difficulty levels, arbitrary question allocation, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Bruno D. Zumbo – International Journal of Assessment Tools in Education, 2023
In line with the journal volume's theme, this essay considers lessons from the past and visions for the future of test validity. In the first part of the essay, a description of historical trends in test validity since the early 1900s leads to the natural question of whether the discipline has progressed in its definition and description of test…
Descriptors: Test Theory, Test Validity, True Scores, Definitions
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019
This note discusses the merits of coefficient alpha, and the conditions under which they hold, in light of recent critical publications that overlook significant research findings from the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…
Descriptors: Test Validity, Test Reliability, Test Items, Correlation
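For readers unfamiliar with the statistic discussed in this record, the following is a minimal sketch of the standard textbook formula for coefficient alpha, k/(k-1) * (1 - sum of item variances / variance of total scores). It is not taken from the article itself; the function name and the example data are illustrative assumptions.

```python
from statistics import variance


def coefficient_alpha(scores):
    """Coefficient (Cronbach's) alpha from a respondents-by-items score table.

    scores: list of rows, one per respondent, each a list of item scores.
    Uses the sample variance (n - 1 denominator) for items and totals alike.
    """
    k = len(scores[0])  # number of items
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical data: two items that rank three respondents identically.
example_scores = [[1, 1], [2, 2], [3, 3]]
alpha = coefficient_alpha(example_scores)
```

With perfectly consistent items, as in this toy table, alpha reaches its upper bound of 1; real instruments yield lower values.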
Seah, Rebecca; Horne, Marj – Mathematics Education Research Journal, 2020
This article presents a preliminary analysis of a test item in a large-scale study designed to promote the development of geometric reasoning progression. Two sets of data were analysed to validate the item, which was designed to assess secondary school students' knowledge of a rectangle. The first data set involved 155 Year 4-10 students from seven trial…
Descriptors: Test Construction, Test Validity, Test Items, Geometric Concepts
Rigney, Alexander M. – Journal of Psychoeducational Assessment, 2019
The "Detroit Tests of Learning Aptitude" has been in use for more than three quarters of a century (Baker & Leland, 1935). Its longevity in the field speaks to its popularity as a broad measure of cognitive abilities. Its most recent iteration, in the form of the "Detroit Tests of Learning Abilities--Fifth Edition" (DTLA-5;…
Descriptors: Aptitude Tests, Cognitive Ability, Test Construction, Test Items
Jacobson, Erik; Svetina, Dubravka – Applied Measurement in Education, 2019
Contingent argument-based approaches to validity require a unique argument for each use, in contrast to more prescriptive approaches that identify the common kinds of validity evidence researchers should consider for every use. In this article, we evaluate our use of an approach that is both prescriptive "and" argument-based to develop a…
Descriptors: Test Validity, Test Items, Test Construction, Test Interpretation
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
Rivas, Axel; Scasso, Martín Guillermo – Journal of Education Policy, 2021
Since 2000, the PISA test implemented by the OECD has become the prime benchmark for international comparisons in education. The 2015 PISA edition introduced methodological changes that altered the nature of its results. PISA no longer treated non-reached items in the final part of the test as valid, assuming that those unanswered questions were more a…
Descriptors: Test Validity, Computer Assisted Testing, Foreign Countries, Achievement Tests
Gotch, Chad M.; French, Brian F. – Educational Assessment, 2020
The State of Washington requires school districts to file court petitions on students with excessive unexcused absences. The "Washington Assessment of Risks and Needs of Students" (WARNS), a self-report screening instrument developed for use by high school and juvenile court personnel in such situations, purports to measure six facets of…
Descriptors: Risk Assessment, Needs Assessment, Truancy, Measurement Techniques
Adedokun, Omolola A. – Journal of Extension, 2018
This article provides an illustrative description of the pre-post difference index (PPDI), a simple, nontechnical yet robust tool for examining the instructional sensitivity of assessment items. Extension educators often design pretest-posttest instruments to assess the impact of their curricula on participants' knowledge and understanding of the…
Descriptors: Extension Education, Extension Agents, Pretests Posttests, Curriculum Evaluation
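The truncated abstract above does not spell out how the index is computed. As a rough sketch, assuming the common definition of a pre-post difference index (the proportion of participants answering an item correctly on the posttest minus the proportion answering it correctly on the pretest), the calculation might look like this. The function name and the data are hypothetical.

```python
def ppdi(pre, post):
    """Per-item pre-post difference index under the assumed definition.

    pre, post: lists of rows, one per participant, each a list of 0/1
    item scores (1 = correct). Both tables must cover the same items.
    Returns one value per item: p(correct at posttest) - p(correct at pretest).
    """
    n_items = len(pre[0])

    def proportion_correct(rows):
        return [sum(row[i] for row in rows) / len(rows) for i in range(n_items)]

    return [b - a for a, b in zip(proportion_correct(pre), proportion_correct(post))]


# Hypothetical scores for four participants on two items.
pre_scores = [[0, 1], [0, 1], [1, 0], [0, 1]]
post_scores = [[1, 1], [1, 1], [1, 0], [0, 1]]
index = ppdi(pre_scores, post_scores)  # item 1 gains, item 2 does not
```

Under this definition, values near 1 suggest an item is sensitive to instruction, while values near 0 (or below) suggest that instruction made little difference to performance on that item.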
Zeidan, Quira; Loertscher, Jennifer; Wolfson, Adele J.; Tansey, John T.; Offerdahl, Erika G.; Kennelly, Peter J.; Dries, Daniel R.; Moore, Victoria Del Gaizo; Dean, Diane M.; Carastro, L. Michael; Villafañe, Sachel M.; Tyler, Ludmila – CBE - Life Sciences Education, 2021
With support from the American Society for Biochemistry and Molecular Biology (ASBMB), a community of biochemistry and molecular biology (BMB) scientist-educators has developed and administered an assessment instrument designed to evaluate student competence across four core concept and skill areas fundamental to BMB. The four areas encompass…
Descriptors: Test Construction, Test Validity, Scoring, Minimum Competency Testing
Raspa, Melissa; Bann, Carla M.; Gwaltney, Angela; Benke, Timothy A.; Fu, Cary; Glaze, Daniel G.; Haas, Richard; Heydemann, Peter; Jones, Mary; Kaufmann, Walter E.; Lieberman, David; Marsh, Eric; Peters, Sarika; Ryther, Robin; Standridge, Shannon; Skinner, Steven A.; Percy, Alan K.; Neul, Jeffrey L. – American Journal on Intellectual and Developmental Disabilities, 2020
Rett syndrome (RTT) is a neurodevelopmental disorder that primarily affects females. Recent work indicates the potential for disease modifying therapies. However, there remains a need to develop outcome measures for use in clinical trials. Using data from a natural history study (n = 1,075), we examined the factor structure, internal consistency,…
Descriptors: Genetic Disorders, Psychometrics, Psychomotor Skills, Physical Disabilities
Nebraska Department of Education, 2021
This technical report documents the processes and procedures implemented to support the Spring 2021 Nebraska Student-Centered Assessment System (NSCAS) Phase I Pilot in English Language Arts (ELA), Mathematics, and Science assessments by NWEA® under the supervision of the Nebraska Department of Education (NDE). The technical report shows how the…
Descriptors: Psychometrics, Standard Setting, English, Language Arts
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
Carney, Michele B.; Cavey, Laurie; Hughes, Gwyneth – Elementary School Journal, 2017
This article illustrates an argument-based approach to presenting validity evidence for assessment items intended to measure a complex construct. Our focus is developing a measure of teachers' ability to analyze and respond to students' mathematical thinking for the purpose of program evaluation. Our validity argument consists of claims addressing…
Descriptors: Mathematics Instruction, Mathematical Logic, Thinking Skills, Evidence