Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 29 |
Descriptor
Educational Testing | 46 |
Psychometrics | 46 |
Test Construction | 46 |
Educational Assessment | 19 |
Student Evaluation | 15 |
Test Items | 14 |
Evaluation Methods | 13 |
Measurement Techniques | 12 |
Test Validity | 12 |
Item Response Theory | 11 |
Measurement | 11 |
More ▼ |
Source
Author
Wright, Benjamin D. | 2 |
Alonzo, Julie | 1 |
Bailey, Alison L. | 1 |
Bertling, Jonas P. | 1 |
Butler, Frances A. | 1 |
Carmona, Guadalupe | 1 |
Chen, Tzu-An | 1 |
Cui, Ying | 1 |
Donovan, Jenny | 1 |
Dorans, Neil J. | 1 |
Egan, Teresa | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 19 |
Higher Education | 8 |
Postsecondary Education | 5 |
Secondary Education | 5 |
Elementary Education | 4 |
High Schools | 4 |
Grade 3 | 2 |
Grade 4 | 2 |
Grade 6 | 2 |
Early Childhood Education | 1 |
Grade 1 | 1 |
More ▼ |
Location
United States | 3 |
New Zealand | 2 |
United Kingdom | 2 |
Australia | 1 |
Germany | 1 |
United Kingdom (England) | 1 |
United Kingdom (Wales) | 1 |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
National Defense Education Act | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Advanced Placement… | 3 |
SAT (College Admission Test) | 3 |
Continuous Performance Test | 1 |
Early Childhood Longitudinal… | 1 |
National Teacher Examinations | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Timothy Donald Folger – ProQuest LLC, 2024
This dissertation aims to bridge the gap between validity theory and the practice of validation. The dissertation employs a three-article approach. Following the introduction in Chapter I, three independent manuscripts representing three empirical studies are presented (i.e., Chapters II - IV). Each chapter is a stand-alone publishable manuscript,…
Descriptors: Educational Testing, Psychological Testing, Test Validity, Delphi Technique
Russell, Mike; Ludlow, Larry; O'Dwyer, Laura – Educational Measurement: Issues and Practice, 2019
The field of educational measurement has evolved considerably since the first doctoral programs were established. In response, programs have typically tacked on courses that address newly developed theories, methods, tools, and techniques. As our review of current programs evidences, this approach produces artificial distinctions among topics and…
Descriptors: Educational Testing, Specialists, Doctoral Programs, Program Evaluation
Tian, Feng – ProQuest LLC, 2011
There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…
Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis
Chen, Tzu-An – ProQuest LLC, 2010
This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…
Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models
Stroup, Walter M.; Hills, Thomas; Carmona, Guadalupe – Technology, Knowledge and Learning, 2011
This paper summarizes an approach to helping future educators to engage with key issues related to the application of measurement-related statistics to learning and teaching, especially in the contexts of science, mathematics, technology and engineering (STEM) education. The approach we outline has two major elements. First, students are asked to…
Descriptors: Core Curriculum, High Stakes Tests, Psychometrics, Educational Testing
Liu, Xiufeng – IAP - Information Age Publishing, Inc., 2010
This book meets a demand in the science education community for a comprehensive and introductory measurement book in science education. It describes measurement instruments reported in refereed science education research journals, and introduces the Rasch modeling approach to developing measurement instruments in common science assessment domains,…
Descriptors: Graduate Students, Textbooks, Research Methodology, Science Tests
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Hagge, Sarah Lynn – ProQuest LLC, 2010
Mixed-format tests containing both multiple-choice and constructed-response items are widely used on educational tests. Such tests combine the broad content coverage and efficient scoring of multiple-choice items with the assessment of higher-order thinking skills thought to be provided by constructed-response items. However, the combination of…
Descriptors: Test Format, True Scores, Equated Scores, Psychometrics
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Dorans, Neil J.; Liu, Jinghua – Educational Testing Service, 2009
The equating process links scores from different editions of the same test. For testing programs that build nearly parallel forms to the same explicit content and statistical specifications and administer forms under the same conditions, the linkings between the forms are expected to be equatings. Score equity assessment (SEA) provides a useful…
Descriptors: Testing Programs, Mathematics Tests, Quality Control, Psychometrics
Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina – Studies in Educational Evaluation, 2009
Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…
Descriptors: Word Problems (Mathematics), Probability, Automation, College Students
Glas, Cees A. W.; Geerlings, Hanneke – Studies in Educational Evaluation, 2009
Pupil monitoring systems support the teacher in tailoring teaching to the individual level of a student and in comparing the progress and results of teaching with national standards. The systems are based on the availability of an item bank calibrated using item response theory. The assessment of the students' progress and results can be further…
Descriptors: Item Banks, Adaptive Testing, National Standards, Psychometrics
Gierl, Mark J.; Cui, Ying – Measurement: Interdisciplinary Research and Perspectives, 2008
One promising application of diagnostic classification models (DCM) is in the area of cognitive diagnostic assessment in education. However, the successful application of DCM in educational testing will likely come with a price--and this price may be in the form of new test development procedures and practices required to yield data that satisfy…
Descriptors: Educational Testing, Classification, Psychometrics, Test Construction
Frey, Andreas; Seitz, Nicki-Nils – Studies in Educational Evaluation, 2009
The paper gives an overview of multidimensional adaptive testing (MAT) and evaluates its applicability in educational and psychological testing. The approach of Segall (1996) is described as a general framework for MAT. The main advantage of MAT is its capability to increase measurement efficiency. In simulation studies conceptualizing situations…
Descriptors: Psychological Testing, Adaptive Testing, Simulation, Evaluation Methods
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria