Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 83 |
Descriptor
Educational Testing | 147 |
Psychometrics | 147 |
Educational Assessment | 49 |
Test Construction | 46 |
Evaluation Methods | 40 |
Student Evaluation | 35 |
Measurement | 31 |
Measurement Techniques | 30 |
Test Validity | 30 |
Elementary Secondary Education | 25 |
Item Response Theory | 25 |
More ▼ |
Source
Author
Glas, Cees A. W. | 3 |
Bielinski, John | 2 |
Cui, Ying | 2 |
Embretson, Susan E. | 2 |
Frey, Andreas | 2 |
Gierl, Mark J. | 2 |
Haberman, Shelby J. | 2 |
Minnema, Jane | 2 |
Newton, Paul E. | 2 |
Oakland, Thomas | 2 |
Schutz, Richard E. | 2 |
More ▼ |
Publication Type
Education Level
Location
United Kingdom | 6 |
United Kingdom (England) | 4 |
United States | 4 |
Australia | 2 |
New York | 2 |
New Zealand | 2 |
United Kingdom (Wales) | 2 |
Canada | 1 |
Florida | 1 |
Germany | 1 |
Japan | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Individuals with Disabilities… | 1 |
National Defense Education Act | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Timothy Donald Folger – ProQuest LLC, 2024
This dissertation aims to bridge the gap between validity theory and the practice of validation. The dissertation employs a three-article approach. Following the introduction in Chapter I, three independent manuscripts representing three empirical studies are presented (i.e., Chapters II - IV). Each chapter is a stand-alone publishable manuscript,…
Descriptors: Educational Testing, Psychological Testing, Test Validity, Delphi Technique
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023
Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…
Descriptors: Scores, Test Items, Accuracy, Psychometrics
Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024
We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…
Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners
Lim Hooi Lian; Wun Thiam Yew – International Journal of Assessment Tools in Education, 2023
The majority of students from elementary to tertiary levels have misunderstandings and challenges acquiring various statistical concepts and skills. However, the existing statistics assessment frameworks challenge practice in a classroom setting. The purpose of this research is to develop and validate a statistical thinking assessment tool…
Descriptors: Psychometrics, Grade 7, Middle School Mathematics, Statistics Education
O'Keeffe, Cormac – E-Learning and Digital Media, 2017
International Large Scale Assessments have been producing data about educational attainment for over 60 years. More recently however, these assessments as tests have become digitally and computationally complex and increasingly rely on the calculative work performed by algorithms. In this article I first consider the coordination of relations…
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment
Russell, Mike; Ludlow, Larry; O'Dwyer, Laura – Educational Measurement: Issues and Practice, 2019
The field of educational measurement has evolved considerably since the first doctoral programs were established. In response, programs have typically tacked on courses that address newly developed theories, methods, tools, and techniques. As our review of current programs evidences, this approach produces artificial distinctions among topics and…
Descriptors: Educational Testing, Specialists, Doctoral Programs, Program Evaluation
Borsboom, Denny; Wijsen, Lisa D. – Assessment in Education: Principles, Policy & Practice, 2017
The central role of educational testing practices in contemporary societies can hardly be overstated. It is furthermore evident that psychometric models regulate, justify, and legitimize the processes through which educational testing practices are used. In this commentary, the authors offer some observations that may be relevant for the analyses…
Descriptors: Educational Assessment, Learning, Psychometrics, Power Structure
Hathcoat, John D. – Practical Assessment, Research & Evaluation, 2013
The semantics, or meaning, of validity is a fluid concept in educational and psychological testing. Contemporary controversies surrounding this concept appear to stem from the proper location of validity. Under one view, validity is a property of score-based inferences and entailed uses of test scores. This view is challenged by the…
Descriptors: Test Validity, Educational Testing, Psychological Testing, Scores
Berk, Ronald A. – Journal of Faculty Development, 2016
Recently, student outcomes have bubbled to the top of debates about how to evaluate teaching in community and liberal arts colleges, universities, and professional schools, but even more international attention has been riveted on how outcomes are being used to evaluate teachers and administrators K-12 (Harris, 2012; Rowen & Raudenbush, 2016;…
Descriptors: Value Added Models, Academic Achievement, Outcomes of Education, Teacher Evaluation
Ramineni, Chaitanya; Williamson, David M. – Assessing Writing, 2013
In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used to evaluate e-rater, implications for a range of potential uses of e-rater, and directions for…
Descriptors: Educational Testing, Guidelines, Scoring, Psychometrics
Condon, William – Assessing Writing, 2013
Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…
Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Mayrath, Michael C., Ed.; Clarke-Midura, Jody, Ed.; Robinson, Daniel H., Ed.; Schraw, Gregory, Ed. – IAP - Information Age Publishing, Inc., 2012
Creative problem solving, collaboration, and technology fluency are core skills requisite of any nation's workforce that strives to be competitive in the 21st Century. Teaching these types of skills is an economic imperative, and assessment is a fundamental component of any pedagogical program. Yet, measurement of these skills is complex due to…
Descriptors: Expertise, Evidence, Integrated Curriculum, Educational Psychology
Feuer, Michael J. – Mid-Western Educational Researcher, 2011
In this keynote address, the author shares his reflections on politics, economics, and testing. He focuses on assessment and accountability and begins with some data from large scale written educational testing, "circa 1840". The author argues that people's penchant for accountability and their appetite for standardized testing are, in…
Descriptors: Testing Problems, Educational Testing, Standardized Tests, Risk