Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 16 |
Descriptor
Psychometrics | 60 |
Testing Problems | 60 |
Test Validity | 50 |
Test Construction | 24 |
Educational Assessment | 21 |
Evaluation Methods | 15 |
Measurement Techniques | 15 |
Elementary Secondary Education | 14 |
Educational Testing | 13 |
Test Reliability | 12 |
Test Interpretation | 11 |
More ▼ |
Source
Author
Thurlow, Martha | 4 |
Bielinski, John | 2 |
Dings, Jonathan | 2 |
Hurley, Christine | 2 |
Minnema, Jane | 2 |
Spicuzza, Richard | 2 |
Anderson, Scarvia B. | 1 |
Andrada, Gilbert N. | 1 |
Austin, J. Sue | 1 |
Baird, Jo-Anne | 1 |
Bernal, Ernesto M. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 12 |
Elementary Education | 3 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Preschool Education | 1 |
Audience
Researchers | 5 |
Practitioners | 1 |
Students | 1 |
Location
United States | 4 |
United Kingdom | 3 |
Kentucky | 2 |
United Kingdom (England) | 2 |
Canada | 1 |
Minnesota | 1 |
New Zealand | 1 |
United Kingdom (Wales) | 1 |
Laws, Policies, & Programs
Debra P v Turlington | 1 |
Individuals with Disabilities… | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025
Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…
Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Hipkins, Rosemary – set: Research Information for Teachers, 2019
PISA [Programme for International Student Assessment] will be in the news again this year. The 2018 results are due to be released at the end of 2019 and they usually generate media interest. This Rangahau Whakarapopoto is a research brief which outlines things to watch out for as you think about what the results might mean.
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment
McGill, Ryan J.; Styck, Kara M.; Palomares, Ronald S.; Hass, Michael R. – Learning Disability Quarterly, 2016
As a result of the upcoming Federal reauthorization of the Individuals With Disabilities Education Improvement Act (IDEA), practitioners and researchers have begun vigorously debating what constitutes evidence-based assessment for the identification of specific learning disability (SLD). This debate has resulted in strong support for a method that…
Descriptors: Learning Disabilities, Disability Identification, Disabilities, Federal Legislation
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Madaus, George F. – Educational Measurement: Issues and Practice, 1986
This reply to William A. Mehrens argues that test validity is the central issue in discussing the appropriate role of tests. It states that the procedures used to establish the validity of tests are inadequate because they depend primarily on content validity and not on construct and criterion validity. (JAZ)
Descriptors: Concurrent Validity, Construct Validity, Cutting Scores, Decision Making
Schoenfeld, Alan H. – Measurement: Interdisciplinary Research and Perspectives, 2007
The authors of this volume's stimulus papers have taken on the challenge of developing measures of teachers' mathematical knowledge for teaching (MKT). This task involves multiple decisions and considerations, including: (1) How does one specify the body of knowledge being assessed? What warrants are offered for those choices?; (2) How does one…
Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research
Gearhart, Maryl – Measurement: Interdisciplinary Research and Perspectives, 2007
Teacher knowledge has been of theoretical and empirical interest for over two decades, and development of measures is overdue. The researchers represented in this volume have been breaking new ground by developing a measure of mathematical knowledge for teaching (MKT) without guiding precedents, and in the face of differing perspectives on teacher…
Descriptors: Learning Theories, Elementary School Mathematics, Teaching Methods, Construct Validity
Kulikowich, Jonna M. – Measurement: Interdisciplinary Research and Perspectives, 2007
Operating from multiple literature bases in cognitive psychology, mathematics education, and theoretical and applied psychometrics, Schilling, Hill and their colleagues provide a systemic approach to studying the validity of scores of mathematical knowledge for teaching. This system encompasses an array of task formats and methodologies. The…
Descriptors: Multiple Choice Tests, Learning Theories, Teaching Methods, Construct Validity
Hill, Heather C. – Measurement: Interdisciplinary Research and Perspectives, 2007
The author offers some thoughts on commentator's reactions to the substance of the measures, particularly those about measuring teacher learning and change, based on the major uses of the measures, and because this is a significant challenge facing test development as an enterprise. If teacher learning results in more integrated knowledge or…
Descriptors: Educational Testing, Tests, Measurement, Faculty Development
Schilling, Stephen – Measurement: Interdisciplinary Research and Perspectives, 2007
In this article, the author echoes his co-author's and colleague's pleasure (Hill, this issue) at the thoughtfulness and far-ranging nature of the comments to their initial attempts at test validation for the mathematical knowledge for teaching (MKT) measures using the validity argument approach. Because of the large number of commentaries they…
Descriptors: Generalizability Theory, Persuasive Discourse, Educational Testing, Measurement

Parmar, Rene S.; And Others – Learning Disability Quarterly, 1996
This study used the Assessment Standards of the National Council of Teachers of Mathematics to evaluate the appropriateness and adequacy of selected standardized tests of mathematics achievement as they pertain to students with disabilities. Problems with content validity included inadequate representation of content domains, inappropriate…
Descriptors: Academic Standards, Content Validity, Elementary Secondary Education, Mathematics Achievement