Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 11 |
Descriptor
Source
Educational Measurement:… | 11 |
Author
Bostic, Jonathan D. | 1 |
Camara, Wayne J. | 1 |
Carney, Michele B. | 1 |
Chapelle, Carol A. | 1 |
Enright, Mary K. | 1 |
Haberman, Shelby J. | 1 |
Harris, William G. | 1 |
Jamieson, Joan | 1 |
Jimmy de la Torre | 1 |
Jinran Wu | 1 |
Jonson, Jessica L. | 1 |
More ▼ |
Publication Type
Journal Articles | 11 |
Reports - Descriptive | 4 |
Reports - Evaluative | 4 |
Information Analyses | 2 |
Opinion Papers | 1 |
Reports - Research | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Russell, Michael – Educational Measurement: Issues and Practice, 2022
Despite agreement about the central importance of validity for educational and psychological testing, consensus regarding the definition of validity remains elusive. Differences in the definition of validity are examined and reveals that a potential cause of disagreement stems from differences in word use and meanings given to key terms commonly…
Descriptors: Test Validity, Psychological Testing, Educational Testing, Vocabulary
Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024
Multidimensional forced-choice (MFC) items have been found to be useful to reduce response biases in personality assessments. However, conventional scoring methods for the MFC items result in ipsative data, hindering the wider applications of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…
Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment
Lavery, Matthew Ryan; Bostic, Jonathan D.; Kruse, Lance; Krupa, Erin E.; Carney, Michele B. – Educational Measurement: Issues and Practice, 2020
Since it was formalized by Kane, the argument-based approach to validation has been promoted as the preferred method for validating interpretations and uses of test scores. Because validation is discussed in terms of arguments, and arguments are both interactive and social, the present review systematically examines the scholarly arguments which…
Descriptors: Persuasive Discourse, Validity, Research Methodology, Peer Evaluation
Jonson, Jessica L.; Trantham, Pamela; Usher-Tate, Betty Jean – Educational Measurement: Issues and Practice, 2019
One of the substantive changes in the 2014 Standards for Educational and Psychological Testing was the elevation of fairness in testing as a foundational element of practice in addition to validity and reliability. Previous research indicates that testing practices often do not align with professional standards and guidelines. Therefore, to raise…
Descriptors: Culture Fair Tests, Test Validity, Test Reliability, Intelligence Tests
Plake, Barbara S.; Wise, Lauress L. – Educational Measurement: Issues and Practice, 2014
With the 2014 publication of the 5th revision of the "Standards for Educational and Psychological Testing," the cochairs of the Joint Committee for the revision process were asked to consider the role and importance of the "Standards" for the educational testing community, and in particular for members of the National Council…
Descriptors: Standards, Educational Testing, Psychological Testing, Role
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Educational Measurement: Issues and Practice, 2011
The purpose of this ITEMS module is to provide an introduction to subscores. First, examples of subscores from an operational test are provided. Then, a review of methods that can be used to examine if subscores have adequate psychometric quality is provided. It is demonstrated, using results from operational and simulated data, that subscores…
Descriptors: Scores, Psychometrics, Tests, Data
Chapelle, Carol A.; Enright, Mary K.; Jamieson, Joan – Educational Measurement: Issues and Practice, 2010
Drawing on experience between 2000 and 2007 in developing a validity argument for the high-stakes Test of English as a "Foreign Language[TM]" (TOEFL[R]), this paper evaluates the differences between the argument-based approach to validity as presented by "Kane (2006)" and that described in the 1999 "AERA/APA/NCME Standards for Educational and…
Descriptors: Psychological Testing, Validity, High Stakes Tests, English (Second Language)
Harris, William G. – Educational Measurement: Issues and Practice, 2006
Some of the challenges that test publishers face in constructing educational assessments that meet high technical quality as prescribed in the "Standards for Educational and Psychological Testing" (AERA, APA, NCME, 1999) are examined. Federal educational initiatives are used to illustrate demands on technical quality that challenge the efforts of…
Descriptors: Standards, Educational Testing, Psychological Testing, Educational Assessment
Camara, Wayne J.; Lane, Suzanne – Educational Measurement: Issues and Practice, 2006
The "Standards for Educational and Psychological Testing" have evolved in the breadth and depth of coverage of issues in educational testing and measurement since their first publication in 1954. There were a number of substantive changes in the 1999 revision that addressed validity, fairness, accommodations, and compliance with the…
Descriptors: Educational Assessment, Revision (Written Composition), Standards, Psychological Testing
Sireci, Stephen G.; Parker, Polly – Educational Measurement: Issues and Practice, 2006
The psychometric literature is replete with comprehensive discussions of test validity, test validation, and the characteristics of quality assessment programs. The most authoritative source for guidance regarding sound test development and evaluation practices is the Standards for Educational and Psychological Testing. However, the Standards are…
Descriptors: Psychometrics, Test Validity, Educational Testing, Psychological Testing
Koretz, Daniel – Educational Measurement: Issues and Practice, 2006
The goal of the Standards for Educational and Psychological Testing is to improve testing practices, but their impact on practice appears spotty. Self-regulation clearly fails in some instances. The establishment of an external agency to oversee testing practices and adherence to the Standards would face substantial hurdles, and the ambiguity of…
Descriptors: Program Implementation, Educational Testing, Psychological Testing, Standard Setting