NotesFAQContact Us
Collection
Advanced
Search Tips
Source
Educational Measurement:…28
Audience
What Works Clearinghouse Rating
Showing 1 to 15 of 28 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Russell, Michael – Educational Measurement: Issues and Practice, 2022
Despite agreement about the central importance of validity for educational and psychological testing, consensus regarding the definition of validity remains elusive. Differences in the definition of validity are examined and reveals that a potential cause of disagreement stems from differences in word use and meanings given to key terms commonly…
Descriptors: Test Validity, Psychological Testing, Educational Testing, Vocabulary
Peer reviewed Peer reviewed
Direct linkDirect link
Coggeshall, Whitney Smiley – Educational Measurement: Issues and Practice, 2021
The continuous testing framework, where both successful and unsuccessful examinees have to demonstrate continued proficiency at frequent prespecified intervals, is a framework that is used in noncognitive assessment and is gaining in popularity in cognitive assessment. Despite the rigorous advantages of this framework, this paper demonstrates that…
Descriptors: Classification, Accuracy, Testing, Failure
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul E. – Educational Measurement: Issues and Practice, 2020
Educational assessment involves eliciting, transmitting, and receiving information concerning the level of proficiency of a learner in a specified domain. With that in mind, it is perhaps surprising that the literature seems to make very little use of the signal processing metaphor. The present article begins by making a general case for greater…
Descriptors: Educational Assessment, Student Evaluation, Evaluative Thinking, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
O'Leary, Timothy M.; Hattie, John A. C.; Griffin, Patrick – Educational Measurement: Issues and Practice, 2017
Validity is the most fundamental consideration in test development. Understandably, much time, effort, and money is spent in its pursuit. Central to the modern conception of validity are the interpretations made, and uses planned, on the basis of test scores. There is, unfortunately, however, evidence that test users have difficulty understanding…
Descriptors: Test Interpretation, Scores, Test Validity, Evidence
Peer reviewed Peer reviewed
Direct linkDirect link
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
Peer reviewed Peer reviewed
Direct linkDirect link
Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008
Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…
Descriptors: Test Items, Disabilities, Test Construction, Testing Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Haberman, Shelby; Puhan, Gautam – Educational Measurement: Issues and Practice, 2007
There is an increasing interest in reporting subscores, both at examinee level and at aggregate levels. However, it is important to ensure reasonable subscore performance in terms of high reliability and validity to minimize incorrect instructional and remediation decisions. This article employs a statistical measure based on classical test theory…
Descriptors: Test Reliability, Test Theory, Test Validity, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Gorin, Joanna S. – Educational Measurement: Issues and Practice, 2006
One of the primary themes of the National Research Council's 2001 book "Knowing What Students Know" was the importance of cognition as a component of assessment design and measurement theory (NRC, 2001). One reaction to the book has been an increased use of sophisticated statistical methods to model cognitive information available in test data.…
Descriptors: Test Construction, Student Evaluation, Academic Ability, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Lane, Suzanne – Educational Measurement: Issues and Practice, 2004
The validity of high-stakes assessments and accountability systems is discussed in relation to the requirements of No Child Left Behind (NCLB). The extent to which content standards and assessments are cognitively rich, the challenges in setting performance standards, and the impact of high-stakes assessments on instruction and student learning…
Descriptors: Federal Legislation, High Stakes Tests, Critical Thinking, Accountability
Peer reviewed Peer reviewed
Direct linkDirect link
Sireci, Stephen G.; Parker, Polly – Educational Measurement: Issues and Practice, 2006
The psychometric literature is replete with comprehensive discussions of test validity, test validation, and the characteristics of quality assessment programs. The most authoritative source for guidance regarding sound test development and evaluation practices is the Standards for Educational and Psychological Testing. However, the Standards are…
Descriptors: Psychometrics, Test Validity, Educational Testing, Psychological Testing
Peer reviewed Peer reviewed
Frisbie, David A.; Friedman, Stephen J. – Educational Measurement: Issues and Practice, 1987
This paper demonstrates how an analysis of the "Standards for Educational and Psychological Testing" (1985) can define the body of knowledge needed by teachers for the effective use of tests in classroom instruction. Procedures are described for identifying standards relevant to teachers' roles and their behavior. (SLD)
Descriptors: Measurement Techniques, Methods Courses, Preservice Teacher Education, Standards
Peer reviewed Peer reviewed
Nitko, Anthony J. – Educational Measurement: Issues and Practice, 1995
If curriculum is to be the basis for assessment reform, assessment specialists must model the process for producing valid assessment products. Validity criteria should guide any model for the assessment development process. However, curriculum-based assessment systems should not be confused with standards-driven assessment systems. (SLD)
Descriptors: Criteria, Curriculum Based Assessment, Educational Change, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007
The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…
Descriptors: Inferences, Disabilities, Rating Scales, Eligibility
Peer reviewed Peer reviewed
Haney, Walter M. – Educational Measurement: Issues and Practice, 1982
The findings in Volume I of the Committee on Ability Testing's report (see ED 213 770 and ED 213 771) are shown to be ambiguous regarding the meaning of ability, vague regarding test validity, and ingenuous regarding test uses and misuse. The neglect of the basic question of what tests measure is noted. (CM)
Descriptors: Ability, Advisory Committees, Educational Testing, Elementary Secondary Education
Peer reviewed Peer reviewed
Rudner, Lawrence M. – Educational Measurement: Issues and Practice, 1990
Three major pragmatic issues in computerized testing are addressed: (1) encouraging teacher use; (2) reporting of information; and (3) test construction. Reference is made to four related articles. Additional areas for research include reporting of test information; item bank standards; validity; and rules for stopping in computerized testing.…
Descriptors: Computer Assisted Testing, Evaluation Utilization, Item Banks, Research Needs
Previous Page | Next Page ยป
Pages: 1  |  2