ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	8

Descriptor

Test Validity	24
Standards	16
Test Construction	8
Court Litigation	7
Licensing Examinations…	6
Testing Problems	6
Test Bias	5
Test Items	5
Civil Rights Legislation	4
Educational Testing	4
Legal Problems	4
Occupational Tests	4
Psychological Testing	4
Psychometrics	4
State Standards	4
Test Use	4
Measurement Techniques	3
Models	3
Standard Setting (Scoring)	3
Test Reliability	3
Test Theory	3
Testing Programs	3
Computer Assisted Testing	2
Constitutional Law	2
Content Validity	2
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	24
Reports - Evaluative	10
Opinion Papers	8
Reports - Research	6
Reports - Descriptive	3
Information Analyses	2
Speeches/Meeting Papers	2

Education Level

Grade 9	1
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Civil Rights Act 1964 Title…	2
Debra P v Turlington	1
Fourteenth Amendment	1
No Child Left Behind Act 2001	1

Assessments and Surveys

Florida State Student…	1
National Teacher Examinations	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Applying a Mixture Rasch Model-Based Approach to Standard Setting

Peer reviewed

Direct link

Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023

The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…

Descriptors: Item Response Theory, Standard Setting, Testing, Sampling

Supporting the Interpretive Validity of Student-Level Claims in Science Assessment with Tiered Claim Structures

Peer reviewed

Direct link

Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022

We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…

Descriptors: Science Tests, Test Validity, Test Items, Test Construction

Using the "Joint Standards" to Design Postsecondary Assessments with Evidence of Validity and Reliability: An Approach to CAEP Accreditation

Peer reviewed

Direct link

Wilkerson, Judy R. – Educational Measurement: Issues and Practice, 2020

Validity and reliability are a major focus in teacher education accreditation by the Council for Accreditation of Educator Preparation (CAEP). CAEP requires the use of "accepted research standards," but many faculty and administrators are unsure how to meet this requirement. The Standards of Educational and Psychological Testing…

Descriptors: Test Construction, Test Validity, Test Reliability, Teacher Education Programs

An Evaluative Framework for Reviewing Fairness Standards and Practices in Educational Tests

Peer reviewed

Direct link

Jonson, Jessica L.; Trantham, Pamela; Usher-Tate, Betty Jean – Educational Measurement: Issues and Practice, 2019

One of the substantive changes in the 2014 Standards for Educational and Psychological Testing was the elevation of fairness in testing as a foundational element of practice in addition to validity and reliability. Previous research indicates that testing practices often do not align with professional standards and guidelines. Therefore, to raise…

Descriptors: Culture Fair Tests, Test Validity, Test Reliability, Intelligence Tests

Examining Effectiveness and Validity of Accommodations for English Language Learners in Mathematics: An Evidence-Based Computer Accommodation Decision System

Peer reviewed

Direct link

Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020

Research indicates that the performance-gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance-gap, but many accommodations studies have not used a randomized design and are based on relatively small…

Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards

A Process for Reviewing and Evaluating Generated Test Items

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016

Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…

Descriptors: Test Items, Test Construction, Psychometrics, Models

Universal Design and Multimethod Approaches to Item Review

Peer reviewed

Direct link

Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008

Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…

Descriptors: Test Items, Disabilities, Test Construction, Testing Programs

Two Weak Spots in the Practice of Criterion-referenced Measurement.

Peer reviewed

Linn, Robert L. – Educational Measurement: Issues and Practice, 1982

Confusion in the terminology used in criterion-referenced measurement specifications and development and standard setting and the attendant role of cut-off scores are shown to need practical clarification through psychometric research on test applications and consequences. (CM)

Descriptors: Academic Standards, Criterion Referenced Tests, Cutting Scores, Measurement Objectives

Validation of the NTE Tests for Certification Decisions.

Peer reviewed

Cross, Lawrence H. – Educational Measurement: Issues and Practice, 1985

Before using the National Teacher Examinations (NTE) for teacher certification, states are required to conduct a state-wide validity study. This paper describes approaches used in 35 studies for 18 states to establish (NTE) content validity through curriculum and job relatedness and to establish minimum performance standards. (BS)

Descriptors: Elementary Education, Minimum Competency Testing, Standard Setting (Scoring), Standardized Tests

Validity of High-Stakes Assessment: Are Students Engaged in Complex Thinking?

Peer reviewed

Direct link

Lane, Suzanne – Educational Measurement: Issues and Practice, 2004

The validity of high-stakes assessments and accountability systems is discussed in relation to the requirements of No Child Left Behind (NCLB). The extent to which content standards and assessments are cognitively rich, the challenges in setting performance standards, and the impact of high-stakes assessments on instruction and student learning…

Descriptors: Federal Legislation, High Stakes Tests, Critical Thinking, Accountability

Validity on Trial: Psychometric and Legal Conceptualizations of Validity

Peer reviewed

Direct link

Sireci, Stephen G.; Parker, Polly – Educational Measurement: Issues and Practice, 2006

The psychometric literature is replete with comprehensive discussions of test validity, test validation, and the characteristics of quality assessment programs. The most authoritative source for guidance regarding sound test development and evaluation practices is the Standards for Educational and Psychological Testing. However, the Standards are…

Descriptors: Psychometrics, Test Validity, Educational Testing, Psychological Testing

Test Standards--Some Implications for the Measurement Curriculum.

Peer reviewed

Frisbie, David A.; Friedman, Stephen J. – Educational Measurement: Issues and Practice, 1987

This paper demonstrates how an analysis of the "Standards for Educational and Psychological Testing" (1985) can define the body of knowledge needed by teachers for the effective use of tests in classroom instruction. Procedures are described for identifying standards relevant to teachers' roles and their behavior. (SLD)

Descriptors: Measurement Techniques, Methods Courses, Preservice Teacher Education, Standards

Commentary on Values and Standards in Performance Assessment.

Peer reviewed

Guion, Robert M. – Educational Measurement: Issues and Practice, 1995

This commentary discusses three essential themes in performance assessment and its scoring. First, scores should mean something. Second, performance scores should permit fair and meaningful comparisons. Third, validity-reducing errors should be minimal. Increased attention to performance assessment may overcome these problems. (SLD)

Descriptors: Educational Assessment, Performance Based Assessment, Scores, Scoring

Is the Curriculum a Reasonable Basis for Assessment Reform?

Peer reviewed

Nitko, Anthony J. – Educational Measurement: Issues and Practice, 1995

If curriculum is to be the basis for assessment reform, assessment specialists must model the process for producing valid assessment products. Validity criteria should guide any model for the assessment development process. However, curriculum-based assessment systems should not be confused with standards-driven assessment systems. (SLD)

Descriptors: Criteria, Curriculum Based Assessment, Educational Change, Evaluation Methods

The Golden Rule Settlement: A Minority Perspective.

Peer reviewed

Bond, Lloyd – Educational Measurement: Issues and Practice, 1987

This article suggests that mechanical application of Golden Rule-like procedures is inappropriate. The fundamental idea embodied in them, namely, that of taking issues of equity into account in test construction, may reasonably be done without doing violence to test validity. (JAZ)

Descriptors: Court Litigation, Item Analysis, Minority Groups, Standards

Previous Page | Next Page »

Pages: 1 | 2

Linn, Robert L.	2
Abedi, Jamal	1
Bond, Lloyd	1
Bottsford-Miller, Nicole A.	1
Brennan, Robert L.	1
Cross, Lawrence H.	1
Drasgow, Fritz	1
Faggen, Jane	1
Friedman, Stephen J.	1
Frisbie, David A.	1
Gierl, Mark J.	1
Gong, Brian	1
Guion, Robert M.	1
Hambleton, Ronald K.	1
Johnstone, Christopher J.	1
Jonson, Jessica L.	1
Kuehn, Phyllis A.	1
Lai, Hollis	1
Lane, Suzanne	1
Lee, Hansol	1
Lott, Winsor	1
Meng, Yu	1
Muckle, Timothy J.	1
Nitko, Anthony J.	1
More ▼