ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Statistical Analysis	8
Test Items	8
Diagnostic Tests	2
Educational Assessment	2
Foreign Countries	2
Measurement Techniques	2
Models	2
Reliability	2
Scores	2
Test Bias	2
Test Construction	2
Validity	2
Case Studies	1
Cognitive Tests	1
Data Analysis	1
Disabilities	1
Equations (Mathematics)	1
Evaluation Methods	1
Factor Analysis	1
Glossaries	1
Goodness of Fit	1
Graduate Students	1
Groups	1
Hypothesis Testing	1
Interviews	1
More ▼

Source

Educational Measurement:…

Author

Almehrizi, Rashid S.	1
Banks, Kathleen	1
Bottsford-Miller, Nicole A.	1
Clauser, Brian E.	1
Cui, Ying	1
Davenport, Ernest C.	1
Davison, Mark L.	1
Gierl, Mark J.	1
Harring, Jeffrey R.	1
Johnson, Tessa L.	1
Johnstone, Christopher J.	1
Liou, Pey-Yan	1
Love, Quintin U.	1
Mazor, Kathleen M.	1
Roberts, Mary Roduta	1
Thompson, Sandra J.	1
Thurlow, Martha L.	1
More ▼

Publication Type

Journal Articles	8
Reports - Evaluative	4
Reports - Descriptive	2
Reports - Research	2
Information Analyses	1
Opinion Papers	1

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Secondary Education	1

Audience

Location

Canada	2
United States	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

Digital Module 16: Longitudinal Data Analysis

Peer reviewed

Direct link

Harring, Jeffrey R.; Johnson, Tessa L. – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Jeffrey Harring and Ms. Tessa Johnson introduce the linear mixed effects (LME) model as a flexible general framework for simultaneously modeling continuous repeated measures data with a scientifically defensible function that adequately summarizes both individual change as well as the average response. The module…

Descriptors: Educational Assessment, Data Analysis, Longitudinal Studies, Case Studies

Easier Said than Done: Rejoinder on Sijtsma and on Green and Yang

Peer reviewed

Direct link

Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2016

The main points of Sijtsma and Green and Yang in Educational Measurement: Issues and Practice (34, 4) are that reliability, internal consistency, and unidimensionality are distinct and that Cronbach's alpha may be problematic. Neither of these assertions are at odds with Davenport, Davison, Liou, and Love in the same issue. However, many authors…

Descriptors: Educational Assessment, Reliability, Validity, Test Construction

A Synthesis of the Peer-Reviewed Differential Bundle Functioning Research

Peer reviewed

Direct link

Banks, Kathleen – Educational Measurement: Issues and Practice, 2013

The purpose of this article was to present a synthesis of the peer-reviewed differential bundle functioning (DBF) research that has been conducted to date. A total of 16 studies were synthesized according to the following characteristics: tests used and learner groups, organizing principles used for developing bundles, DBF detection methods used,…

Descriptors: Test Bias, Research, Tests, Student Characteristics

Validating Student Score Inferences with Person-Fit Statistic and Verbal Reports: A Person-Fit Study for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Roberts, Mary Roduta – Educational Measurement: Issues and Practice, 2013

The goal of this study was to investigate the usefulness of person-fit analysis in validating student score inferences in a cognitive diagnostic assessment. In this study, a two-stage procedure was used to evaluate person fit for a diagnostic test in the domain of statistical hypothesis testing. In the first stage, the person-fit statistic, the…

Descriptors: Scores, Validity, Cognitive Tests, Diagnostic Tests

Universal Design and Multimethod Approaches to Item Review

Peer reviewed

Direct link

Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008

Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…

Descriptors: Test Items, Disabilities, Test Construction, Testing Programs

Using Statistical Procedures To Identify Differentially Functioning Test Items. An NCME Instructional Module.

Peer reviewed

Clauser, Brian E.; Mazor, Kathleen M. – Educational Measurement: Issues and Practice, 1998

This module prepares the reader to use statistical procedures to detect differentially functioning test items. The Mantel-Haenszel statistic, logistic regression, the SIBTEST procedure, the Standardization procedure, and various item response theory-based procedures are presented. Theoretical frameworks, strengths and weaknesses, and…

Descriptors: Item Bias, Item Response Theory, Statistical Analysis, Teaching Methods

Using Dimensionality-Based DIF Analyses to Identify and Interpret Constructs That Elicit Group Differences

Peer reviewed

Direct link

Gierl, Mark J. – Educational Measurement: Issues and Practice, 2005

In this paper I describe and illustrate the Roussos-Stout (1996) multidimensionality-based DIF analysis paradigm, with emphasis on its implication for the selection of a matching and studied subtest for DIF analyses. Standard DIF practice encourages an exploratory search for matching subtest items based on purely statistical criteria, such as a…

Descriptors: Models, Test Items, Test Bias, Statistical Analysis