ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	9

Descriptor

Statistical Analysis	12
Test Theory	12
Test Reliability	5
Computation	3
Evaluation Methods	3
Multiple Choice Tests	3
Test Items	3
Test Validity	3
Correlation	2
Educational Quality	2
Item Analysis	2
Mathematical Models	2
Measurement	2
Measurement Techniques	2
Psychological Testing	2
Scores	2
Student Evaluation	2
Academic Standards	1
Achievement Rating	1
Adolescents	1
Business Administration…	1
Cheating	1
Classification	1
Cognitive Measurement	1
College Instruction	1
More ▼

Source

Advances in Physiology…	1
Assessment in Education:…	1
Behavioral Research and…	1
Educational Measurement:…	1
Educational and Psychological…	1
International Journal of…	1
Journal of Early Adolescence	1
Journal of Educational and…	1
Marketing Education Review	1
Psychometrika	1
Quality Assurance in…	1
More ▼

Publication Type

Reports - Descriptive	12
Journal Articles	10
Information Analyses	1
Numerical/Quantitative Data	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1
More ▼

Audience

Practitioners

Location

Netherlands

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

A Measurement Is a Choice and Stevens' Scales of Measurement Do Not Help Make It: A Response to Chalmers

Peer reviewed

Direct link

Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019

Chalmers recently published a critique of the use of ordinal a[alpha] proposed in Zumbo et al. as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…

Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models

Components of Variance of Scales with a Bifactor Subscale Structure from Two Calculations of Alpha

Peer reviewed

Direct link

Andrich, David – Educational Measurement: Issues and Practice, 2016

Since Cronbach's (1951) elaboration of a from its introduction by Guttman (1945), this coefficient has become ubiquitous in characterizing assessment instruments in education, psychology, and other social sciences. Also ubiquitous are caveats on the calculation and interpretation of this coefficient. This article summarizes a recent contribution…

Descriptors: Computation, Correlation, Test Theory, Measures (Individuals)

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Direct link

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

Making Do with What We Have: Use Your Bootstraps

Peer reviewed

Direct link

Calmettes, Guillaume; Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012

A jack knife is a pocket knife that is put to many tasks, because it's ready to hand. Often there could be a better tool for the job, such as a screwdriver, a scraper, or a can-opener, but these are not usually pocket items. In statistical terms, the expression implies making do with what's available. Another simile, of an extreme situation, is…

Descriptors: Statistical Analysis, Computation, Population Distribution, Evaluation Methods

An Innovative Excel Application to Improve Exam Reliability in Marketing Courses

Peer reviewed

Direct link

Keller, Christopher M.; Kros, John F. – Marketing Education Review, 2011

Measures of survey reliability are commonly addressed in marketing courses. One statistic of reliability is "Cronbach's alpha." This paper presents an application of survey reliability as a reflexive application of multiple-choice exam validation. The application provides an interactive decision support system that incorporates survey item…

Descriptors: Test Validity, Marketing, Test Reliability, Multiple Choice Tests

Educational Measurement Issues and Implications of High Stakes Decision Making in Final Examinations in Secondary Education in the Netherlands

Peer reviewed

Direct link

van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012

While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. This latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…

Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making

Instrument Development Procedures for Mathematics Measures. Technical Report Number 08-02

Download full text

Jung, Eunju; Liu, Kimy; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008

The purpose of this study was to develop general outcome measures (GOM) in mathematics so that teachers could focus their instruction on needed prerequisite skills. We describe in detail, the manner in which content-related evidence was established and then present a number of statistical analyses conducted to evaluate the technical adequacy of…

Descriptors: Item Analysis, Test Construction, Test Theory, Mathematics Tests

Detecting Answer Copying when the Regular Response Process Follows a Known Response Model

Peer reviewed

Direct link

van der Linden, Wim J.; Sotaridona, Leonardo – Journal of Educational and Behavioral Statistics, 2006

A statistical test for detecting answer copying on multiple-choice items is presented. The test is based on the exact null distribution of the number of random matches between two test takers under the assumption that the response process follows a known response model. The null distribution can easily be generalized to the family of distributions…

Descriptors: Test Items, Multiple Choice Tests, Cheating, Responses

Quality Assurance of Multiple-Choice Tests

Peer reviewed

Direct link

Bush, Martin E. – Quality Assurance in Education: An International Perspective, 2006

Purpose: To provide educationalists with an understanding of the key quality issues relating to multiple-choice tests, and a set of guidelines for the quality assurance of such tests. Design/methodology/approach: The discussion of quality issues is structured to reflect the order in which those issues naturally arise. It covers the design of…

Descriptors: Multiple Choice Tests, Test Reliability, Educational Quality, Quality Control

Basic Concepts in Classical Test Theory: Relating Variance Partitioning in Substantive Analyses to the Same Process in Measurement Analyses.

Download full text

Dawson, Thomas E. – 1997

The basic processes in univariate statistics involve partitioning the sum of squares into two components: explained and within. This paper explains that the same partitioning occurs in measurement analyses, i.e., splitting the sum of squares into reliable and unreliable components. In addition, it is shown how the three types of error inherent in…

Descriptors: Estimation (Mathematics), Measurement Techniques, Scores, Statistical Analysis

Test Theory and Psychometrika: The Past Twenty-Five Years.

Peer reviewed

Lewis, Charles – Psychometrika, 1986

On the occasion of Psychometrika's fiftieth anniversary, the past twenty-five years' developments in mental test theory are reviewed. Psychometrika articles treating topics in test theory are listed in a bibliography. (Author/LMO)

Descriptors: Cognitive Measurement, Mathematical Models, Psychological Testing, Psychometrics

Louis Guttman's Contributions to Classical Test Theory

Peer reviewed

Direct link

Zimmerman, Donald W.; Williams, Richard H.; Zumbo, Bruno D.; Ross, Donald – International Journal of Testing, 2005

This article focuses on Louis Guttman's contributions to the classical theory of educational and psychological tests, one of the lesser known of his many contributions to quantitative methods in the social sciences. Guttman's work in this field provided a rigorous mathematical basis for ideas that, for many decades after Spearman's initial work,…

Descriptors: Evaluation Methods, Test Theory, Social Sciences, Psychological Testing

Zumbo, Bruno D.	2
Andrich, David	1
Beguin, A. A.	1
Bush, Martin E.	1
Calmettes, Guillaume	1
Dawson, Thomas E.	1
Drummond, Gordon B.	1
Fan, Xitao	1
Jung, Eunju	1
Keller, Christopher M.	1
Ketterlin-Geller, Leanne R.	1
Kroc, Edward	1
Kros, John F.	1
Lewis, Charles	1
Liu, Kimy	1
Ross, Donald	1
Sotaridona, Leonardo	1
Sun, Shaojing	1
Tindal, Gerald	1
Verstralen, H. H. F. M.	1
Vowler, Sarah L.	1
Williams, Richard H.	1
Zimmerman, Donald W.	1
van Rijn, P. W.	1
van der Linden, Wim J.	1
More ▼