Showing 1 to 15 of 24 results
Peer reviewed
Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017
Institutional assessment of student learning objectives has become a fact of life in American higher education, and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…
Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives
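The interrater-reliability question this study addresses can be illustrated with a simple agreement statistic. The following is a minimal sketch, not drawn from the article itself: percent agreement and Cohen's kappa for two raters scoring the same artifacts on a 0-4 rubric scale, with entirely hypothetical rater data.

```python
from collections import Counter

def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters assigning categorical scores."""
    n = len(ratings_a)
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Expected agreement under independence: for each category, the
    # product of the two raters' marginal proportions.
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    categories = set(count_a) | set(count_b)
    expected = sum((count_a[c] / n) * (count_b[c] / n) for c in categories)
    return (observed - expected) / (1 - expected)

# Hypothetical rubric scores (0-4) from two raters on ten artifacts.
rater_a = [3, 2, 4, 3, 1, 2, 3, 4, 2, 3]
rater_b = [3, 2, 3, 3, 1, 2, 4, 4, 2, 2]
print(f"observed agreement = {sum(a == b for a, b in zip(rater_a, rater_b)) / 10:.2f}")
print(f"kappa = {cohens_kappa(rater_a, rater_b):.3f}")
```

Kappa discounts the agreement two raters would reach by chance alone, which is why it is usually preferred over raw percent agreement for rubric-scoring studies of this kind.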
Peer reviewed
Hathcoat, John D.; Penn, Jeremy D. – Research & Practice in Assessment, 2012
Critics of standardized testing have recommended replacing standardized tests with more authentic assessment measures, such as classroom assignments, projects, or portfolios rated by a panel of raters using common rubrics. Little research has examined the consistency of scores across multiple authentic assignments or the implications of this…
Descriptors: Generalizability Theory, Performance Based Assessment, Writing Across the Curriculum, Standardized Tests
Peer reviewed
Whitely, Susan E. – Applied Psychological Measurement, 1979
A model which gives maximum likelihood estimates of measurement error within the context of a simplex model for practice effects is presented. The appropriateness of the model is tested for five traits, and error estimates are compared to the classical formula estimates. (Author/JKS)
Descriptors: Error of Measurement, Error Patterns, Higher Education, Mathematical Models
Peer reviewed
Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1984
Three types of test were compared: a completion test, a matching test, and a multiple-choice test. The completion test was more reliable than the matching test, and the matching test was more reliable than the multiple-choice test. (Author/BW)
Descriptors: Comparative Analysis, Error of Measurement, Higher Education, Mathematical Models
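Reliability comparisons of this kind are typically based on an internal-consistency coefficient. As a hedged illustration (not the authors' procedure), KR-20 for a dichotomously scored test, computed on hypothetical response data:

```python
def kr20(item_scores):
    """Kuder-Richardson formula 20 for 0/1-scored items.

    item_scores: one list of 0/1 responses per examinee.
    """
    n_items = len(item_scores[0])
    totals = [sum(person) for person in item_scores]
    mean_total = sum(totals) / len(totals)
    var_total = sum((t - mean_total) ** 2 for t in totals) / (len(totals) - 1)
    # Sum of item variances p*(1-p), p = proportion answering correctly.
    pq = 0.0
    for i in range(n_items):
        p = sum(person[i] for person in item_scores) / len(item_scores)
        pq += p * (1 - p)
    return (n_items / (n_items - 1)) * (1 - pq / var_total)

# Hypothetical 5-item responses for six examinees.
data = [
    [1, 1, 1, 0, 1],
    [1, 0, 1, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0],
    [1, 1, 0, 1, 1],
    [0, 1, 0, 0, 0],
]
print(f"KR-20 = {kr20(data):.3f}")
```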
Peer reviewed
Ryan, Joseph J.; And Others – Journal of Consulting and Clinical Psychology, 1983
Wechsler Adult Intelligence Scale-Revised protocols from two vocational counseling clients were scored by 19 psychologists and 20 graduate students. Regardless of scorer's experience level, mechanical scoring error produced summary scores varying by as much as 4 to 18 IQ points. (Author/RC)
Descriptors: Error of Measurement, Graduate Students, Higher Education, Intelligence Tests
Peer reviewed
Kleinke, David J. – Applied Psychological Measurement, 1979
Lord's, Millman's and Saupe's methods of approximating the standard error of measurement are reviewed. Through an empirical demonstration involving 200 university classroom tests, all three approximations are shown to be biased. (Author/JKS)
Descriptors: Error of Measurement, Error Patterns, Higher Education, Mathematical Formulas
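For context, the classical standard error of measurement that such approximations try to sidestep is SEM = s_x * sqrt(1 - r_xx), which requires a reliability estimate. A minimal sketch follows; the test statistics are hypothetical, and Lord's per-score binomial approximation is shown as it is commonly stated, not as quoted from the article:

```python
import math

def classical_sem(sd, reliability):
    """Classical SEM: total-score SD times sqrt(1 - reliability)."""
    return sd * math.sqrt(1 - reliability)

def lord_sem(raw_score, n_items):
    """Lord's binomial approximation for an individual raw score x
    on an n-item test: sqrt(x * (n - x) / (n - 1)). Needs no
    reliability estimate, only the score and the test length."""
    return math.sqrt(raw_score * (n_items - raw_score) / (n_items - 1))

# Hypothetical classroom test: 50 items, SD of 8 points, KR-20 of 0.84.
print(f"classical SEM      = {classical_sem(8.0, 0.84):.2f}")  # 3.20
print(f"Lord SEM at x = 35 = {lord_sem(35, 50):.2f}")          # 3.27
```

The appeal of approximations like Lord's for classroom tests is exactly what this comparison probes: they are cheap to compute, but as the abstract notes, all three turn out to be biased.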
Peer reviewed
Straton, Ralph G.; Catts, Ralph M. – Educational and Psychological Measurement, 1980
Multiple-choice tests composed entirely of two-, three-, or four-choice items were investigated. Results indicated that the number of alternatives per item was inversely related to item difficulty, but directly related to item discrimination. Reliability and standard error of measurement of three-choice item tests were equivalent or superior…
Descriptors: Difficulty Level, Error of Measurement, Foreign Countries, Higher Education
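The difficulty and discrimination indices compared in studies like this are the standard classical ones: difficulty as the proportion answering correctly, discrimination as the item-total (point-biserial) correlation. A sketch on hypothetical data (Python 3.10+ for statistics.correlation):

```python
import statistics

def item_stats(responses):
    """Classical item analysis for 0/1-scored responses.

    responses: one 0/1 vector per examinee.
    Returns (difficulty, discrimination) per item; discrimination is
    the correlation between the item and the total score. (Corrected
    versions exclude the item from the total before correlating.)
    """
    n_people = len(responses)
    n_items = len(responses[0])
    totals = [sum(r) for r in responses]
    results = []
    for i in range(n_items):
        item = [r[i] for r in responses]
        difficulty = sum(item) / n_people          # proportion correct
        discrimination = statistics.correlation(item, totals)
        results.append((difficulty, discrimination))
    return results

# Hypothetical 4-item responses for five examinees.
data = [
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
    [0, 0, 0, 1],
    [1, 1, 0, 0],
]
for i, (p, r) in enumerate(item_stats(data), start=1):
    print(f"item {i}: difficulty={p:.2f}, discrimination={r:+.2f}")
```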
Peer reviewed
Werts, C. E.; And Others – Educational and Psychological Measurement, 1980
Test-retest correlations can lead to biased reliability estimates when there is instability of true scores and/or when measurement errors are correlated. Using three administrations of the Test of Standard Written English and essay ratings, an analysis is demonstrated which separates true score instability and correlated errors. (Author/BW)
Descriptors: College Freshmen, Error of Measurement, Essay Tests, Higher Education
Peer reviewed
Rentz, R. Robert – Educational and Psychological Measurement, 1980
This paper elaborates on the work of Cardinet and others by clarifying some points regarding calculations, specifically with reference to existing computer programs, and by presenting illustrative examples of the calculation and interpretation of several generalizability coefficients from a complex six-facet (factor) design. (Author/RL)
Descriptors: Analysis of Variance, Computation, Computer Programs, Error of Measurement
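For readers unfamiliar with generalizability coefficients, the simplest case conveys the idea: in a one-facet persons x raters crossed design, variance components estimated from two-way ANOVA mean squares feed G = var_p / (var_p + var_res / n_r). The sketch below uses hypothetical data and is far simpler than the six-facet design the paper treats:

```python
def g_coefficient(scores):
    """Relative G coefficient for a crossed persons x raters design
    with one observation per cell. scores[p][r] is rater r's score
    for person p."""
    n_p, n_r = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n_p * n_r)
    person_means = [sum(row) / n_r for row in scores]
    rater_means = [sum(scores[p][r] for p in range(n_p)) / n_p
                   for r in range(n_r)]
    # Mean squares for persons and for the residual (interaction + error).
    ms_p = n_r * sum((m - grand) ** 2 for m in person_means) / (n_p - 1)
    ms_res = sum(
        (scores[p][r] - person_means[p] - rater_means[r] + grand) ** 2
        for p in range(n_p) for r in range(n_r)
    ) / ((n_p - 1) * (n_r - 1))
    var_p = max((ms_p - ms_res) / n_r, 0.0)  # person variance component
    # Relative error variance: residual divided by the number of raters.
    return var_p / (var_p + ms_res / n_r)

# Hypothetical scores: five persons each rated by three raters.
scores = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [3, 3, 3],
    [1, 2, 2],
]
print(f"G = {g_coefficient(scores):.3f}")
```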
Schmitt, Alicia P.; Crocker, Linda – 1981
The effectiveness of a strategy for improving performance on multiple-choice items for examinees with different levels of test anxiety was assessed. Undergraduate measurement students responded to the Mandler-Sarason Test Anxiety Scale and to an objective test covering course content. Results indicated that, for most examinees, generation of an…
Descriptors: Error of Measurement, Higher Education, Multiple Choice Tests, Response Style (Tests)
Scheetz, James P.; vonFraunhofer, J. Anthony – 1980
Subkoviak suggested a technique for estimating both group reliability and the reliability associated with assigning a given individual to a mastery or non-mastery category based on a single test administration. Two assumptions underlie this model. First, it is assumed that had successive test administrations occurred, scores for each individual…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Higher Education
Peer reviewed
Schaeffer, Gary A.; And Others – Evaluation Review, 1986
The reliability of criterion-referenced tests (CRTs) used in health program evaluation can be conceptualized in different ways. Formulas are presented for estimating appropriate standard error of measurement (SEM) for CRTs. The SEM can be used in computing confidence intervals for domain score estimates and for a cut-score. (Author/LMO)
Descriptors: Accountability, Criterion Referenced Tests, Cutting Scores, Error of Measurement
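The use described here, a confidence interval for a domain-score estimate built from the SEM, can be sketched as follows. The values are hypothetical, and the binomial SEM sqrt(p(1-p)/n) for a proportion-correct domain score is shown as one common illustrative choice, not as the authors' formula:

```python
import math

def domain_score_ci(raw_score, n_items, z=1.96):
    """Confidence interval for a domain (proportion-correct) score,
    using the binomial SEM sqrt(p(1-p)/n) as an illustrative choice."""
    p = raw_score / n_items
    sem = math.sqrt(p * (1 - p) / n_items)
    return p - z * sem, p + z * sem

# Hypothetical: 24 of 30 items correct; cut-score set at 0.70.
lo, hi = domain_score_ci(24, 30)
print(f"domain score 0.80, 95% CI ({lo:.3f}, {hi:.3f})")
print("above the cut with confidence" if lo > 0.70
      else "CI overlaps the cut-score")
```

With these numbers the interval (about 0.66 to 0.94) straddles the 0.70 cut, illustrating why the abstract highlights SEM-based intervals around cut-scores for mastery decisions.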
Cooper, Terence H. – Journal of Agronomic Education (JAE), 1988
Describes a study used to determine differences in exam reliability, difficulty, and student evaluations. Indicates that when a fourth option was added to the three-option items, the exams became more difficult. Includes methods, results discussion, and tables on student characteristics, whole test analyses, and selected items. (RT)
Descriptors: Agronomy, College Science, Error of Measurement, Evaluation Methods
Pike, Gary R. – 1991
Because change is fundamental to education and the measurement of change assesses the quality and effectiveness of postsecondary education, this study examined three methods of measuring change: (1) gain scores; (2) residual scores; and (3) repeated measures. Data for the study were obtained from transcripts of 722 graduating seniors at the…
Descriptors: Academic Achievement, College Seniors, Error of Measurement, Higher Education
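The first two change measures named here have simple closed forms: a gain score is posttest minus pretest, and a residual score is the posttest deviation from its least-squares regression on the pretest. A sketch on hypothetical data (Python 3.10+ for statistics.linear_regression):

```python
import statistics

def gain_scores(pre, post):
    """Simple difference scores: post - pre."""
    return [y - x for x, y in zip(pre, post)]

def residual_scores(pre, post):
    """Residualized change: posttest minus the value predicted
    from the pretest by least-squares regression."""
    slope, intercept = statistics.linear_regression(pre, post)
    return [y - (intercept + slope * x) for x, y in zip(pre, post)]

# Hypothetical pre/post scores for six students.
pre = [45, 52, 60, 48, 55, 62]
post = [50, 58, 63, 49, 61, 70]
print("gains:    ", gain_scores(pre, post))
print("residuals:", [round(r, 2) for r in residual_scores(pre, post)])
```

Gain scores are easy to interpret but inherit measurement error from both occasions, while residual scores remove the part of the posttest predictable from the pretest; comparing such methods is the point of the study.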
Fuller, Edward – 1973
This self-instructional manual for psychological assessment focuses on the following topics: (1) general statistics, (2) central tendency, (3) random, continuous, and discrete variables, (4) variability, (5) measuring variability, (6) sampling, (7) derived scores, (8) covariation, (9) reliability and validity, and (10) standard error of…
Descriptors: Autoinstructional Aids, Correlation, Error of Measurement, Guides