ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	10

Descriptor

Decision Making	14
Error of Measurement	14
Reliability	4
Estimation (Mathematics)	3
Statistical Analysis	3
Test Reliability	3
Computation	2
Cutting Scores	2
Generalizability Theory	2
Inferences	2
Item Analysis	2
Item Response Theory	2
Measurement	2
Models	2
Probability	2
Responses	2
School Districts	2
Test Validity	2
Academic Achievement	1
Achievement Gains	1
Achievement Tests	1
Adaptive Testing	1
Administrator Attitudes	1
Administrators	1
Analysis of Variance	1
More ▼

Source

American Journal of Business…	1
Carnegie Foundation for the…	1
Economics of Education Review	1
Educational Measurement:…	1
Evaluation and the Health…	1
Home Economics Research…	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Literacy	1
Policy Analysis for…	1
Psychological Methods	1
Psychological Review	1
Studies in Philosophy and…	1
More ▼

Publication Type

Reports - Evaluative	14
Journal Articles	11
Opinion Papers	2
Reports - Research	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	2
Elementary Secondary Education	1
Grade 1	1
Grade 3	1
Higher Education	1
Kindergarten	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

California	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Examining the Precision of Cut Scores within a Generalizability Theory Framework: A Closer Look at the Item Effect

Peer reviewed

Direct link

Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020

An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…

Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting

Campbell's Law and the Ethics of Immensurability

Peer reviewed

Direct link

Sidorkin, Alexander M. – Studies in Philosophy and Education, 2016

The paper examines "Campbell's Law": "The more any quantitative social indicator is used for social decision-making, the more subject it will be to corruption pressures and the more apt it will be to distort and corrupt the social processes it is intended to monitor." The examination of measurability leads to explaining the…

Descriptors: Social Indicators, Measurement, Decision Making, Error of Measurement

Policy and Evidence: A Critical Analysis of the Year 1 Phonics Screening Check in England

Peer reviewed

Direct link

Grundin, Hans U. – Literacy, 2018

This paper aims to present a critical analysis of the Year 1 Phonics Screening Check (PSC), with special focus on the relationship between the UK Department for Education's policy-making and the evidence considered in the process of developing and evaluating the PSC. The reports from the in-house Standards and Testing Agency and from commissioned…

Descriptors: Foreign Countries, Criticism, Screening Tests, Phonics

Measuring Social Emotional Learning through Student Surveys in the CORE Districts: A Pragmatic Approach to Validity and Reliability

Download full text

Gehlbach, Hunter; Hough, Heather J. – Policy Analysis for California Education, PACE, 2018

As educational practitioners and policymakers expand the range of student outcomes they assess, student perception surveys--particularly those targeting social-emotional learning--have grown in popularity. Despite excitement around the potential for measuring a wider array of important student outcomes, concerns about the validity of the…

Descriptors: Social Development, Emotional Development, Validity, School Districts

Testing Mixture Models of Transitive Preference: Comment on Regenwetter, Dana, and Davis-Stober (2011)

Peer reviewed

Direct link

Birnbaum, Michael H. – Psychological Review, 2011

This article contrasts 2 approaches to analyzing transitivity of preference and other behavioral properties in choice data. The approach of Regenwetter, Dana, and Davis-Stober (2011) assumes that on each choice, a decision maker samples randomly from a mixture of preference orders to determine whether "A" is preferred to "B." In contrast, Birnbaum…

Descriptors: Evidence, Testing, Computation, Probability

How Should Educators Interpret Value-Added Scores? What We Know Series: Value-Added Methods and Applications. Knowledge Brief 1

Download full text

Raudenbush, Stephen W.; Jean, Marshall – Carnegie Foundation for the Advancement of Teaching, 2012

A teacher's value-added score is intended to convey how much that teacher has contributed to student learning in a particular subject in a particular year. Different school districts define and compute value-added scores in different ways. A variety of people may see value-added estimates, and each group may use them for different purposes.…

Descriptors: Teacher Effectiveness, Achievement Tests, Statistical Bias, Teacher Evaluation

Reliability of Decision-Making Frameworks for Response to Intervention for Reading

Peer reviewed

Direct link

Burns, Matthew K.; Scholin, Sarah E.; Kosciolek, Stacey; Livingston, Judy – Journal of Psychoeducational Assessment, 2010

The current study examines the consistency of two response-to-intervention (RTI) decision-making models. Weekly progress monitoring data for 30 students participating in a Tier II intervention were collected for 30 weeks. The data were examined by comparing them to an aimline with a yearly goal and by computing a dual discrepancy (DD) using…

Descriptors: Reading Achievement, Reading Tests, Data Collection, Responses

BCS or Just BS: How College Football Could Crown the Wrong National Champion? Just Do the Math--Correctly!

Peer reviewed
PDF on ERIC

Download full text

Teasley, C.E. Wynn; Hornyak, Martin – American Journal of Business Education, 2010

The 2009 college football season is here, but there has been a continuing controversy swirling over how the Football Bowl Subdivision (FBS) selects its national champion. College football uses a multi-criterion decision matrix (MCDM) evaluation technique to determine which two teams will play for the national championship. We analyzed the BCS…

Descriptors: Business Administration, Business Administration Education, Team Sports, College Athletics

Measurement Error, Education Production and Data Envelopment Analysis

Peer reviewed

Direct link

Ruggiero, John – Economics of Education Review, 2006

Data Envelopment Analysis has become a popular tool for evaluating the efficiency of decision making units. The nonparametric approach has been widely applied to educational production. The approach is, however, deterministic and leads to biased estimates of performance in the presence of measurement error. Numerous simulation studies confirm the…

Descriptors: Data Analysis, Decision Making, Efficiency, Productivity

On the Consistency of Individual Classification Using Short Scales

Peer reviewed

Direct link

Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R. – Psychological Methods, 2007

Short tests containing at most 15 items are used in clinical and health psychology, medicine, and psychiatry for making decisions about patients. Because short tests have large measurement error, the authors ask whether they are reliable enough for classifying patients into a treatment and a nontreatment group. For a given certainty level,…

Descriptors: Psychiatry, Patients, Error of Measurement, Test Length

Methodological Note: Let the t-Test Rest in Peace--A Note on the Control of Error Rates.

Peer reviewed

Moran, James D., III – Home Economics Research Journal, 1986

Describes the appropriate control of experiment-wise error rates and the researcher's role in making decisions to control for Type 1 errors. Issues related to level of significance, one- versus two-tailed tests, and the use of multivariate statistics are included. (Author/CT)

Descriptors: Decision Making, Error of Measurement, Multivariate Analysis, Research Methodology

When Classical Measurement Theory Is Insufficient and Generalizability Theory Is Essential.

Download full text

Thompson, Bruce; Crowley, Susan – 1994

Most training programs in education and psychology focus on classical test theory techniques for assessing score dependability. This paper discusses generalizability theory and explores its concepts using a small heuristic data set. Generalizability theory subsumes and extends classical test score theory. It is able to estimate the magnitude of…

Descriptors: Analysis of Variance, Cutting Scores, Decision Making, Error of Measurement

Generalizability of Performance Assessments.

Peer reviewed

Brennan, Robert L.; Johnson, Eugene G. – Educational Measurement: Issues and Practice, 1995

The application of generalizability theory to the reliability and error variance estimation for performance assessment scores is discussed. Decision makers concerned with performance assessment need to realize the restrictions that limit generalizability such as limitations that lead to reductions in the number of tasks possible, rater quality,…

Descriptors: Decision Making, Educational Assessment, Error of Measurement, Estimation (Mathematics)

Confidence in Pass/Fail Decisions for Computer Adaptive and Paper and Pencil Examinations.

Peer reviewed

Bergstrom, Betty A.; Lunz, Mary E. – Evaluation and the Health Professions, 1992

The level of confidence in pass/fail decisions obtained with computerized adaptive tests and paper-and-pencil tests was greater for 645 medical technology students when the computer adaptive test implemented a 90 percent confidence stopping rule than for paper-and-pencil tests of comparable length. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Confidence Testing

Bergstrom, Betty A.	1
Birnbaum, Michael H.	1
Brennan, Robert L.	1
Burns, Matthew K.	1
Clauser, Brian E.	1
Clauser, Jerome C.	1
Crowley, Susan	1
Emons, Wilco H. M.	1
Gehlbach, Hunter	1
Grundin, Hans U.	1
Hornyak, Martin	1
Hough, Heather J.	1
Jean, Marshall	1
Johnson, Eugene G.	1
Kane, Michael	1
Kosciolek, Stacey	1
Livingston, Judy	1
Lunz, Mary E.	1
Meijer, Rob R.	1
Moran, James D., III	1
Raudenbush, Stephen W.	1
Ruggiero, John	1
Scholin, Sarah E.	1
Sidorkin, Alexander M.	1
Sijtsma, Klaas	1
More ▼