Showing all 14 results
Peer reviewed
Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020
Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…
Descriptors: Test Construction, Test Bias, Classification, Accuracy
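As a rough illustration of the selection-invariance idea in the abstract above, the sketch below simulates classification accuracy at a common cut score for two groups. It is illustrative only, not the authors' method: the group distributions, the error model, and the cut score are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed latent traits and a noisy test score for two groups; all
# distributions, the error model, and the cut score are illustrative.
theta_a = rng.normal(0.5, 1.0, 5000)    # group A latent trait
theta_b = rng.normal(0.0, 1.0, 5000)    # group B latent trait
status_a, status_b = theta_a >= 0.0, theta_b >= 0.0   # true classifications

x_a = theta_a + rng.normal(0.0, 0.5, 5000)   # observed score = trait + error
x_b = theta_b + rng.normal(0.0, 0.5, 5000)   # same error model in both groups

cut = 0.0   # common cut score applied to both groups
acc_a = np.mean((x_a >= cut) == status_a)
acc_b = np.mean((x_b >= cut) == status_b)
print(f"classification accuracy: group A {acc_a:.3f}, group B {acc_b:.3f}")
```

With these assumed values the test behaves identically in both groups, yet the group whose trait distribution is centered at the cut score is classified less accurately; that is the sense in which selection invariance can fail even without measurement bias.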
Peer reviewed
Adrian Adams; Lauren Barth-Cohen – CBE - Life Sciences Education, 2024
In undergraduate research settings, students are likely to encounter anomalous data, that is, data that do not meet their expectations. Most of the research that directly or indirectly captures the role of anomalous data in research settings uses post-hoc reflective interviews or surveys. These data collection approaches focus on recall of past…
Descriptors: Undergraduate Students, Physics, Science Instruction, Laboratory Experiments
Peer reviewed
Grundin, Hans U. – Literacy, 2018
This paper aims to present a critical analysis of the Year 1 Phonics Screening Check (PSC), with special focus on the relationship between the UK Department for Education's policy-making and the evidence considered in the process of developing and evaluating the PSC. The reports from the in-house Standards and Testing Agency and from commissioned…
Descriptors: Foreign Countries, Criticism, Screening Tests, Phonics
Peer reviewed
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in an educational system performs a number of functions; the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize tests in educational policies and quality assurance, their validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
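The abstract is truncated, but the descriptors point to item response theory. As a minimal, generic illustration of the kind of IRT machinery involved (not anything specific to this paper), the sketch below evaluates the Rasch item response function; the ability and difficulty values are arbitrary.

```python
import math

def rasch_p(theta: float, b: float) -> float:
    """Rasch model: P(correct) for ability theta on an item of
    difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

# Arbitrary values: an average-ability examinee (theta = 0) on items
# of varying difficulty.
for b in (-1.0, 0.0, 1.0):
    print(f"difficulty {b:+.1f}: P(correct) = {rasch_p(0.0, b):.3f}")
```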
Colton, Dean A.; Gao, Xiaohong; Harris, Deborah J.; Kolen, Michael J.; Martinovich-Barhite, Dara; Wang, Tianyou; Welch, Catherine J. – 1997
This collection consists of six papers, each dealing with some aspect of reliability and performance testing. Each paper has an abstract, and each contains its own references. Papers include: (1) "Using Reliabilities To Make Decisions" (Deborah J. Harris); (2) "Conditional Standard Errors, Reliability, and Decision Consistency…
Descriptors: Decision Making, Error of Measurement, Item Response Theory, Performance Based Assessment
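One topic listed above, conditional standard errors, has a classic closed form under Lord's binomial error model. Whether the paper in this collection uses exactly that model is not stated, so treat the sketch below as a generic illustration.

```python
import math

def binomial_csem(x: int, n: int) -> float:
    """Conditional standard error of measurement at raw score x on an
    n-item test under Lord's binomial error model: sqrt(x(n-x)/(n-1))."""
    return math.sqrt(x * (n - x) / (n - 1))

# The conditional SEM peaks mid-range and vanishes at the extremes.
n = 40
for x in (5, 20, 35):
    print(f"score {x}/{n}: CSEM = {binomial_csem(x, n):.2f}")
```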
Livingston, Samuel A. – 1978
The traditional reliability coefficient and standard error of measurement are not adequate measures of reliability for tests used to make pass/fail decisions. Answering the important reliability questions requires estimation of the joint distribution of true and observed scores. Lord's "Method 20" estimates this distribution without the…
Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement
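Lord's "Method 20" fits a flexible true-score distribution; a simpler stand-in that conveys the same idea of a joint true/observed-score model is the two-parameter beta-binomial (Keats-Lord) model sketched below. The item count and beta parameters are assumptions for illustration, not values from the paper.

```python
import numpy as np
from math import comb
from scipy.special import betaln

def beta_binomial_pmf(x: int, n: int, a: float, b: float) -> float:
    """Marginal P(observed score = x) on an n-item test when true
    proportion-correct scores are Beta(a, b) and observed scores are
    binomial given the true score (Keats-Lord model)."""
    return comb(n, x) * float(np.exp(betaln(a + x, b + n - x) - betaln(a, b)))

# Assumed illustrative values: a 20-item test with Beta(6, 4) true scores.
n, a, b = 20, 6.0, 4.0
pmf = [beta_binomial_pmf(x, n, a, b) for x in range(n + 1)]
print(f"sum of probabilities: {sum(pmf):.6f}")   # sanity check: ~1
```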
Peer reviewed Peer reviewed
Livingston, Samuel A.; Wingersky, Marilyn A. – Journal of Educational Measurement, 1979
Procedures are described for studying the reliability of decisions based on specific passing scores with tests made up of discrete items and designed to measure continuous rather than categorical traits. These procedures are based on the estimation of the joint distribution of true scores and observed scores. (CTM)
Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement
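Given such a joint distribution, the decision-reliability question reduces to the chance that two parallel forms classify an examinee the same way. The sketch below computes that probability under an assumed beta-binomial true-score model with an arbitrary cut score; it is a generic illustration, not the authors' procedure.

```python
import numpy as np
from scipy.stats import beta, binom

# Assumed beta-binomial true-score model and cut score (illustrative).
n, cut = 20, 14        # test length and passing score
a, b = 6.0, 4.0        # Beta(a, b) distribution of true scores

# P(same decision on two parallel forms), averaged over true scores.
t = np.linspace(0.0, 1.0, 2001)         # grid of true proportion-correct
dt = t[1] - t[0]
p_pass = binom.sf(cut - 1, n, t)        # P(X >= cut | true score t)
agree = p_pass**2 + (1.0 - p_pass)**2   # both pass or both fail
consistency = float(np.sum(agree * beta.pdf(t, a, b)) * dt)
print(f"estimated decision consistency: {consistency:.3f}")
```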
Thompson, Bruce; Crowley, Susan – 1994
Most training programs in education and psychology focus on classical test theory techniques for assessing score dependability. This paper discusses generalizability theory and explores its concepts using a small heuristic data set. Generalizability theory subsumes and extends classical test score theory. It is able to estimate the magnitude of…
Descriptors: Analysis of Variance, Cutting Scores, Decision Making, Error of Measurement
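As a concrete taste of the generalizability analysis the paper walks through, the sketch below estimates variance components for a small, made-up persons-by-items data set via the usual expected-mean-square equations, then forms coefficients for relative and absolute decisions. The data are assumptions; the paper's own heuristic data set is not reproduced here.

```python
import numpy as np

# Assumed small heuristic data set: 5 persons x 4 items (not the
# paper's own data).
X = np.array([[7, 6, 7, 5],
              [9, 8, 9, 8],
              [4, 5, 3, 4],
              [6, 6, 7, 6],
              [8, 7, 9, 8]], dtype=float)
n_p, n_i = X.shape
grand = X.mean()

# Mean squares for the crossed persons x items design.
ms_p = n_i * np.sum((X.mean(axis=1) - grand) ** 2) / (n_p - 1)
ms_i = n_p * np.sum((X.mean(axis=0) - grand) ** 2) / (n_i - 1)
resid = X - X.mean(axis=1, keepdims=True) - X.mean(axis=0, keepdims=True) + grand
ms_res = np.sum(resid ** 2) / ((n_p - 1) * (n_i - 1))

# Variance components from the expected-mean-square equations.
var_p = max((ms_p - ms_res) / n_i, 0.0)   # universe-score (person) variance
var_i = max((ms_i - ms_res) / n_p, 0.0)   # item variance
var_pi = ms_res                           # person x item interaction + error

# Coefficients for relative and absolute decisions over n_i items.
g_rel = var_p / (var_p + var_pi / n_i)
phi = var_p / (var_p + (var_i + var_pi) / n_i)
print(f"E(rho^2) = {g_rel:.3f}, Phi = {phi:.3f}")
```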
Peer reviewed
Mellenbergh, Gideon J.; van der Linden, Wim J. – Applied Psychological Measurement, 1979
For six tests, coefficient delta as an index for internal optimality is computed. Internal optimality is defined as the magnitude of risk of the decision procedure with respect to the true score. Results are compared with an alternative index (coefficient kappa) for assessing the consistency of decisions. (Author/JKS)
Descriptors: Classification, Comparative Analysis, Decision Making, Error of Measurement
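Coefficient delta is the paper's proposal and is not reproduced here; the comparison index, coefficient kappa, is standard and easy to compute from a master/non-master decision table for two administrations, as in the sketch below. The table values are hypothetical.

```python
import numpy as np

def cohens_kappa(table) -> float:
    """Cohen's kappa for a square agreement table (rows: decision on
    administration 1; columns: decision on administration 2)."""
    t = np.asarray(table, dtype=float)
    n = t.sum()
    p_o = np.trace(t) / n                          # observed agreement
    p_e = (t.sum(axis=0) @ t.sum(axis=1)) / n**2   # chance agreement
    return (p_o - p_e) / (1.0 - p_e)

# Hypothetical master/non-master decisions from two administrations.
print(f"kappa = {cohens_kappa([[40, 5], [7, 48]]):.3f}")
```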
Peer reviewed
Brennan, Robert L.; Johnson, Eugene G. – Educational Measurement: Issues and Practice, 1995
The application of generalizability theory to reliability and error variance estimation for performance assessment scores is discussed. Decision makers concerned with performance assessment need to recognize the restrictions that limit generalizability, such as constraints that reduce the number of tasks possible, rater quality,…
Descriptors: Decision Making, Educational Assessment, Error of Measurement, Estimation (Mathematics)
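For a fully crossed persons x tasks x raters design, one common form of the generalizability coefficient with n_t tasks and n_r raters is the standard G-theory result below (not a formula quoted from the article):

```latex
% Generalizability coefficient for a crossed p x t x r design,
% with n_t tasks and n_r raters (standard G-theory result).
\mathrm{E}\rho^{2} =
  \frac{\sigma^{2}_{p}}
       {\sigma^{2}_{p}
        + \sigma^{2}_{pt}/n_{t}
        + \sigma^{2}_{pr}/n_{r}
        + \sigma^{2}_{ptr,e}/(n_{t}\,n_{r})}
```

The abstract's point is visible in the formula: when practical constraints cap n_t or n_r, the corresponding error terms shrink more slowly, limiting generalizability.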
Marshall, J. Laird – 1976
A summary is provided of: the rationale for questioning the applicability of classical reliability measures to criterion-referenced tests; an extension of the classical theory of true and error scores to incorporate a theory of dichotomous decisions; and a presentation of the mean split-half coefficient of agreement, a single-administration test index…
Descriptors: Career Development, Computer Programs, Criterion Referenced Tests, Decision Making
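The mean split-half coefficient of agreement is defined precisely in the paper; the sketch below is a plausible reconstruction under stated assumptions: items are randomly split in half, each half classifies examinees against a proportionally scaled cutoff, and the proportion of identical classifications is averaged over many random splits. The function name and data are invented for illustration.

```python
import numpy as np

def mean_split_half_agreement(X, cut, n_splits=200, seed=0):
    """Average, over random half-splits of the items, of the proportion
    of examinees classified the same way (master/non-master) by the two
    halves, each scored against a proportionally scaled cutoff."""
    rng = np.random.default_rng(seed)
    n_items = X.shape[1]
    half = n_items // 2
    half_cut = cut * half / n_items          # scale cutoff to half length
    agreements = []
    for _ in range(n_splits):
        perm = rng.permutation(n_items)
        d1 = X[:, perm[:half]].sum(axis=1) >= half_cut
        d2 = X[:, perm[half:half * 2]].sum(axis=1) >= half_cut
        agreements.append(np.mean(d1 == d2))
    return float(np.mean(agreements))

# Hypothetical 0/1 item-response matrix: 50 examinees x 20 items.
rng = np.random.default_rng(1)
X = (rng.random((50, 20)) < rng.uniform(0.3, 0.9, 50)[:, None]).astype(int)
print(f"mean split-half agreement: {mean_split_half_agreement(X, cut=12):.3f}")
```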
Haladyna, Tom – 1976
The existence of criterion-referenced (CR) measurement is questioned in this paper. Despite beliefs that differences exist between the two alternative forms of measurement, CR and norm-referenced (NR), an analysis of philosophical and psychological descriptions of measurement, as well as a growing number of empirical studies, reveals that the common…
Descriptors: Academic Standards, Achievement Tests, Career Development, Comparative Analysis
Fruen, Mary – NCME Measurement in Education, 1978
There are both strengths and weaknesses of using standardized test scores as a criterion for admission to institutions of higher education. The relative importance of scores is dependent on the institution's degree of selectivity. In general, decision processes and admissions criteria are not well defined. Advantages of test scores include: use of…
Descriptors: Admission Criteria, College Admission, College Entrance Examinations, Competitive Selection
Macpherson, Colin R.; Rowley, Glenn L. – 1986
Teacher-made mastery tests were administered to a classroom-sized sample to study their decision consistency. The decision consistency of criterion-referenced tests is usually defined in terms of the proportion of examinees who are classified in the same way after two test administrations. Single-administration estimates of decision consistency were…
Descriptors: Classroom Research, Comparative Testing, Criterion Referenced Tests, Cutting Scores
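One well-known family of single-administration estimates is Subkoviak's binomial approach; whether it is among the estimates the authors examined is not stated in the truncated abstract, so the sketch below is a generic illustration with assumed reliability, cut score, and data.

```python
import numpy as np
from scipy.stats import binom

def single_admin_consistency(scores, n_items, cut, reliability):
    """Subkoviak-style estimate: shrink each observed proportion toward
    the group mean using the reliability, then, under a binomial model,
    average each examinee's chance of identical pass/fail decisions on
    two parallel forms."""
    scores = np.asarray(scores, dtype=float)
    p_obs = scores / n_items
    p_true = reliability * p_obs + (1.0 - reliability) * p_obs.mean()
    p_pass = binom.sf(cut - 1, n_items, p_true)   # P(X >= cut) per person
    return float(np.mean(p_pass**2 + (1.0 - p_pass)**2))

# Hypothetical classroom-sized sample: 30 examinees, 25 items, cut of 18,
# assumed reliability 0.85 (none of these values come from the paper).
rng = np.random.default_rng(2)
scores = rng.binomial(25, rng.uniform(0.5, 0.95, 30))
est = single_admin_consistency(scores, 25, 18, 0.85)
print(f"estimated decision consistency: {est:.3f}")
```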