ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	11

Descriptor

Error of Measurement	43
Test Reliability	43
Test Theory	43
Mathematical Models	10
Test Validity	10
Item Analysis	7
Scores	7
Test Construction	7
Criterion Referenced Tests	6
Cutting Scores	6
Item Response Theory	6
Test Items	6
Evaluation Methods	5
Foreign Countries	5
Generalizability Theory	5
Mastery Tests	5
Mathematical Formulas	5
Statistical Analysis	5
Test Interpretation	5
Achievement Tests	4
Career Development	4
Correlation	4
Decision Making	4
Latent Trait Theory	4
Measurement	4
More ▼

Publication Type

Reports - Research	30
Journal Articles	27
Reports - Evaluative	9
Speeches/Meeting Papers	8
Opinion Papers	4
Reports - Descriptive	1

Education Level

Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2
Secondary Education	2
Adult Education	1
Elementary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1

Audience

Researchers

Location

United Kingdom (England)	2
Canada	1
Norway	1
United Kingdom (Great Britain)	1

Laws, Policies, & Programs

Assessments and Surveys

Expressive One Word Picture…

What Works Clearinghouse Rating

Showing 1 to 15 of 43 results Save | Export

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

A Simple Model to Determine the Efficient Duration of Exams

Peer reviewed

Direct link

Ellis, Jules L. – Educational and Psychological Measurement, 2021

This study develops a theoretical model for the costs of an exam as a function of its duration. Two kind of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…

Descriptors: Test Length, Models, Error of Measurement, Measurement

The Effect of Chance Success on Equalization Error in Test Equation Based on Classical Test Theory

Peer reviewed
PDF on ERIC

Download full text

Koçak, Duygu – International Journal of Progressive Education, 2020

The aim of this study was to determine the effect of chance success on test equalization. For this purpose, artificially generated 500 and 1000 sample size data sets were synchronized using linear equalization and equal percentage equalization methods. In the data which were produced as a simulative, a total of four cases were created with no…

Descriptors: Test Theory, Equated Scores, Error of Measurement, Sample Size

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed
PDF on ERIC

Download full text

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

Problems in Estimating Composite Reliability of "Unitised" Assessments

Peer reviewed

Direct link

Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013

This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…

Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability

Generalizability Theory and Classical Test Theory

Peer reviewed

Direct link

Brennan, Robert L. – Applied Measurement in Education, 2011

Broadly conceived, reliability involves quantifying the consistencies and inconsistencies in observed scores. Generalizability theory, or G theory, is particularly well suited to addressing such matters in that it enables an investigator to quantify and distinguish the sources of inconsistencies in observed scores that arise, or could arise, over…

Descriptors: Generalizability Theory, Test Theory, Test Reliability, Item Response Theory

A Control Systems Concept Inventory Test Design and Assessment

Peer reviewed

Direct link

Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012

Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…

Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education

The Reliability of Results from National Tests, Public Examinations, and Vocational Qualifications in England

Peer reviewed

Direct link

He, Qingping; Opposs, Dennis – Educational Research and Evaluation, 2012

National tests, public examinations, and vocational qualifications in England are used for a variety of purposes, including the certification of individual learners in different subject areas and the accountability of individual professionals and institutions. However, there has been ongoing debate about the reliability and validity of their…

Descriptors: Qualifications, Evidence, National Competency Tests, Foreign Countries

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Direct link

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Measurement Error and Changes in Personal Constructs.

Peer reviewed

Chambers, William V. – Social Behavior and Personality, 1985

Personal construct psychologists have suggested various psychological functions explain differences in the stability of constructs. Among these functions are constellatory and loose construction. This paper argues that measurement error is a more parsimonious explanation of the differences in construct stability reported in these studies. (Author)

Descriptors: Error of Measurement, Test Construction, Test Format, Test Reliability

A Review of the Beta-Binomial Model and Its Extensions.

Peer reviewed

Wilcox, Rand R. – Journal of Educational Statistics, 1981

Both the binomial and beta-binomial models are applied to various problems occurring in mental test theory. The paper reviews and critiques these models. The emphasis is on the extensions of the models that have been proposed in recent years, and that might not be familiar to many educators. (Author)

Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Test Reliability

On the Reliability of Categorically Scored Examinations

Peer reviewed

Direct link

Kupermintz, Haggai – Journal of Educational Measurement, 2004

A decision-theoretic approach to the question of reliability in categorically scored examinations is explored. The concepts of true scores and errors are discussed as they deviate from conventional psychometric definitions and measurement error in categorical scores is cast in terms of misclassifications. A reliability measure based on…

Descriptors: Test Reliability, Error of Measurement, Psychometrics, Test Theory

The Reliability of a Profile.

Peer reviewed

Yarnold, Paul R. – Educational and Psychological Measurement, 1984

Unreliable profiles impose the difficulty that ordinal and interval relations among the individual's scores become uncertain or unstable. A profile reliability coefficient is derived to estimate the relative expected extent of this ordinal and interval "inversion" for any profile of K measures. (Author/DWH)

Descriptors: Error of Measurement, Mathematical Models, Profiles, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3

Educational and Psychological…	4
Journal of Experimental…	4
Journal of Educational…	3
Journal of Educational…	2
Applied Measurement in…	1
Applied Psychological…	1
Dyslexia	1
ETS Research Report Series	1
Educational Research and…	1
IEEE Transactions on Education	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Practical Assessment,…	1
Psychometrika	1
Research Papers in Education	1
School Psychology Review	1
Social Behavior and…	1
More ▼

Zimmerman, Donald W.	4
Haladyna, Tom	3
Huynh, Huynh	3
Williams, Richard H.	3
Roid, Gale	2
Aksu, Gökhan	1
Altepeter, Tom	1
Belfry, M. Joan	1
Bichi, Ado Abdu	1
Bramley, Tom	1
Brennan, Robert L.	1
Bristow, M.	1
Cason, Gerald J.	1
Chambers, William V.	1
Cotton, Sue M.	1
Crewther, David P.	1
Crewther, Sheila G.	1
Crowley, Susan	1
Dhawan, Vikas	1
Ecob, Russell	1
Ellis, Jules L.	1
Elosua, Paula	1
Erkorkmaz, K.	1
Eser, Mehmet Taha	1
More ▼