ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	12

Descriptor

Error of Measurement	13
Foreign Countries	13
Test Theory	13
Item Response Theory	8
Test Reliability	5
Test Items	4
Achievement Tests	3
Computation	3
Difficulty Level	3
Comparative Analysis	2
Correlation	2
Educational Quality	2
Evaluation Research	2
Exit Examinations	2
Generalizability Theory	2
Mathematics Tests	2
Personality Measures	2
Psychometrics	2
Science Tests	2
Scores	2
Test Construction	2
Test Validity	2
Academic Achievement	1
Academic Standards	1
Academically Gifted	1
More ▼

Source

Applied Psychological…	2
EURASIA Journal of…	1
Educational Research and…	1
Educational and Psychological…	1
IEEE Transactions on Education	1
International Journal of…	1
International Journal of…	1
International Online Journal…	1
Online Submission	1
Practical Assessment,…	1
Research Papers in Education	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	8
Reports - Evaluative	4
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	4
Higher Education	3
Junior High Schools	3
Postsecondary Education	3
Elementary Education	2
Secondary Education	2
Adult Education	1
Grade 8	1
Middle Schools	1

Audience

Location

Australia	2
United Kingdom (England)	2
Canada	1
Germany	1
Norway	1
Philippines	1
South Korea (Seoul)	1
Taiwan	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

Eysenck Personality Inventory	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Comparison of Performance Measures Obtained from Foreign Language Tests According to Item Response Theory vs Classical Test Theory

Peer reviewed
PDF on ERIC

Download full text

Polat, Murat – International Online Journal of Education and Teaching, 2022

Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…

Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests

The Comparison of Item Parameters Estimated from Parametric and Nonparametric Item Response Theory Models in Case of the Violance of Local Independence Assumption

Peer reviewed
PDF on ERIC

Download full text

Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019

Item response theory (IRT) has so many advantages than its precedent Classical Test Theory (CTT) such as non-changing item parameters, ability parameter estimations free from the items. However, in order to get these advantages, some assumptions should be met and they are; unidimensionality, normality and local independence. However, it is not…

Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models

An Application of Multivariate Generalizability in Selection of Mathematically Gifted Students

Peer reviewed

Direct link

Kim, Sungyeun; Berebitsky, Dan – EURASIA Journal of Mathematics, Science & Technology Education, 2016

This study investigates error sources and the effects of each error source to determine optimal weights of the composite score of teacher recommendation letters and self-introduction letters using multivariate generalizability theory. Data were collected from the science education institute for the gifted attached to the university located within…

Descriptors: Academically Gifted, Foreign Countries, Mathematics, Mathematics Instruction

Problems in Estimating Composite Reliability of "Unitised" Assessments

Peer reviewed

Direct link

Bramley, Tom; Dhawan, Vikas – Research Papers in Education, 2013

This paper discusses the issues involved in calculating indices of composite reliability for "modular" or "unitised" assessments of the kind used in GCSEs, AS and A level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of…

Descriptors: Foreign Countries, Exit Examinations, Secondary Education, Test Reliability

Taking the Error Term of the Factor Model into Account: The Factor Score Predictor Interval

Peer reviewed

Direct link

Beauducel, Andre – Applied Psychological Measurement, 2013

The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…

Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement

A Control Systems Concept Inventory Test Design and Assessment

Peer reviewed

Direct link

Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012

Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…

Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education

The Reliability of Results from National Tests, Public Examinations, and Vocational Qualifications in England

Peer reviewed

Direct link

He, Qingping; Opposs, Dennis – Educational Research and Evaluation, 2012

National tests, public examinations, and vocational qualifications in England are used for a variety of purposes, including the certification of individual learners in different subject areas and the accountability of individual professionals and institutions. However, there has been ongoing debate about the reliability and validity of their…

Descriptors: Qualifications, Evidence, National Competency Tests, Foreign Countries

Quantifying Response Dependence between Two Dichotomous Items Using the Rasch Model

Peer reviewed

Direct link

Andrich, David; Kreiner, Svend – Applied Psychological Measurement, 2010

Models of modern test theory imply statistical independence among responses, generally referred to as "local independence." One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation as a process in the dichotomous Rasch model,…

Descriptors: Test Theory, Item Response Theory, Test Items, Correlation

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Direct link

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Demonstrating the Difference between Classical Test Theory and Item Response Theory Using Derived Test Data

Download full text

Magno, Carlo – Online Submission, 2009

The present report demonstrates the difference between classical test theory (CTT) and item response theory (IRT) approach using an actual test data for chemistry junior high school students. The CTT and IRT were compared across two samples and two forms of test on their item difficulty, internal consistency, and measurement errors. The specific…

Descriptors: Private Schools, Measurement, Error of Measurement, Foreign Countries

The Application of Bayesian Thinking to Educational Measurement Problems.

Thorndike, Robert L. – 1980

In an invitational address to the Victorian Institute of Educational Research, the author discussed Bayesian theory and its relationship to the design and construction of tailored or adaptive tests. Bayesian thinking involves recognizing the role of prior probabilities and using these probabilities in combination with new data to arrive at future…

Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Error of Measurement

Methods in Scaling the Basic Competence Test

Peer reviewed

Direct link

Chang, Shun-Wen – Educational and Psychological Measurement, 2006

This study evaluates the effects of employing the linear, normalizing, and arcsine transformation methods for constructing scale scores on the Basic Competence Test (BCTEST). Tests in three subject areas (Chinese, English, and Mathematics) were studied using the data of test administrations from 2001 to 2003. The resulting scale scores for each…

Descriptors: Standardized Tests, Achievement Tests, Test Theory, True Scores

Andrich, David	1
Beauducel, Andre	1
Berebitsky, Dan	1
Bramley, Tom	1
Bristow, M.	1
Chang, Shun-Wen	1
Dhawan, Vikas	1
Dirlik, Ezgi Mor	1
Elosua, Paula	1
Erkorkmaz, K.	1
He, Qingping	1
Huebner, Alan	1
Huissoon, J. P.	1
Iliescu, Dragos	1
Jeon, Soo	1
Kim, Sungyeun	1
Kreiner, Svend	1
Magno, Carlo	1
Opposs, Dennis	1
Owen, W. S.	1
Polat, Murat	1
Skar, Gustaf B.	1
Stubley, G. D.	1
Thorndike, Robert L.	1
Waslander, S. L.	1
More ▼