ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	11

Descriptor

Error of Measurement	22
Test Items	22
Test Theory	22
Mathematical Models	9
Item Response Theory	8
Correlation	7
Difficulty Level	6
Test Reliability	6
Item Analysis	5
Test Construction	5
Achievement Tests	4
Comparative Analysis	4
Foreign Countries	4
Scores	4
Statistical Analysis	4
Testing Problems	4
Career Development	3
Equated Scores	3
Factor Analysis	3
Goodness of Fit	3
Measurement Techniques	3
Models	3
Test Validity	3
Computation	2
Evaluation Criteria	2
More ▼

Source

Educational and Psychological…	3
Applied Psychological…	2
ETS Research Report Series	2
IEEE Transactions on Education	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Measurement:…	1
Online Submission	1
School Psychology Review	1
More ▼

Publication Type

Reports - Research	15
Journal Articles	14
Reports - Descriptive	4
Reports - Evaluative	4
Speeches/Meeting Papers	2

Education Level

Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2
Elementary Education	1
Grade 3	1
Junior High Schools	1

Audience

Researchers

Location

Australia	1
Canada	1
Philippines	1

Laws, Policies, & Programs

Assessments and Surveys

Expressive One Word Picture…	1
SAT (College Admission Test)	1
Students Evaluation of…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 22 results Save | Export

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed
PDF on ERIC

Download full text

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

The Comparison of Item Parameters Estimated from Parametric and Nonparametric Item Response Theory Models in Case of the Violance of Local Independence Assumption

Peer reviewed
PDF on ERIC

Download full text

Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019

Item response theory (IRT) has so many advantages than its precedent Classical Test Theory (CTT) such as non-changing item parameters, ability parameter estimations free from the items. However, in order to get these advantages, some assumptions should be met and they are; unidimensionality, normality and local independence. However, it is not…

Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Download full text

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

A Control Systems Concept Inventory Test Design and Assessment

Peer reviewed

Direct link

Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012

Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…

Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education

Quantifying Response Dependence between Two Dichotomous Items Using the Rasch Model

Peer reviewed

Direct link

Andrich, David; Kreiner, Svend – Applied Psychological Measurement, 2010

Models of modern test theory imply statistical independence among responses, generally referred to as "local independence." One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation as a process in the dichotomous Rasch model,…

Descriptors: Test Theory, Item Response Theory, Test Items, Correlation

On Bias in Linear Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010

The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…

Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends

Polytomous Differential Item Functioning and Violations of Ordering of the Expected Latent Trait by the Raw Score

Peer reviewed

Direct link

DeMars, Christine E. – Educational and Psychological Measurement, 2008

The graded response (GR) and generalized partial credit (GPC) models do not imply that examinees ordered by raw observed score will necessarily be ordered on the expected value of the latent trait (OEL). Factors were manipulated to assess whether increased violations of OEL also produced increased Type I error rates in differential item…

Descriptors: Test Items, Raw Scores, Test Theory, Error of Measurement

Theory of Test Translation Error

Peer reviewed

Direct link

Solano-Flores, Guillermo; Backhoff, Eduardo; Contreras-Nino, Luis Angel – International Journal of Testing, 2009

In this article, we present a theory of test translation whose intent is to provide the conceptual foundation for effective, systematic work in the process of test translation and test translation review. According to the theory, translation error is multidimensional; it is not simply the consequence of defective translation but an inevitable fact…

Descriptors: Test Items, Investigations, Semantics, Translation

Demonstrating the Difference between Classical Test Theory and Item Response Theory Using Derived Test Data

Download full text

Magno, Carlo – Online Submission, 2009

The present report demonstrates the difference between classical test theory (CTT) and item response theory (IRT) approach using an actual test data for chemistry junior high school students. The CTT and IRT were compared across two samples and two forms of test on their item difficulty, internal consistency, and measurement errors. The specific…

Descriptors: Private Schools, Measurement, Error of Measurement, Foreign Countries

Equating Error in Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Applied Psychological Measurement, 2006

Traditionally, error in equating observed scores on two versions of a test is defined as the difference between the transformations that equate the quantiles of their distributions in the sample and population of test takers. But it is argued that if the goal of equating is to adjust the scores of test takers on one version of the test to make…

Descriptors: Equated Scores, Evaluation Criteria, Models, Error of Measurement

When Can Subscores Have Value? Research Report. ETS RR-05-08

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2005

In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean-squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…

Descriptors: Scores, Test Items, Error of Measurement, Computation

Congeneric and (Essentially) Tau-Equivalent Estimates of Score Reliability: What They Are and How to Use Them

Peer reviewed

Direct link

Graham, James M. – Educational and Psychological Measurement, 2006

Coefficient alpha, the most commonly used estimate of internal consistency, is often considered a lower bound estimate of reliability, though the extent of its underestimation is not typically known. Many researchers are unaware that coefficient alpha is based on the essentially tau-equivalent measurement model. It is the violation of the…

Descriptors: Models, Test Theory, Reliability, Structural Equation Models

Reliability of Composite Measurements Based on the m Highest of n Equivalent Components.

Peer reviewed

Huynh, Huynh – Journal of Educational Statistics, 1986

Under the assumptions of classical measurement theory and the condition of normality, a formula is derived for the reliability of composite scores. The formula represents an extension of the Spearman-Brown formula to the case of truncated data. (Author/JAZ)

Descriptors: Computer Simulation, Error of Measurement, Expectancy Tables, Scoring Formulas

Some Relationships between the Binomial Error Model and Classical Test Theory.

Peer reviewed

Feldt, Leonard S. – Educational and Psychological Measurement, 1984

The binomial error model includes form-to-form difficulty differences as error variance and leads to Ruder-Richardson formula 21 as an estimate of reliability. If the form-to-form component is removed from the estimate of error variance, the binomial model leads to KR 20 as the reliability estimate. (Author/BW)

Descriptors: Achievement Tests, Difficulty Level, Error of Measurement, Mathematical Formulas

An Alternative Interpretation of Three Stability Models. Measurement and Methodology, Work Unit 2: Technical Adequacy of Tests.

Wilcox, Rand R. – 1978

Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…

Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models

Previous Page | Next Page »

Pages: 1 | 2

van der Linden, Wim J.	2
Altepeter, Tom	1
Andrich, David	1
Backhoff, Eduardo	1
Bichi, Ado Abdu	1
Borrello, Gloria M.	1
Bristow, M.	1
Contreras-Nino, Luis Angel	1
Curry, Allen R.	1
DeMars, Christine E.	1
Dirlik, Ezgi Mor	1
Erkorkmaz, K.	1
Feldt, Leonard S.	1
Graham, James M.	1
Haberman, Shelby J.	1
Haladyna, Tom	1
Hocevar, Dennis	1
Huissoon, J. P.	1
Huynh, Huynh	1
Jeon, Soo	1
Kreiner, Svend	1
Li, Feifei	1
Livingston, Samuel A.	1
Magno, Carlo	1
More ▼