ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	6

Descriptor

Methods	9
Test Theory	9
Educational Testing	3
Error of Measurement	3
Item Response Theory	3
Scores	3
Test Items	3
Comparative Analysis	2
Computation	2
Equated Scores	2
Generalizability Theory	2
Latent Trait Theory	2
Reliability	2
Test Length	2
Test Reliability	2
Academic Achievement	1
Academic Standards	1
Aptitude Tests	1
Classification	1
Cognitive Tests	1
College Entrance Examinations	1
Computer Software	1
Construct Validity	1
Criterion Referenced Tests	1
Data Analysis	1
More ▼

Source

ACT, Inc.	1
American School Board Journal	1
Applied Measurement in…	1
ETS Research Report Series	1
Educational Testing Service	1
Evaluation in Education:…	1
International Journal of…	1
Rehabilitation Counseling…	1

Publication Type

Journal Articles	6
Reports - Evaluative	5
Reports - Research	2
Opinion Papers	1
Reports - General	1
Speeches/Meeting Papers	1

Education Level

Grade 3	1
Higher Education	1
Postsecondary Education	1

Audience

Counselors

Location

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
General Aptitude Test Battery	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Download full text

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Download full text

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

Construction of Chained True Score Equipercentile Equatings under the Kernel Equating (KE) Framework and Their Relationship to Levine True Score Equating. Research Report. ETS RR-09-24

Download full text

Chen, Haiwen; Holland, Paul – Educational Testing Service, 2009

In this paper, we develop a new chained equipercentile equating procedure for the nonequivalent groups with anchor test (NEAT) design under the assumptions of the classical test theory model. This new equating is named chained true score equipercentile equating. We also apply the kernel equating framework to this equating design, resulting in a…

Descriptors: True Scores, Equated Scores, Test Theory, Methods

The Utility of Augmented Subscores in a Licensure Exam: An Evaluation of Methods Using Empirical Data

Peer reviewed

Direct link

Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – Applied Measurement in Education, 2010

Will subscores provide additional information than what is provided by the total score? Is there a method that can estimate more trustworthy subscores than observed subscores? To answer the first question, this study evaluated whether the true subscore was more accurately predicted by the observed subscore or total score. To answer the second…

Descriptors: Licensing Examinations (Professions), Scores, Computation, Methods

Correcting Fallacies in Validity, Reliability, and Classification

Peer reviewed

Direct link

Sijtsma, Klaas – International Journal of Testing, 2009

This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…

Descriptors: Construct Validity, Reliability, Classification, Test Theory

Modern Psychometric Methodology: Applications of Item Response Theory

Peer reviewed

Direct link

Reid, Christine A.; Kolakowsky-Hayner, Stephanie A.; Lewis, Allen N.; Armstrong, Amy J. – Rehabilitation Counseling Bulletin, 2007

Item response theory (IRT) methodology is introduced as a tool for improving assessment instruments used with people who have disabilities. Need for this approach in rehabilitation is emphasized; differences between IRT and classical test theory are clarified. Concepts essential to understanding IRT are defined, necessary data assumptions are…

Descriptors: Psychometrics, Methods, Item Response Theory, Aptitude Tests

Obtaining Some Degree of Correspondence Between Unequatable Scores: A Comparison of Item Response Theory and Equipercentile Equating Methods.

Yen, Wendy M. – 1982

Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…

Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods

Selecting Items for Criterion-Referenced Tests.

Mellenbergh, Gideon J.; van der Linden, Wim J. – Evaluation in Education: International Progress, 1982

Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)

Descriptors: Criterion Referenced Tests, Educational Testing, Item Analysis, Latent Trait Theory

Trouble with Testing: Why Standards-based Assessment Doesn't Measure Up.

Popham, W. James – American School Board Journal, 2003

Claims that standards-based tests neither measure skills and knowledge accurately nor help educators do a better instructional job. The article offers suggestions in four areas to make the tests contribute to improved instruction: measurement of content standards; descriptions of standards; standard-by-standard reporting; and locally administered…

Descriptors: Academic Achievement, Academic Standards, Cognitive Tests, Educational Testing

Armstrong, Amy J.	1
Chen, Haiwen	1
Cui, Zhongmin	1
Fang, Yu	1
Haberman, Shelby	1
Holland, Paul	1
Kolakowsky-Hayner, Stephanie…	1
Larkin, Kevin	1
Lewis, Allen N.	1
Li, Feifei	1
Mellenbergh, Gideon J.	1
Popham, W. James	1
Puhan, Gautam	1
Reid, Christine A.	1
Sijtsma, Klaas	1
Sinharay, Sandip	1
Traynor, Anne	1
Woodruff, David	1
Yen, Wendy M.	1
van der Linden, Wim J.	1
More ▼