Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 6 |
Descriptor
Methods | 9 |
Test Theory | 9 |
Educational Testing | 3 |
Error of Measurement | 3 |
Item Response Theory | 3 |
Scores | 3 |
Test Items | 3 |
Comparative Analysis | 2 |
Computation | 2 |
Equated Scores | 2 |
Generalizability Theory | 2 |
More ▼ |
Source
ACT, Inc. | 1 |
American School Board Journal | 1 |
Applied Measurement in… | 1 |
ETS Research Report Series | 1 |
Educational Testing Service | 1 |
Evaluation in Education:… | 1 |
International Journal of… | 1 |
Rehabilitation Counseling… | 1 |
Author
Armstrong, Amy J. | 1 |
Chen, Haiwen | 1 |
Cui, Zhongmin | 1 |
Fang, Yu | 1 |
Haberman, Shelby | 1 |
Holland, Paul | 1 |
Kolakowsky-Hayner, Stephanie… | 1 |
Larkin, Kevin | 1 |
Lewis, Allen N. | 1 |
Li, Feifei | 1 |
Mellenbergh, Gideon J. | 1 |
More ▼ |
Publication Type
Journal Articles | 6 |
Reports - Evaluative | 5 |
Reports - Research | 2 |
Opinion Papers | 1 |
Reports - General | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Grade 3 | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Counselors | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
ACT Assessment | 1 |
General Aptitude Test Battery | 1 |
What Works Clearinghouse Rating
Li, Feifei – ETS Research Report Series, 2017
An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…
Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement
Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013
Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…
Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling
Chen, Haiwen; Holland, Paul – Educational Testing Service, 2009
In this paper, we develop a new chained equipercentile equating procedure for the nonequivalent groups with anchor test (NEAT) design under the assumptions of the classical test theory model. This new equating is named chained true score equipercentile equating. We also apply the kernel equating framework to this equating design, resulting in a…
Descriptors: True Scores, Equated Scores, Test Theory, Methods
Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – Applied Measurement in Education, 2010
Will subscores provide additional information than what is provided by the total score? Is there a method that can estimate more trustworthy subscores than observed subscores? To answer the first question, this study evaluated whether the true subscore was more accurately predicted by the observed subscore or total score. To answer the second…
Descriptors: Licensing Examinations (Professions), Scores, Computation, Methods
Sijtsma, Klaas – International Journal of Testing, 2009
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Descriptors: Construct Validity, Reliability, Classification, Test Theory
Reid, Christine A.; Kolakowsky-Hayner, Stephanie A.; Lewis, Allen N.; Armstrong, Amy J. – Rehabilitation Counseling Bulletin, 2007
Item response theory (IRT) methodology is introduced as a tool for improving assessment instruments used with people who have disabilities. Need for this approach in rehabilitation is emphasized; differences between IRT and classical test theory are clarified. Concepts essential to understanding IRT are defined, necessary data assumptions are…
Descriptors: Psychometrics, Methods, Item Response Theory, Aptitude Tests
Yen, Wendy M. – 1982
Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…
Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods
Mellenbergh, Gideon J.; van der Linden, Wim J. – Evaluation in Education: International Progress, 1982
Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)
Descriptors: Criterion Referenced Tests, Educational Testing, Item Analysis, Latent Trait Theory
Popham, W. James – American School Board Journal, 2003
Claims that standards-based tests neither measure skills and knowledge accurately nor help educators do a better instructional job. The article offers suggestions in four areas to make the tests contribute to improved instruction: measurement of content standards; descriptions of standards; standard-by-standard reporting; and locally administered…
Descriptors: Academic Achievement, Academic Standards, Cognitive Tests, Educational Testing