ERIC - Search Results

Showing 1 to 15 of 33 results

Machine Learning and Small Data

Peer reviewed

Cui, Zhongmin – Educational Measurement: Issues and Practice, 2021

Commonly used machine learning applications seem to relate to big data. This article provides a gentle review of machine learning and shows why machine learning can be applied to small data too. An example of applying machine learning to screen irregularity reports is presented. In the example, the support vector machine and multinomial naïve…

Descriptors: Artificial Intelligence, Man Machine Systems, Data, Bayesian Statistics

Interpretation of the Translated WISC-V: Caveat Venditor and Caveat Emptor

Peer reviewed

Kettler, Ryan J. – School Psychology International, 2020

This article is a commentary on McGill et al.'s (2020) article "Use of Translated and Adapted Versions of the WISC-V: Caveat Emptor." McGill et al. use caveat emptor in their title to indicate that the buyer of an assessment must be careful about the product being purchased, presumably because the seller of the assessment is not being…

Descriptors: Children, Intelligence Tests, Translation, Test Reliability

On Attempting to Do What Lord Said Was Impossible: Commentary on van der Linden's "Some Conceptual Issues in Observed-Score Equating"

Peer reviewed

Dorans, Neil J. – Journal of Educational Measurement, 2013

van der Linden (this issue) uses words differently than Holland and Dorans. This difference in language usage is a source of some confusion in van der Linden's critique of what he calls equipercentile equating. I address these differences in language. van der Linden maintains that there are only two requirements for score equating. I maintain…

Descriptors: Equated Scores, Language Usage, Statistical Distributions

Adapting Accountability Systems to the Limitations of Educational Measurement

Peer reviewed

Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2015

Michael Kane writes in this article that he is in more or less complete agreement with Professor Koretz's characterization of the problem outlined in the paper published in this issue of "Measurement." Kane agrees that current testing practices are not adequate for test-based accountability (TBA) systems, but he writes that he is far…

Descriptors: Educational Testing, Accountability, Standardized Tests, Equated Scores

Comments on van der Linden's Critique and Proposal for Equating

Peer reviewed

Holland, Paul W. – Journal of Educational Measurement, 2013

While agreeing with van der Linden (this issue) that test equating needs better theoretical underpinnings, my comments criticize several aspects of his article. His examples are, for the most part, worthless; he does not use well-established terminology correctly; his view of 100 years of attempts to give a theoretical basis for equating is…

Descriptors: Equated Scores, Test Theory, Transformations (Mathematics), Computation

Comments on "Some Conceptual Issues in Observed-Score Equating" by Wim J. van der Linden

Peer reviewed

Bradlow, Eric T. – Journal of Educational Measurement, 2013

The van der Linden article (this issue) provides a roadmap for future research in equating. My belief is that the roadmap begins and ends with collecting auxiliary data that can be utilized to provide improved equating, especially when data are sparse or equating beyond simple moments is desired.

Descriptors: Equated Scores, Data Collection, Statistical Analysis, Research

Testing for Accountability: A Balancing Act That Challenges Current Testing Practices and Theories

Peer reviewed

Brennan, Robert L. – Measurement: Interdisciplinary Research and Perspectives, 2015

Koretz, in his article published in this issue, provides compelling arguments that the high stakes currently associated with accountability testing lead to behavioral changes in students, teachers, and other stakeholders that often have negative consequences, such as inflated scores. Koretz goes on to argue that these negative consequences require…

Descriptors: Accountability, High Stakes Tests, Behavior Change, Student Behavior

The Epidemiology of Modern Test Score Use: Anticipating Aggregation, Adjustment, and Equating

Peer reviewed

Ho, Andrew – Measurement: Interdisciplinary Research and Perspectives, 2013

In his thoughtful focus article, Haertel (this issue) pushes testing experts to broaden the scope of their validation efforts and to invite scholars from other disciplines to join them. He credits existing validation frameworks for helping the measurement community to identify incomplete or nonexistent validity arguments. However, he notes his…

Descriptors: Educational Testing, Scores, Test Use, Test Validity

Comments on Neil Dorans's NCME Career Award Address: The Contestant Perspective on Taking Tests--Emanations from the Statue within

Peer reviewed

Mislevy, Robert J. – Educational Measurement: Issues and Practice, 2012

This article presents the author's observations on Neil Dorans's NCME Career Award Address: "The Contestant Perspective on Taking Tests: Emanations from the Statue within." He calls attention to some points that Dr. Dorans made in his address, and offers his thoughts in response.

Descriptors: Testing, Test Reliability, Psychometrics, Scores

Assumptions about True-Scores and Populations in Equating

Peer reviewed

Brennan, Robert L. – Measurement: Interdisciplinary Research and Perspectives, 2010

This excellent set of papers is comprehensive and very well written. The Kane et al. paper lays out the theory for linear equating with the NEAT design using a clever but simple framework. The Suh et al. paper is an excellent empirical study of the various methods. The Mroch et al. paper provides an insightful evaluation of the methods as…

Descriptors: Equated Scores, Evaluation Methods, Psychometrics, Models

Linear Equating for the NEAT Design: A Rejoinder and Some Further Comments

Peer reviewed

Kane, Michael T.; Mroch, Andrew A.; Suh, Youngsuk; Ripkey, Douglas R. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the authors' rejoinder to commentaries on linear equating and the NEAT design. The authors appreciate the insightful work of the commentary writers. Each has made a number of interesting points, many of which the authors had not considered at all. Before responding to some of those points, the authors reiterate what they see…

Descriptors: Weighted Scores, Equated Scores, Models, Scores

A Single Population Litmus Test for Linear Scale Alignment Methods: Commentary on Kane, Mroch, Suh, and Ripkey

Peer reviewed

Dorans, Neil J. – Measurement: Interdisciplinary Research and Perspectives, 2010

Kane, Mroch, Suh, and Ripkey (2009) describe what they call five linear equating methods for the nonequivalent groups with anchor test (NEAT) design. The authors embed these methods within a two-dimensional framework. The first dimension contrasts what the authors call a parameter substitution (PS) approach with what they call a chained linear…

Descriptors: Measures (Individuals), Equated Scores, Item Response Theory, Predictor Variables

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
