ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	15

Descriptor

Comparative Analysis	20
Educational Testing	20
Psychometrics	20
Measurement Techniques	11
Foreign Countries	10
Educational Assessment	9
Equated Scores	9
Evaluation Methods	9
Test Interpretation	8
Test Construction	7
Definitions	6
High Stakes Tests	6
Predictive Measurement	6
Test Use	6
Test Items	5
Testing Problems	5
Classification	4
Computer Assisted Testing	4
Educational Policy	4
Evaluation Criteria	4
Item Response Theory	4
Measurement	4
Scaling	4
Statistical Analysis	4
Test Theory	4
More ▼

Source

Measurement:…	5
ProQuest LLC	3
Research Papers in Education	2
Assessing Writing	1
Design and Technology…	1
Educational Research and…	1
Journal of Applied Testing…	1
Journal of Educational…	1
Learning Disabilities…	1
Ministerial Council on…	1
Psychometrika	1
More ▼

Publication Type

Journal Articles	14
Opinion Papers	6
Reports - Research	5
Reports - Evaluative	4
Dissertations/Theses -…	3
Information Analyses	2
Numerical/Quantitative Data	1
Reports - Descriptive	1

Education Level

Elementary Secondary Education	13
Elementary Education	2
Grade 8	2
Grade 6	1
Higher Education	1
Postsecondary Education	1

Audience

Location

United Kingdom	5
United Kingdom (England)	3
United States	3
Australia	2
United Kingdom (Wales)	2
Taiwan	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Advanced Placement…	2
SAT (College Admission Test)	2
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

A Multicomponent Latent Trait Model for Diagnosis

Peer reviewed

Direct link

Embretson, Susan E.; Yang, Xiangdong – Psychometrika, 2013

This paper presents a noncompensatory latent trait model, the multicomponent latent trait model for diagnosis (MLTM-D), for cognitive diagnosis. In MLTM-D, a hierarchical relationship between components and attributes is specified to be applicable to permit diagnosis at two levels. MLTM-D is a generalization of the multicomponent latent trait…

Descriptors: Mathematics Achievement, Achievement Tests, Item Response Theory, Measurement

Large-Scale Assessment, Locally-Developed Measures, and Automated Scoring of Essays: Fishing for Red Herrings?

Peer reviewed

Direct link

Condon, William – Assessing Writing, 2013

Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…

Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing

A Comparison of Equating/Linking Using the Stocking-Lord Method and Concurrent Calibration with Mixed-Format Tests in the Non-Equivalent Groups Common-Item Design under IRT

Direct link

Tian, Feng – ProQuest LLC, 2011

There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…

Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis

Random or Fixed Testlet Effects: A Comparison of Two Multilevel Testlet Models

Direct link

Chen, Tzu-An – ProQuest LLC, 2010

This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…

Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models

Group Comparisons of Mathematics Performance from a Cognitive Diagnostic Perspective

Peer reviewed

Direct link

Chen, Yi-Hsin; Ferron, John M.; Thompson, Marilyn S.; Gorin, Joanna S.; Tatsuoka, Kikumi K. – Educational Research and Evaluation, 2010

Traditional comparisons of test score means identify group differences in broad academic areas, but fail to provide substantive description of how the groups differ on the specific cognitive attributes required for success in the academic area. The rule space method (RSM) allows for group comparisons at the cognitive attribute level, which…

Descriptors: Foreign Countries, Academic Achievement, Probability, Algebra

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Evaluating the Rank-Ordering Method for Standard Maintaining

Peer reviewed

Direct link

Bramley, Tom; Gill, Tim – Research Papers in Education, 2010

The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…

Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries

A Psychometric Evaluation of a State Testing Program: Accommodated versus Non-Accommodated Students

Direct link

Roxbury, Tiese L. – ProQuest LLC, 2010

Federal legislation such as "No Child Left Behind" mandated that students with disabilities be included in accountability standards, creating an important responsibility to fairly assess all students, even those with disabilities. Consequently, a sense of urgency was placed on the entire educational system to ensure that these students…

Descriptors: Test Items, Testing Programs, Federal Legislation, Educational Testing

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Direct link

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

Contrasting Conceptions of Comparability

Peer reviewed

Direct link

Newton, Paul E. – Research Papers in Education, 2010

Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…

Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

An Investigation into the Use of Cognitive Ability Tests in the Identification of Gifted Students in Design and Technology

Peer reviewed
PDF on ERIC

Download full text

Twissell, Adrian – Design and Technology Education, 2011

This study examines whether MidYIS and YELLIS cognitive ability tests (CATs) are appropriate methods for the identification of giftedness in Design and Technology. A key rationale for the study was whether CATs and able to identify those students with the aptitudes considered of importance to identifying giftedness in Design and Technology and…

Descriptors: Foreign Countries, Gifted, Identification, Cognitive Ability

Evaluating Comparability in Computerized Adaptive Testing: Issues, Criteria and an Example.

Peer reviewed

Wang, Tianyou; Kolen, Michael J. – Journal of Educational Measurement, 2001

Reviews research literature on comparability issues in computerized adaptive testing (CAT) and synthesizes issues specific to comparability and test security. Develops a framework for evaluating comparability that contains three categories of criteria: (1) validity; (2) psychometric property/reliability; and (3) statistical assumption/test…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Criteria

Previous Page | Next Page »

Pages: 1 | 2

Newton, Paul E.	2
Algozzine, Bob	1
Baird, Jo-Anne	1
Bramley, Tom	1
Chen, Tzu-An	1
Chen, Yi-Hsin	1
Condon, William	1
Cresswell, Mike	1
Donovan, Jenny	1
Embretson, Susan E.	1
Ferron, John M.	1
Gill, Tim	1
Gorin, Joanna S.	1
Harvey, Anne L.	1
Hutton, Penny	1
Kolen, Michael J.	1
Lennon, Melissa	1
Luecht, Richard M.	1
Mazzeo, John	1
Oakland, Thomas	1
Roxbury, Tiese L.	1
Tatsuoka, Kikumi K.	1
Tebeleff, Michael	1
Thompson, Marilyn S.	1
More ▼