ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	21

Publication Type

Reports - Evaluative	42
Journal Articles	32
Speeches/Meeting Papers	5
Book/Product Reviews	3
Opinion Papers	3
Guides - Non-Classroom	1
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Higher Education	5
Postsecondary Education	4
Elementary Education	1
Elementary Secondary Education	1
Kindergarten	1

Audience

Practitioners	1
Researchers	1

Location

Canada	1
Colorado	1
Florida	1
Luxembourg	1
New York	1
Sweden	1
United Kingdom (England)	1
Utah	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

ACT Assessment	1
Armed Services Vocational…	1
Stanford Achievement Tests	1
Wisconsin Card Sorting Test	1

What Works Clearinghouse Rating

Showing 1 to 15 of 42 results Save | Export

A Design for Comparing CTT and IRT in Test Assembly, Scoring and Argumentation: Differences among Reliability, Information and Validation

Peer reviewed

Direct link

Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019

This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…

Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring

The Controversy of Consequences

Peer reviewed

Direct link

Twing, Jon S. – Assessment in Education: Principles, Policy & Practice, 2016

This special issue of "Assessment in Education" contains the type of debate needed about what Cizek (2015) calls a "… lingering flaw in the concept of validity…." Some practitioners might not agree that the current theory of validation is flawed. Specifically, the debate Jon Twing is referencing concerns the role of the…

Descriptors: Test Validity, Misconceptions, Evidence, Scores

A Note on Assessing the Added Value of Subscores

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2014

Brennan (Brennan, R. L., 2012) noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman (Haberman, S. J., 2008) suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. According to this…

Descriptors: Scores, Test Theory, Test Interpretation

Analysis of Added Value of Subscores with Respect to Classification

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2014

Brennan noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. One way to interpret the method is that a subscore has added value…

Descriptors: Scores, Test Theory, Classification, Cutting Scores

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

Download full text

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu – ACT, Inc., 2013

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

Descriptors: Comparative Analysis, Error of Measurement, Scores, Scaling

Florida Center for Reading Research (FCRR) Reading Assessment (FRA): Kindergarten to Grade 2. Technical Manual

Download full text

Foorman, Barbara R.; Petscher, Yaacov; Schatschneider, Chris – Florida Center for Reading Research, 2015

The grades K-2 Florida Center for Reading Research (FCRR) Reading Assessment (FRA) consists of computer-adaptive alphabetic and oral language screening tasks that provide a Probability of Literacy Success (PLS) linked to grade-level performance (i.e., the 40th percentile) on the word reading (in kindergarten) or reading comprehension (in grades…

Descriptors: Reading Instruction, Reading Tests, Kindergarten, Grade 1

Classification Accuracy in Key Stage 2 National Curriculum Tests in England

Peer reviewed

Direct link

He, Qingping; Hayes, Malcolm; Wiliam, Dylan – Research Papers in Education, 2013

The accuracy of the results of the national tests in English, mathematics and science taken by 11-year olds in England has been a matter of much debate since their introduction in 1994, with estimates of the proportion of students incorrectly classified varying from 10 to 30%. Using live data from the 2009 and 2010 administration of the national…

Descriptors: Foreign Countries, National Curriculum, Accuracy, Classification

A Study of General Education Astronomy Students' Understandings of Cosmology. Part I. Development and Validation of Four Conceptual Cosmology Surveys

Peer reviewed

Direct link

Wallace, Colin S.; Prather, Edward E.; Duncan, Douglas K. – Astronomy Education Review, 2011

This is the first in a series of five articles describing a national study of general education astronomy students' conceptual and reasoning difficulties with cosmology. In this paper, we describe the process by which we designed four new surveys to assess general education astronomy students' conceptual cosmology knowledge. These surveys focused…

Descriptors: General Education, Astronomy, Surveys, Evolution

The Utility of Augmented Subscores in a Licensure Exam: An Evaluation of Methods Using Empirical Data

Peer reviewed

Direct link

Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – Applied Measurement in Education, 2010

Will subscores provide additional information than what is provided by the total score? Is there a method that can estimate more trustworthy subscores than observed subscores? To answer the first question, this study evaluated whether the true subscore was more accurately predicted by the observed subscore or total score. To answer the second…

Descriptors: Licensing Examinations (Professions), Scores, Computation, Methods

A Study of General Education Astronomy Students' Understandings of Cosmology. Part II. Evaluating Four Conceptual Cosmology Surveys: A Classical Test Theory Approach

Peer reviewed

Direct link

Wallace, Colin S.; Prather, Edward E.; Duncan, Douglas K. – Astronomy Education Review, 2011

This is the second of five papers detailing our national study of general education astronomy students' conceptual and reasoning difficulties with cosmology. This article begins our quantitative investigation of the data. We describe how we scored students' responses to four conceptual cosmology surveys, and we present evidence for the inter-rater…

Descriptors: Astronomy, Scientific Concepts, College Students, Introductory Courses

A Control Systems Concept Inventory Test Design and Assessment

Peer reviewed

Direct link

Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012

Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…

Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education

Test Anxiety and the Validity of Cognitive Tests: A Confirmatory Factor Analysis Perspective and Some Empirical Findings

Peer reviewed

Direct link

Wicherts, Jelte M.; Scholten, Annemarie Zand – Intelligence, 2010

The validity of cognitive ability tests is often interpreted solely as a function of the cognitive abilities that these tests are supposed to measure, but other factors may be at play. The effects of test anxiety on the criterion related validity (CRV) of tests was the topic of a recent study by Reeve, Heggestad, and Lievens (2009) (Reeve, C. L.,…

Descriptors: Familiarity, Test Validity, Cognitive Tests, Factor Analysis

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Direct link

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

Validity and the Consequences of Test Interpretation and Use

Peer reviewed

Direct link

Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011

The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…

Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation

Incomplete Psychometric Equivalence of Scores Obtained on the Manual and the Computer Version of the Wisconsin Card Sorting Test?

Peer reviewed

Direct link

Steinmetz, Jean-Paul; Brunner, Martin; Loarer, Even; Houssemand, Claude – Psychological Assessment, 2010

The Wisconsin Card Sorting Test (WCST) assesses executive and frontal lobe function and can be administered manually or by computer. Despite the widespread application of the 2 versions, the psychometric equivalence of their scores has rarely been evaluated and only a limited set of criteria has been considered. The present experimental study (N =…

Descriptors: Computer Assisted Testing, Psychometrics, Test Theory, Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3

Applied Psychological…	6
Educational Measurement:…	5
Educational and Psychological…	4
Astronomy Education Review	2
ACT, Inc.	1
Applied Measurement in…	1
Assessment in Education:…	1
Florida Center for Reading…	1
IEEE Transactions on Education	1
Intelligence	1
Journal of Educational…	1
Journal on Educational…	1
Language, Speech, and Hearing…	1
Multivariate Behavioral…	1
National Center for Analysis…	1
Practical Assessment,…	1
Psychological Assessment	1
Psychological Review	1
Remedial and Special…	1
Research Papers in Education	1
Review of Research in…	1
Social Indicators Research	1
More ▼

Scores	42
Test Theory	42
Reliability	11
Error of Measurement	10
Test Reliability	9
Item Response Theory	8
Test Validity	8
Correlation	7
Measurement Techniques	7
Models	7
Psychometrics	7
Test Interpretation	7
Estimation (Mathematics)	5
Foreign Countries	5
Mathematical Models	5
Statistical Analysis	5
Validity	5
Achievement Gains	4
Comparative Analysis	4
Decision Making	4
Educational Testing	4
Elementary Secondary Education	4
Evaluation Methods	4
Test Construction	4
Test Items	4
More ▼

Sinharay, Sandip	4
Duncan, Douglas K.	2
Haberman, Shelby	2
Prather, Edward E.	2
Puhan, Gautam	2
Thompson, Bruce	2
Wallace, Colin S.	2
Alqarni, Abdulelah Mohammed	1
Biswas, Ajoy Kumar	1
Boyd, Donald	1
Brennan, Robert L.	1
Bristow, M.	1
Brunner, Martin	1
Cahan, Sorel	1
Collins, Linda M.	1
Crowley, Susan	1
Cui, Zhongmin	1
Drasgow, Fritz	1
Embretson, Susan E.	1
Erkorkmaz, K.	1
Fang, Yu	1
Ferrando, Pere J.	1
Foorman, Barbara R.	1
Graham, James M.	1
More ▼