ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	9

Source

Applied Measurement in…

Publication Type

Journal Articles	37
Reports - Evaluative	37
Information Analyses	5
Guides - Non-Classroom	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
High Schools	1
Middle Schools	1

Audience

Location

Arizona	1
Massachusetts	1
North Carolina	1
United States	1
Virginia	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	1
Self Description Questionnaire	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

Some Methods and Evaluation for Linking and Equating with Small Samples

Peer reviewed

Direct link

Peabody, Michael R. – Applied Measurement in Education, 2020

The purpose of the current article is to introduce the equating and evaluation methods used in this special issue. Although a comprehensive review of all existing models and methodologies would be impractical given the format, a brief introduction to some of the more popular models will be provided. A brief discussion of the conditions required…

Descriptors: Evaluation Methods, Equated Scores, Sample Size, Item Response Theory

Where Are We Now? Learning Progressions and Formative Assessment

Peer reviewed

Direct link

Gotwals, Amelia Wenk – Applied Measurement in Education, 2018

In this commentary, I consider the three empirical studies in this special issue based on two main aspects: (a) the nature of the learning progressions and (b) what formative assessment practice(s) were investigated. Specifically, I describe differences among the learning progressions in terms of scope and grain size. I also identify three…

Descriptors: Skill Development, Behavioral Objectives, Formative Evaluation, Evaluation Methods

In Search of Validity Evidence in Support of the Interpretation and Use of Assessments of Complex Constructs: Discussion of Research on Assessing 21st Century Skills

Peer reviewed

Direct link

Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016

Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…

Descriptors: Evaluation Methods, Test Construction, Design, Scaling

Practical Application of a Synthetic Linking Function on Small-Sample Equating

Peer reviewed

Direct link

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011

The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…

Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis

Two Approaches for Identifying Low-Motivated Students in a Low-Stakes Assessment Context

Peer reviewed

Direct link

Swerdzewski, Peter J.; Harmes, J. Christine; Finney, Sara J. – Applied Measurement in Education, 2011

Many universities rely on data gathered from tests that are low stakes for examinees but high stakes for the various programs being assessed. Given the lack of consequences associated with many collegiate assessments, the construct-irrelevant variance introduced by unmotivated students is potentially a serious threat to the validity of the…

Descriptors: Computer Assisted Testing, Student Motivation, Inferences, Universities

Criterion-Focused Approach to Reducing Adverse Impact in College Admissions

Peer reviewed

Direct link

Sinha, Ruchi; Oswald, Frederick; Imus, Anna; Schmitt, Neal – Applied Measurement in Education, 2011

The current study examines how using a multidimensional battery of predictors (high-school grade point average (GPA), SAT/ACT, and biodata), and weighting the predictors based on the different values institutions place on various student performance dimensions (college GPA, organizational citizenship behaviors (OCBs), and behaviorally anchored…

Descriptors: Grade Point Average, Interrater Reliability, Rating Scales, College Admission

Validating Measurement of Knowledge Integration in Science Using Multiple-Choice and Explanation Items

Peer reviewed

Direct link

Lee, Hee-Sun; Liu, Ou Lydia; Linn, Marcia C. – Applied Measurement in Education, 2011

This study explores measurement of a construct called knowledge integration in science using multiple-choice and explanation items. We use construct and instructional validity evidence to examine the role multiple-choice and explanation items plays in measuring students' knowledge integration ability. For construct validity, we analyze item…

Descriptors: Knowledge Level, Construct Validity, Validity, Scaffolding (Teaching Technique)

Detecting and Correcting Scale Drift in Test Equating: An Illustration from a Large Scale Testing Program

Peer reviewed

Direct link

Puhan, Gautam – Applied Measurement in Education, 2009

The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…

Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory

Creating IRT-Based Parallel Test Forms Using the Genetic Algorithm Method

Peer reviewed

Direct link

Sun, Koun-Tem; Chen, Yu-Jen; Tsai, Shu-Yen; Cheng, Chien-Fen – Applied Measurement in Education, 2008

In educational measurement, the construction of parallel test forms is often a combinatorial optimization problem that involves the time-consuming selection of items to construct tests having approximately the same test information functions (TIFs) and constraints. This article proposes a novel method, genetic algorithm (GA), to construct parallel…

Descriptors: Test Format, Measurement Techniques, Equations (Mathematics), Item Response Theory

Multitrait-Multimethod Analyses: Inferring Each Trait-Method Combination with Multiple Indicators.

Peer reviewed

Marsh, Herbert W. – Applied Measurement in Education, 1993

Approaches to the evaluation of multitrait-multimethod data in which there are multiple indicators of each trait-method combination are explored. A first-order factor defined by multiple indicators is posited for each trait-method combination, and method and trait factors are posited as second-order factors. (SLD)

Descriptors: Correlation, Evaluation Methods, Multitrait Multimethod Techniques, Scores

A Refined Item Digraph Analysis of a Proportional Reasoning Test.

Peer reviewed

Bart, William M.; Williams-Morris, Ruth – Applied Measurement in Education, 1990

Refined item digraph analysis (RIDA) is a way of studying diagnostic and prescriptive testing. It permits assessment of a test item's diagnostic value by examining the extent to which the item has properties of ideal items. RIDA is illustrated with the Orange Juice Test, which assesses the proportionality concept. (TJH)

Descriptors: Diagnostic Tests, Evaluation Methods, Item Analysis, Mathematical Models

Automated Tools for Subject Matter Expert Evaluation of Automated Scoring

Peer reviewed

Direct link

Williamson, David M.; Bejar, Isaac I.; Sax, Anne – Applied Measurement in Education, 2004

As automated scoring of complex constructed-response examinations reaches operational status, the process of evaluating the quality of resultant scores, particularly in contrast to scores of expert human graders, becomes as complex as the data itself. Using a vignette from the Architectural Registration Examination (ARE), this article explores the…

Descriptors: Validity, Scoring, Scores, Evaluation Methods

Bayesian or Non-Bayesian: A Comparison Study of Item Parameter Estimation in the Three-Parameter Logistic Model

Peer reviewed

Direct link

Gao, Furong; Chen, Lisue – Applied Measurement in Education, 2005

Through a large-scale simulation study, this article compares item parameter estimates obtained by the marginal maximum likelihood estimation (MMLE) and marginal Bayes modal estimation (MBME) procedures in the 3-parameter logistic model. The impact of different prior specifications on the MBME estimates is also investigated using carefully…

Descriptors: Simulation, Computation, Bayesian Statistics, Item Analysis

Performance of SIBTEST When the Percentage of DIF Items Is Large

Peer reviewed

Direct link

Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004

Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However in some testing situations, like test translation and…

Descriptors: True Scores, Simulation, Test Bias, Student Evaluation

Methodological Approaches to the Validation of Academic Self-Concept: The Construct and Its Measures.

Peer reviewed

Byrne, Barbara M. – Applied Measurement in Education, 1990

Methodological procedures used in validating the theoretical structure of academic self-concept and validating associated measurement instruments are reviewed. Substantive findings from research related to modes of inquiry are summarized, and recommendations for future research are outlined. (TJH)

Descriptors: Classification, Construct Validity, Evaluation Methods, Literature Reviews

Previous Page | Next Page »

Pages: 1 | 2 | 3

Plake, Barbara S.	3
Su, Ya-Hui	2
Wang, Wen-Chung	2
Ackerman, Terry A.	1
Baron, Joan Boykoff	1
Bart, William M.	1
Bejar, Isaac I.	1
Boughton, Keith A.	1
Brookhart, Susan M.	1
Byrne, Barbara M.	1
Calfee, Robert	1
Chen, Lisue	1
Chen, Yu-Jen	1
Cheng, Chien-Fen	1
Crocker, Linda	1
Crouse, Jill D.	1
Dorans, Neil J.	1
Ercikan, Kadriye	1
Fetler, Mark E.	1
Finney, Sara J.	1
Frary, Robert B.	1
Gao, Furong	1
Gierl, Mark J.	1
Gotwals, Amelia Wenk	1
More ▼

Evaluation Methods	37
Performance Based Assessment	12
Decision Making	8
Educational Assessment	8
Item Response Theory	8
Test Construction	8
Elementary Secondary Education	7
Standards	7
Test Items	7
Evaluators	6
Standard Setting (Scoring)	6
Student Evaluation	6
Simulation	5
Statistical Analysis	5
Teacher Evaluation	5
Test Bias	5
Equated Scores	4
Measurement Techniques	4
Models	4
Scoring	4
Test Use	4
Test Validity	4
Validity	4
Construct Validity	3
Equations (Mathematics)	3
More ▼