Finch, Holmes – Applied Measurement in Education, 2022
Much research has been devoted to identification of differential item functioning (DIF), which occurs when the item responses for individuals from two groups differ after they are conditioned on the latent trait being measured by the scale. There has been less work examining differential step functioning (DSF), which is present for polytomous…
Descriptors: Comparative Analysis, Item Response Theory, Item Analysis, Simulation
Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016
As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…
Descriptors: Essays, Scoring, Comparative Analysis, Evaluators
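As a rough illustration of how pairwise decisions might be aggregated into essay scores, the sketch below fits a Bradley-Terry model with a simple iterative update. The abstract does not say which aggregation model the authors used, so Bradley-Terry is an assumption here, and the function name `bradley_terry` and the essay labels are hypothetical. The sketch also assumes every essay wins at least one comparison.

```python
from collections import defaultdict


def bradley_terry(comparisons, n_iter=200):
    """Estimate essay strengths from (winner, loser) judgments.

    Assumption: Bradley-Terry is used only for illustration; the
    source abstract does not name an aggregation model.
    Returns a dict mapping essay id to a strength; strengths sum to 1.
    """
    items = sorted({essay for pair in comparisons for essay in pair})
    wins = defaultdict(int)
    pair_counts = defaultdict(int)
    for winner, loser in comparisons:
        wins[winner] += 1
        pair_counts[frozenset((winner, loser))] += 1

    # Start from equal strengths and refine iteratively.
    strength = {i: 1.0 / len(items) for i in items}
    for _ in range(n_iter):
        updated = {}
        for i in items:
            denom = sum(
                pair_counts[frozenset((i, j))] / (strength[i] + strength[j])
                for j in items
                if j != i
            )
            updated[i] = wins[i] / denom if denom > 0 else strength[i]
        total = sum(updated.values())
        strength = {i: value / total for i, value in updated.items()}
    return strength


# Hypothetical judgments: each tuple is (preferred essay, other essay).
judgments = [
    ("A", "B"), ("A", "B"), ("B", "A"),
    ("B", "C"), ("B", "C"), ("C", "B"),
    ("A", "C"), ("A", "C"), ("C", "A"),
]
scores = bradley_terry(judgments)
ranking = sorted(scores, key=scores.get, reverse=True)
```

With these made-up judgments, essay A is preferred most often and C least often, so `ranking` orders the essays A, B, C.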
Stone, Clement A.; Ye, Feifei; Zhu, Xiaowen; Lane, Suzanne – Applied Measurement in Education, 2010
Although reliability of subscale scores may be suspect, subscale scores are the most common type of diagnostic information included in student score reports. This research compared methods for augmenting the reliability of subscale scores for an 8th-grade mathematics assessment. Yen's Objective Performance Index, Wainer et al.'s augmented scores,…
Descriptors: Item Response Theory, Case Studies, Reliability, Scores
Osborn Popp, Sharon E.; Ryan, Joseph M.; Thompson, Marilyn S. – Applied Measurement in Education, 2009
Scoring rubrics are routinely used to evaluate the quality of writing samples produced for writing performance assessments, with anchor papers chosen to represent score points defined in the rubric. Although the careful selection of anchor papers is associated with best practices for scoring, little research has been conducted on the role of…
Descriptors: Writing Evaluation, Scoring Rubrics, Selection, Scoring
Noell, Jay; Ginsburg, Alan – Applied Measurement in Education, 2009
The report, "Evaluation of the National Assessment of Educational Progress", provides a number of recommendations for addressing validity concerns about NAEP. This article identifies actions that could be taken by the Congress, the National Center for Education Statistics, and the National Assessment Governing Board--which share responsibility for…
Descriptors: National Competency Tests, Federal Government, Public Agencies, Test Validity

Bong, Mimi; Hocevar, Dennis – Applied Measurement in Education, 2002
Examined convergent and discriminant validity of various self-efficacy measures across two studies, one involving 358 U.S. high school students and another involving 235 Korean female high school students. Across the studies the first-order confirmatory factor analyses provide support for both convergent validity of different self-efficacy…
Descriptors: Comparative Analysis, Foreign Countries, High School Students, High Schools

Mehrens, William A.; Phillips, S. E. – Applied Measurement in Education, 1989
A sequential decision-making approach based on college grade point averages and test scores for teacher licensure decisions within the conjunctive model is contrasted with the compensatory model for decision making. Criteria for choosing one model over another and a rationale for favoring the conjunctive model are provided. (TJH)
Descriptors: Comparative Analysis, Cutting Scores, Decision Making, Grade Point Average
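The contrast between the two decision models can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the cut scores, the weights, the 4.0 and 100-point scales, and the function names are all hypothetical. Under the conjunctive rule a candidate must clear every cut separately; under the compensatory rule a strong score on one measure can offset a weak score on the other.

```python
def conjunctive_pass(gpa, test_score, gpa_cut=2.5, test_cut=70.0):
    """Pass only if BOTH measures meet their own cut (hypothetical cuts).

    The checks run in sequence: a candidate who fails the GPA cut is
    rejected without the test score being consulted.
    """
    return gpa >= gpa_cut and test_score >= test_cut


def compensatory_pass(gpa, test_score, gpa_weight=0.5, composite_cut=0.65):
    """Pass if a weighted composite meets one cut (hypothetical weights).

    Assumes GPA is on a 4.0 scale and the test on a 100-point scale,
    so both are rescaled to 0-1 before weighting.
    """
    composite = gpa_weight * (gpa / 4.0) + (1 - gpa_weight) * (test_score / 100.0)
    return composite >= composite_cut


# A candidate with a high GPA but a test score below the test cut.
candidate = {"gpa": 3.9, "test_score": 60.0}
conjunctive_result = conjunctive_pass(**candidate)    # False
compensatory_result = compensatory_pass(**candidate)  # True
```

The same candidate is rejected under the conjunctive rule and accepted under the compensatory rule, which is the kind of disagreement that makes the choice of model consequential for licensure decisions.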

Williams, Valerie S. L. – Applied Measurement in Education, 1997
Using item response theory to investigate differential item functioning (DIF), students' expected course grades were examined and found to function similarly across sex and race. These grades were incorporated into the matching criterion, enhancing the validity of subgroup comparisons for the third-grade mathematics test taken by 1,050 students.…
Descriptors: Comparative Analysis, Criteria, Elementary School Students, Grade 3