ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	4

Descriptor

Program Effectiveness	4
Scores	3
Generalizability Theory	2
Mathematics Tests	2
Scoring	2
Age Differences	1
Assistive Technology	1
Automation	1
Comparative Analysis	1
Computation	1
Computer Assisted Testing	1
Correlation	1
Credentials	1
Criterion Referenced Tests	1
Cutting Scores	1
Difficulty Level	1
Disabilities	1
Effect Size	1
Evaluation Criteria	1
Foreign Countries	1
Group Discussion	1
Guides	1
High Stakes Tests	1
Mathematics Achievement	1
Mathematics Skills	1
More ▼

Source

Applied Measurement in…

Author

Chis, Liliana	1
Clauser, Brian E.	1
Domaleski, Christopher S.	1
Engelhard, George, Jr.	1
Fincher, Melissa	1
Harik, Polina	1
Johnson, Robert L.	1
Lottridge, Susan	1
Margolis, Melissa J.	1
McManus, I. C.	1
Mollon, Jennifer	1
Penny, Jim	1
Shaw, Dan	1
Shumate, Steven R.	1
Surles, James	1
Williams, Simon	1
Wood, Scott	1
More ▼

Publication Type

Journal Articles	4
Reports - Evaluative	2
Reports - Research	2

Education Level

Grade 3	1
Grade 4	1
Grade 6	1
Grade 7	1
Secondary Education	1

Audience

Location

Georgia	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Georgia Criterion Referenced…

What Works Clearinghouse Rating

Showing all 4 results Save | Export

The Effectiveness of Machine Score-Ability Ratings in Predicting Automated Scoring Performance

Peer reviewed

Direct link

Lottridge, Susan; Wood, Scott; Shaw, Dan – Applied Measurement in Education, 2018

This study sought to provide a framework for evaluating machine score-ability of items using a new score-ability rating scale, and to determine the extent to which ratings were predictive of observed automated scoring performance. The study listed and described a set of factors that are thought to influence machine score-ability; these factors…

Descriptors: Program Effectiveness, Computer Assisted Testing, Test Scoring Machines, Scoring

Mathematics Performance of Students with and without Disabilities under Accommodated Conditions Using Resource Guides and Calculators on High Stakes Tests

Peer reviewed

Direct link

Engelhard, George, Jr.; Fincher, Melissa; Domaleski, Christopher S. – Applied Measurement in Education, 2011

This study examines the effects of two test administration accommodations on the mathematics performance of students within the context of a large-scale statewide assessment. The two test administration accommodations were resource guides and calculators. A stratified random sample of schools was selected to represent the demographic…

Descriptors: Testing Accommodations, Disabilities, High Stakes Tests, Program Effectiveness

An Empirical Examination of the Impact of Group Discussion and Examinee Performance Information on Judgments Made in the Angoff Standard-Setting Procedure

Peer reviewed

Direct link

Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009

Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…

Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring

The Effects of the Number of Scale Points and Non-Normality on the Generalizability Coefficient: A Monte Carlo Study

Peer reviewed

Direct link

Shumate, Steven R.; Surles, James; Johnson, Robert L.; Penny, Jim – Applied Measurement in Education, 2007

Increasingly, assessment practitioners use generalizability coefficients to estimate the reliability of scores from performance tasks. Little research, however, examines the relation between the estimation of generalizability coefficients and the number of rubric scale points and score distributions. The purpose of the present research is to…

Descriptors: Generalizability Theory, Monte Carlo Methods, Measures (Individuals), Program Effectiveness