ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	9

Descriptor

Accuracy	9
Scoring Formulas	9
Scores	4
Evaluation Methods	3
Interrater Reliability	3
Multiple Choice Tests	3
Test Construction	3
Classification	2
Cutting Scores	2
Equated Scores	2
Guessing (Tests)	2
Item Response Theory	2
Mathematics Tests	2
Measurement Techniques	2
Models	2
Predictive Validity	2
Probability	2
Reading Tests	2
Tables (Data)	2
Test Reliability	2
Test Wiseness	2
Achievement Gains	1
Achievement Rating	1
Achievement Tests	1
Adaptive Testing	1
More ▼

Source

Northwest Evaluation…	2
Anatomical Sciences Education	1
Applied Measurement in…	1
Educational Leadership	1
Higher Education Studies	1
Journal of Educational…	1
Measurement and Evaluation in…	1
Review of Educational Research	1

Publication Type

Journal Articles	7
Reports - Evaluative	4
Reports - Research	4
Numerical/Quantitative Data	2
Information Analyses	1
Reports - Descriptive	1

Education Level

Grade 7	1
Higher Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Appraising the Scoring Performance of Automated Essay Scoring Systems--Some Additional Considerations: Which Essays? Which Human Raters? Which Scores?

Peer reviewed

Direct link

Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018

The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…

Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Developing, Analyzing, and Using Distractors for Multiple-Choice Tests in Education: A Comprehensive Review

Peer reviewed

Direct link

Gierl, Mark J.; Bulut, Okan; Guo, Qi; Zhang, Xinxin – Review of Educational Research, 2017

Multiple-choice testing is considered one of the most effective and enduring forms of educational assessment that remains in practice today. This study presents a comprehensive review of the literature on multiple-choice testing in education focused, specifically, on the development, analysis, and use of the incorrect options, which are also…

Descriptors: Multiple Choice Tests, Difficulty Level, Accuracy, Error Patterns

Modeling Student Test-Taking Motivation in the Context of an Adaptive Achievement Test

Peer reviewed

Direct link

Wise, Steven L.; Kingsbury, G. Gage – Journal of Educational Measurement, 2016

This study examined the utility of response time-based analyses in understanding the behavior of unmotivated test takers. For the data from an adaptive achievement test, patterns of observed rapid-guessing behavior and item response accuracy were compared to the behavior expected under several types of models that have been proposed to represent…

Descriptors: Achievement Tests, Student Motivation, Test Wiseness, Adaptive Testing

Climbing Bloom's Taxonomy Pyramid: Lessons from a Graduate Histology Course

Peer reviewed

Direct link

Zaidi, Nikki B.; Hwang, Charles; Scott, Sara; Stallard, Stefanie; Purkiss, Joel; Hortsch, Michael – Anatomical Sciences Education, 2017

Bloom's taxonomy was adopted to create a subject-specific scoring tool for histology multiple-choice questions (MCQs). This Bloom's Taxonomy Histology Tool (BTHT) was used to analyze teacher- and student-generated quiz and examination questions from a graduate level histology course. Multiple-choice questions using histological images were…

Descriptors: Taxonomy, Anatomy, Graduate Students, Scoring Formulas

Grading: Why You Should Trust Your Judgment

Direct link

Guskey, Thomas R.; Jung, Lee Ann – Educational Leadership, 2016

Many educators consider grades calculated from statistical algorithms more accurate, objective, and reliable than grades they calculate themselves. But in this research, the authors first asked teachers to use their professional judgment to choose a summary grade for hypothetical students. When the researchers compared the teachers' grade with the…

Descriptors: Grading, Computer Assisted Testing, Interrater Reliability, Grades (Scholastic)

Linking the ACT ASPIRE Assessments to NWEA MAP Assessments

Download full text

Northwest Evaluation Association, 2016

Northwest Evaluation Association™ (NWEA™) is committed to providing partners with useful tools to help make inferences from Measures of Academic Progress® (MAP®) interim assessment scores. One important tool is the concordance table between MAP and state summative assessments. Concordance tables have been used for decades to relate scores on…

Descriptors: Tables (Data), Benchmarking, Scoring Formulas, Scores

Linking the Smarter Balanced Assessments to NWEA MAP Assessments

Download full text

Northwest Evaluation Association, 2015

Concordance tables have been used for decades to relate scores on different tests measuring similar but distinct constructs. These tables, typically derived from statistical linking procedures, provide a direct link between scores on different tests and serve various purposes. Aside from describing how a score on one test relates to performance on…

Descriptors: Outcome Measures, Tables (Data), Language Arts, English Instruction

Multiple-Choice Testing Using Immediate Feedback--Assessment Technique (IF AT®) Forms: Second-Chance Guessing vs. Second-Chance Learning?

Peer reviewed
PDF on ERIC

Download full text

Merrel, Jeremy D.; Cirillo, Pier F.; Schwartz, Pauline M.; Webb, Jeffrey A. – Higher Education Studies, 2015

Multiple choice testing is a common but often ineffective method for evaluating learning. A newer approach, however, using Immediate Feedback Assessment Technique (IF AT®, Epstein Educational Enterprise, Inc.) forms, offers several advantages. In particular, a student learns immediately if his or her answer is correct and, in the case of an…

Descriptors: Multiple Choice Tests, Feedback (Response), Evaluation Methods, Guessing (Tests)

Bardhoshi, Gerta	1
Bulut, Okan	1
Cirillo, Pier F.	1
Cohen, Allan	1
Erford, Bradley T.	1
Gierl, Mark J.	1
Guo, Qi	1
Guskey, Thomas R.	1
Hortsch, Michael	1
Hwang, Charles	1
Jung, Lee Ann	1
Kingsbury, G. Gage	1
Merrel, Jeremy D.	1
Purkiss, Joel	1
Raczynski, Kevin	1
Schwartz, Pauline M.	1
Scott, Sara	1
Stallard, Stefanie	1
Webb, Jeffrey A.	1
Wise, Steven L.	1
Zaidi, Nikki B.	1
Zhang, Xinxin	1
More ▼