ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	2

Source

Educational Measurement:…

Author

Attali, Yigal	1
Gagne, Phill	1
Geisinger, Kurt F.	1
Ito, Kyoko	1
Jaeger, Richard M.	1
Lissitz, Robert W.	1
Mills, Craig N.	1
Plake, Barbara S.	1
Reid, Jerry B.	1
Schafer, William D.	1
Sykes, Robert C.	1
Wang, Zhen	1
More ▼

Publication Type

Journal Articles	8
Reports - Evaluative	5
Information Analyses	1
Opinion Papers	1
Reports - Descriptive	1
Reports - Research	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Rater Certification Tests: A Psychometric Approach

Peer reviewed

Direct link

Attali, Yigal – Educational Measurement: Issues and Practice, 2019

Rater training is an important part of developing and conducting large-scale constructed-response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to…

Descriptors: Evaluators, Certification, High Stakes Tests, Scoring

Effects of Assigning Raters to Items

Peer reviewed

Direct link

Sykes, Robert C.; Ito, Kyoko; Wang, Zhen – Educational Measurement: Issues and Practice, 2008

Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…

Descriptors: Test Items, Mathematics Tests, Reading Tests, Scoring

Resistance to Confounding Style and Content in Scoring Constructed-Response Items

Peer reviewed

Direct link

Schafer, William D.; Gagne, Phill; Lissitz, Robert W. – Educational Measurement: Issues and Practice, 2005

An assumption that is fundamental to the scoring of student-constructed responses (e.g., essays) is the ability of raters to focus on the response characteristics of interest rather than on other features. A common example, and the focus of this study, is the ability of raters to score a response based on the content achievement it demonstrates…

Descriptors: Scoring, Language Usage, Effect Size, Student Evaluation

Using Standard-Setting Data to Establish Cutoff Scores.

Peer reviewed

Geisinger, Kurt F. – Educational Measurement: Issues and Practice, 1991

Ways to use standard-setting data to adjust cutoff scores on examinations are reviewed. Ten sources of information to be used in determining standards are listed. The decision to modify passing scores should be based on these types of information and consideration of adverse impact or rating process irregularities. (SLD)

Descriptors: Cutting Scores, Evaluation Utilization, Evaluators, Interrater Reliability

Factors Influencing Intrajudge Consistency during Standard-Setting.

Peer reviewed

Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991

Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)

Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback

Defining Minimal Competence.

Peer reviewed

Mills, Craig N.; And Others – Educational Measurement: Issues and Practice, 1991

An approach is presented to the definition of minimal competence for judges to use in standard setting. Panelists in standard setting must receive training to ensure that differences in rating result from differences in perceptions of item difficulty, not in differences of opinion about the definition of minimal competence. (SLD)

Descriptors: Cutting Scores, Decision Making, Definitions, Difficulty Level

Selection of Judges for Standard-Setting.

Peer reviewed

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991

Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)

Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners

Training Judges to Generate Standard-Setting Data.

Peer reviewed

Reid, Jerry B. – Educational Measurement: Issues and Practice, 1991

Training judges to generate item ratings in standard setting once the reference group has been defined is discussed. It is proposed that sensitivity to the factors that determine difficulty can be improved through training. Three criteria for determining when training is sufficient are offered. (SLD)

Descriptors: Computer Assisted Instruction, Difficulty Level, Evaluators, Interrater Reliability

Evaluators	8
Scoring	8
Interrater Reliability	6
Standard Setting (Scoring)	5
Test Interpretation	5
Cutting Scores	3
Minimum Competencies	3
Minimum Competency Testing	3
Testing Problems	3
Computer Assisted Instruction	2
Decision Making	2
Difficulty Level	2
Examiners	2
Measurement Techniques	2
Selection	2
Bias	1
Certification	1
Definitions	1
Effect Size	1
Error of Measurement	1
Essay Tests	1
Evaluation Methods	1
Evaluation Utilization	1
Factor Analysis	1
Feedback	1
More ▼