ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	5

Descriptor

Evaluators	21
Test Interpretation	21
Interrater Reliability	8
Elementary Secondary Education	7
Evaluation Methods	7
Scoring	7
Standard Setting (Scoring)	7
Testing Problems	5
Test Results	4
Test Validity	4
Testing	4
Cutting Scores	3
Data Collection	3
Decision Making	3
Difficulty Level	3
Minimum Competencies	3
Minimum Competency Testing	3
Rating Scales	3
Scores	3
Standards	3
Test Selection	3
Academic Achievement	2
Computer Assisted Instruction	2
Educational Testing	2
Evaluation Utilization	2
More ▼

Source

Educational Measurement:…	6
Interpreter and Translator…	1
Journal of Educational…	1
Journal of MultiDisciplinary…	1
Language Testing	1
Remedial and Special Education	1

Publication Type

Journal Articles	11
Reports - Research	9
Reports - Evaluative	7
Guides - General	3
ERIC Digests in Full Text	2
ERIC Publications	2
Opinion Papers	2
Information Analyses	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Grade 7	2
Middle Schools	2
Elementary Education	1
Grade 5	1
Grade 6	1
Grade 8	1
Higher Education	1
Postsecondary Education	1

Audience

Policymakers	2
Practitioners	2
Administrators	1
Researchers	1

Location

California	2
China	1

Laws, Policies, & Programs

Assessments and Surveys

Adaptive Behavior Scale	1
National Assessment of…	1
National Teacher Examinations	1
Teacher Performance…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Raters' Scoring Process in Assessment of Interpreting: An Empirical Study Based on Eye Tracking and Retrospective Verbalisation

Peer reviewed

Direct link

Chao Han; Binghan Zheng; Mingqing Xie; Shirong Chen – Interpreter and Translator Trainer, 2024

Human raters' assessment of interpreting is a complex process. Previous researchers have mainly relied on verbal reports to examine this process. To advance our understanding, we conducted an empirical study, collecting raters' eye-movement and retrospection data in a computerised interpreting assessment in which three groups of raters (n = 35)…

Descriptors: Foreign Countries, College Students, College Graduates, Interrater Reliability

Exploring the Impact of Rater Effects on Person Fit in Rater-Mediated Assessments

Peer reviewed

Direct link

Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2020

Researchers have documented the impact of rater effects, or raters' tendencies to give different ratings than would be expected given examinee achievement levels, in performance assessments. However, the degree to which rater effects influence person fit, or the reasonableness of test-takers' achievement estimates given their response patterns,…

Descriptors: Performance Based Assessment, Evaluators, Achievement, Influences

Operationalizing the Reading-into-Writing Construct in Analytic Rating Scales: Effects of Different Approaches on Rating

Peer reviewed

Direct link

Lestari, Santi B.; Brunfaut, Tineke – Language Testing, 2023

Assessing integrated reading-into-writing task performances is known to be challenging, and analytic rating scales have been found to better facilitate the scoring of these performances than other common types of rating scales. However, little is known about how specific operationalizations of the reading-into-writing construct in analytic rating…

Descriptors: Reading Writing Relationship, Writing Tests, Rating Scales, Writing Processes

An Examination of Assessment Fidelity in the Administration and Interpretation of Reading Tests

Peer reviewed

Direct link

Reed, Deborah K.; Sturges, Keith M. – Remedial and Special Education, 2013

Researchers have expressed concern about "implementation" fidelity in intervention research but have not extended that concern to "assessment" fidelity, or the extent to which pre-/posttests are administered and interpreted as intended. When studying reading interventions, data gathering heavily influences the identification of…

Descriptors: Reading Tests, Fidelity, Pretests Posttests, Intervention

Demands on Users for Interpretation of Achievement Test Scores: Implications for the Evaluation Profession

Peer reviewed

Direct link

Della-Piana, Gabriel Mario; Gardner, Michael – Journal of MultiDisciplinary Evaluation, 2011

Background: Professional standards for validity of achievement tests have long reflected a consensus that validity is the degree to which evidence and theory support interpretations of test scores entailed by the intended uses of tests. Yet there are convincing lines of evidence that the standards are not adequately followed in practice, that…

Descriptors: Achievement Tests, Test Validity, Scores, Standards

Using Standard-Setting Data to Establish Cutoff Scores.

Peer reviewed

Geisinger, Kurt F. – Educational Measurement: Issues and Practice, 1991

Ways to use standard-setting data to adjust cutoff scores on examinations are reviewed. Ten sources of information to be used in determining standards are listed. The decision to modify passing scores should be based on these types of information and consideration of adverse impact or rating process irregularities. (SLD)

Descriptors: Cutting Scores, Evaluation Utilization, Evaluators, Interrater Reliability

The Junior High Teacher as a Classroom Evaluator.

Strathe, Marlene I. – 1981

The purposes of this study were, first, to gather descriptive information regarding the measurement and evaluation skills actually utilized by junior high school teachers and, second, to identify differences among elementary, junior, and senior high school teachers. A questionnaire of 41 statements assessed on a five-point scale the usefulness to…

Descriptors: Elementary Secondary Education, Evaluation Methods, Evaluators, Junior High Schools

Factors Influencing Intrajudge Consistency during Standard-Setting.

Peer reviewed

Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991

Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)

Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback

Issues Related to Test Use.

Anderson, Scarvia B. – 1977

Several issues are related to the use of educational tests. First, test users must be able to choose appropriate tests, interpret scores, and make decisions based on scores. In the field of educational testing, few test users have adequate training in these areas. Second, test makers must clearly specify directions for administration, allowable…

Descriptors: Educational Testing, Elementary Secondary Education, Evaluators, Guides

Defining Minimal Competence.

Peer reviewed

Mills, Craig N.; And Others – Educational Measurement: Issues and Practice, 1991

An approach is presented to the definition of minimal competence for judges to use in standard setting. Panelists in standard setting must receive training to ensure that differences in rating result from differences in perceptions of item difficulty, not in differences of opinion about the definition of minimal competence. (SLD)

Descriptors: Cutting Scores, Decision Making, Definitions, Difficulty Level

Teacher Performance Assessment Instruments: A Guide to Interpretation.

Capie, William; And Others – 1978

This manual was prepared to assist in the development of skills requisite to rating the performance of student or beginning teachers. The activities prescribed in the manual are intended to enable experienced teachers to describe the spectrum of performances indicative of the 18 competencies subsumed in the Teacher Performance Assessment…

Descriptors: Classroom Observation Techniques, Data Collection, Elementary Secondary Education, Evaluators

Selection of Judges for Standard-Setting.

Peer reviewed

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991

Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)

Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners

Training Judges to Generate Standard-Setting Data.

Peer reviewed

Reid, Jerry B. – Educational Measurement: Issues and Practice, 1991

Training judges to generate item ratings in standard setting once the reference group has been defined is discussed. It is proposed that sensitivity to the factors that determine difficulty can be improved through training. Three criteria for determining when training is sufficient are offered. (SLD)

Descriptors: Computer Assisted Instruction, Difficulty Level, Evaluators, Interrater Reliability

Guide for School Testing Programs.

Ward, Annie W., Ed.; And Others

A number of brief papers are presented to provide guidelines for test directors of school systems. This collection is intended for both newly appointed and experienced directors. Contributions were solicited from practicing directors of testing; the authors include Anthony J. Allen, Margaret Backman, Joan Bollenbacker, Gerald Hanna, James Lawson;…

Descriptors: Administrator Guides, Administrator Role, Educational Testing, Elementary Secondary Education

Influence of Type of Judge, Normative Information, and Discussion on Standards Recommended for the National Teacher Examinations.

Peer reviewed

Busch, John Christian; Jaeger, Richard M. – Journal of Educational Measurement, 1990

The effects of using recommended data collection procedures on median recommended test standards, variability of recommended test standards, and reliability of recommended standards for 7 subtests of the National Teacher Examinations Communications Skills and General Knowledge Tests were explored, using 236 evaluators (75 public school teachers…

Descriptors: College Faculty, Data Collection, Evaluators, Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Jaeger, Richard M.	2
Anderson, Scarvia B.	1
Binghan Zheng	1
Bronson, William H.	1
Brunfaut, Tineke	1
Busch, John Christian	1
Capie, William	1
Chao Han	1
Della-Piana, Gabriel Mario	1
Gardner, Michael	1
Geisinger, Kurt F.	1
Kennedy, Mary M.	1
Lambert, Nadine M.	1
Law, Alexander I.	1
Lestari, Santi B.	1
Matter, M. Kevin	1
Mills, Craig N.	1
Mingqing Xie	1
Plake, Barbara S.	1
Reed, Deborah K.	1
Reid, Jerry B.	1
Rudner, Lawrence M.	1
Shirong Chen	1
Strathe, Marlene I.	1
More ▼