Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 11 |
Descriptor
Scoring | 14 |
Error of Measurement | 4 |
Test Validity | 4 |
Validity | 4 |
Academic Achievement | 3 |
Achievement Tests | 3 |
Automation | 3 |
Elementary Secondary Education | 3 |
Scores | 3 |
Standard Setting (Scoring) | 3 |
Test Items | 3 |
More ▼ |
Source
Educational Measurement:… | 17 |
Author
Allalouf, Avi | 2 |
Anderson, Dan | 1 |
Baumer, Michal | 1 |
Bejar, Issac I. | 1 |
Boyer, Michelle | 1 |
Breyer, F. Jay | 1 |
Bunch, Michael B. | 1 |
Burkhardt, Amy | 1 |
Cangelosi, James S. | 1 |
Cizek, Gregory J. | 1 |
Cross, Lawrence H. | 1 |
More ▼ |
Publication Type
Journal Articles | 17 |
Reports - Descriptive | 17 |
Education Level
Audience
Teachers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
National Teacher Examinations | 1 |
Teacher Performance… | 1 |
What Works Clearinghouse Rating
Zesch, Torsten; Horbach, Andrea; Zehner, Fabian – Educational Measurement: Issues and Practice, 2023
In this article, we systematize the factors influencing performance and feasibility of automatic content scoring methods for short text responses. We argue that performance (i.e., how well an automatic system agrees with human judgments) mainly depends on the linguistic variance seen in the responses and that this variance is indirectly influenced…
Descriptors: Influences, Academic Achievement, Feasibility Studies, Automation
Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2020
Educational tests are standardized so that all examinees are tested on the same material, under the same testing conditions, and with the same scoring protocols. This uniformity is designed to provide a level "playing field" for all examinees so that the test is "the same" for everyone. Thus, standardization is designed to…
Descriptors: Standards, Educational Assessment, Culture Fair Tests, Scoring
Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…
Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment
Allalouf, Avi; Gutentag, Tony; Baumer, Michal – Educational Measurement: Issues and Practice, 2017
Quality control (QC) in testing is paramount. QC procedures for tests can be divided into two types. The first type, one that has been well researched, is QC for tests administered to large population groups on few administration dates using a small set of test forms (e.g., large-scale assessment). The second type is QC for tests, usually…
Descriptors: Quality Control, Scoring, Computer Assisted Testing, Error Patterns
Myford, Carol M. – Educational Measurement: Issues and Practice, 2012
Over the last several decades, researchers have studied many and varied aspects of rater cognition. Those interested in pursuing basic research have focused on gaining an understanding of raters' thought processes as they score different types of performances and products, striving to understand how raters' mental representations and the cognitive…
Descriptors: Evidence, Validity, Cognitive Processes, Models
Williamson, David M.; Xi, Xiaoming; Breyer, F. Jay – Educational Measurement: Issues and Practice, 2012
A framework for evaluation and use of automated scoring of constructed-response tasks is provided that entails both evaluation of automated scoring as well as guidelines for implementation and maintenance in the context of constantly evolving technologies. Consideration of validity issues and challenges associated with automated scoring are…
Descriptors: Automation, Scoring, Evaluation, Guidelines
Bejar, Issac I. – Educational Measurement: Issues and Practice, 2012
The scoring process is critical in the validation of tests that rely on constructed responses. Documenting that readers carry out the scoring in ways consistent with the construct and measurement goals is an important aspect of score validity. In this article, rater cognition is approached as a source of support for a validity argument for scores…
Descriptors: Scores, Inferences, Validity, Scoring
Geisinger, Kurt F.; McCormick, Carina M. – Educational Measurement: Issues and Practice, 2010
Standard-setting studies utilizing procedures such as the Bookmark or Angoff methods are just one component of the complete standard-setting process. Decision makers ultimately must determine what they believe to be the most appropriate standard or cut score to use, employing the input of the standard-setting panelists as one piece of information…
Descriptors: Standard Setting (Scoring), Measurement, Cutting Scores, Educational Policy
Raymond, Mark R.; Neustel, Sandra; Anderson, Dan – Educational Measurement: Issues and Practice, 2009
Examinees who take high-stakes assessments are usually given an opportunity to repeat the test if they are unsuccessful on their initial attempt. To prevent examinees from obtaining unfair score increases by memorizing the content of specific test items, testing agencies usually assign a different test form to repeat examinees. The use of multiple…
Descriptors: Test Results, Test Items, Testing, Aptitude Tests
Sykes, Robert C.; Ito, Kyoko; Wang, Zhen – Educational Measurement: Issues and Practice, 2008
Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…
Descriptors: Test Items, Mathematics Tests, Reading Tests, Scoring

Ercikan, Kadriye – Educational Measurement: Issues and Practice, 2002
Reviews two types of multiple scoring practices and discusses how multiple scoring affects inferences. Multiple scoring uses a single observation as evidence for making inferences about an examinee's competence in multiple assessment units. Summarizes key implications of multiple scoring. (SLD)
Descriptors: Scoring, Statistical Inference
Allalouf, Avi – Educational Measurement: Issues and Practice, 2007
There is significant potential for error in long production processes that consist of sequential stages, each of which is heavily dependent on the previous stage, such as the SER (Scoring, Equating, and Reporting) process. Quality control procedures are required in order to monitor this process and to reduce the number of mistakes to a minimum. In…
Descriptors: Scoring, Quality Control, Sequential Approach, Error Correction
Cizek, Gregory J.; Bunch, Michael B.; Koons, Heather – Educational Measurement: Issues and Practice, 2004
This module describes some common standard-setting procedures used to derive performance levels for achievement tests in education, licensure, and certification. Upon completing the module, readers will be able to: describe what standard setting is; understand why standard setting is necessary; recognize some of the purposes of standard setting;…
Descriptors: Achievement Tests, Standard Setting, Academic Standards, Academic Achievement

Cangelosi, James S. – Educational Measurement: Issues and Practice, 1984
Test development procedures and six methods for determining cut-off scores are briefly described. An alternate method, appropriate when the test developer also determines the cut-off score, is suggested. Unlike other methods, the standard is set during the test development stage. Its computations are intelligible to nonstatistically-oriented…
Descriptors: Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education, Error of Measurement

Fisher, Thomas H.; And Others – Educational Measurement: Issues and Practice, 1985
The new Florida Master Teacher Program in which the state provides bonuses directly to qualified teachers is described. Three measurement issues in implementing the program are discussed: (1) evaluating a teacher's classroom performance; (2) evaluating a teacher's subject area knowledge; and (3) combining scores to determine which teachers…
Descriptors: Elementary Secondary Education, Incentives, Job Performance, Merit Pay
Previous Page | Next Page ยป
Pages: 1 | 2