Peng, Yue; Yan, Wei; Cheng, Liying – Language Testing, 2021
This test review focuses on the current version (2009) of 汉语水平考试 (Hanyu Shuiping Kaoshi), literally translated as the Chinese Language Proficiency Test and abbreviated as HSK. Tailored to non-native speakers of the Chinese language, this test consists of six proficiency levels (Levels 1 and 2 as beginners, Levels 3 and 4 as…
Descriptors: Language Proficiency, Language Tests, Chinese, Decision Making
Aloisi, Cesare; Callaghan, A. – Higher Education Pedagogies, 2018
The University of Reading Learning Gain project is a three-year longitudinal project to test and evaluate a range of available methodologies and to draw conclusions on what might be the right combination of instruments for the measurement of Learning Gain in higher education. This paper analyses the validity of a measure of critical thinking…
Descriptors: Foreign Countries, Cognitive Tests, Critical Thinking, Thinking Skills
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Buratti, Sandra; Allwood, Carl Martin; Kleitman, Sabina – Metacognition and Learning, 2013
In learning contexts, people need to make realistic confidence judgments about their memory performance. The present study investigated whether second-order judgments of first-order confidence judgments could help people improve their confidence judgments of semantic memory information. Furthermore, we assessed whether different personality and…
Descriptors: Memory, Personality Traits, Semantics, Scoring
Everson, Mark D.; Sandoval, Jose Miguel; Berson, Nancy; Crowson, Mary; Robinson, Harriet – Journal of Child Sexual Abuse, 2012
In the absence of photographic or DNA evidence, a credible eyewitness, or perpetrator confession, forensic evaluators in cases of alleged child sexual abuse must rely on psychosocial or "soft" evidence, often requiring substantial professional judgment for case determination. This article offers a three-part rebuttal to Herman's (2009) argument…
Descriptors: Evidence, Evaluators, Persuasive Discourse, Sexual Abuse
Schraw, Gregory – Educational Psychologist, 2010
I provide a summary of the four invited articles in this special issue and compare and contrast different methods for measuring self-regulation in computer-based learning environments (CBLEs). I present a taxonomy that distinguishes between offline and online measures and further distinguishes subcategories within each of these categories. I…
Descriptors: Scoring Rubrics, Scoring, Cognitive Processes, Self Control
New Teacher Project, 2010
Race to the Top represented a new paradigm in federal education. Instead of spreading relatively modest dollars evenly across all jurisdictions through funding formulas--as virtually all federal education funding has been and continues to be spent--a small number of successful states received all of the available funding, and in turn made it…
Descriptors: Federal Programs, Competition, Federal Aid, Educational Improvement
van der Linden, Wim J.; Vos, Hans J. – 1994
This paper presents some Bayesian theories of simultaneous optimization of decision rules for test-based decisions. Simultaneous decision making arises when an institution has to make a series of selection, placement, or mastery decisions with respect to subjects from a population. An obvious example is the use of individualized instruction in…
Descriptors: Bayesian Statistics, Decision Making, Foreign Countries, Scores

Lunz, Mary E.; And Others – Educational and Psychological Measurement, 1994
In a study involving eight judges, analysis with the FACETS model provides evidence that judges grade differently, whether or not scores correlate well. This outcome suggests that adjustments for differences among judges should be made before student measures are estimated to produce reproducible decisions. (SLD)
Descriptors: Correlation, Decision Making, Evaluation Methods, Evaluators

Wilcox, Rand R.; Wilcox, Karen Thompson – Journal of Educational Measurement, 1988
Use of latent class models to examine strategies that examinees (92 college students) use for a specific task is illustrated, via a multiple-choice test of spatial ability. Under an answer-until-correct scoring procedure, models representing an improvement over simplistic random guessing are proposed. (SLD)
Descriptors: College Students, Decision Making, Guessing (Tests), Multiple Choice Tests
Crooks, Terry – 1996
A recently developed model of validation (T. J. Crooks, M. T. Kane, and A. S. Cohen, 1996) is briefly outlined. It conceptualizes assessment as divided into a chain of eight linked stages: (1) administration; (2) scoring; (3) aggregation; (4) generalization; (5) extrapolation; (6) evaluation; (7) decision; and (8) impact. The model is then used to…
Descriptors: Decision Making, Educational Assessment, Foreign Countries, Models

Case, Susan M.; Swanson, David B. – Teaching and Learning in Medicine, 1993
Extended matching, a test item format used currently in medical licensing examinations, is described. Procedures for writing and reviewing such test items are outlined, test development and psychometric advantages are discussed, and issues in test administration and scoring are examined. The extended matching form is also seen as having uses for…
Descriptors: Clinical Diagnosis, Decision Making, Higher Education, Licensing Examinations (Professions)

Mills, Craig N.; And Others – Educational Measurement: Issues and Practice, 1991
An approach is presented to the definition of minimal competence for judges to use in standard setting. Panelists in standard setting must receive training to ensure that differences in rating result from differences in perceptions of item difficulty, not from differences of opinion about the definition of minimal competence. (SLD)
Descriptors: Cutting Scores, Decision Making, Definitions, Difficulty Level

Hambleton, Ronald K.; Slater, Sharon C. – Applied Measurement in Education, 1997
A brief history of developments in the assessment of the reliability of credentialing examinations is presented, and some new results are outlined that highlight the interactions among scoring, standard setting, and the reliability and validity of pass-fail decisions. Decision consistency is an important concept in evaluating credentialing…
Descriptors: Certification, Credentials, Decision Making, Interaction
Kaiser, Paul D.; Brull, Harry – 1994
The design, administration, scoring, and results of the 1993 New York State Correctional Captain Examination are described. The examination was administered to 405 candidates. As in previous Sergeant and Lieutenant examinations, candidates also completed latent image written simulation problems and open/closed book multiple choice test components.…
Descriptors: Competitive Selection, Correctional Rehabilitation, Decision Making, Educational Innovation