ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	8

Source

Applied Measurement in…	3
Educational Measurement:…	1
Educational Psychologist	1
Educational and Psychological…	1
Higher Education Pedagogies	1
Journal of Abnormal Child…	1
Journal of Child Sexual Abuse	1
Journal of Educational…	1
Journal of Personnel…	1
Language Testing	1
Metacognition and Learning	1
New Teacher Project	1
TESL Canada Journal	1
Teaching and Learning in…	1
More ▼

Publication Type

Reports - Evaluative	23
Journal Articles	15
Speeches/Meeting Papers	7
Book/Product Reviews	1

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Location

New Zealand	1
United Kingdom	1
United Kingdom (Reading)	1

Laws, Policies, & Programs

Race to the Top

Assessments and Surveys

National Assessment of…	1
Test of English as a Foreign…	1
United States Medical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Hanyu Shuiping Kaoshi (HSK): A Multi-Level, Multi-Purpose Proficiency Test

Peer reviewed

Direct link

Peng, Yue; Yan, Wei; Cheng, Liying – Language Testing, 2021

This test review focuses on the current version (2009) of [Chinese characters omitted] (Hanyu Shuiping Kaoshi), literally translated as the Chinese Language Proficiency Test and abbreviated as HSK. Tailored to non-native speakers of the Chinese language, this test consists of six proficiency levels (Levels 1 and 2 as beginners, Levels 3 and 4 as…

Descriptors: Language Proficiency, Language Tests, Chinese, Decision Making

Threats to the Validity of the Collegiate Learning Assessment (CLA+) as a Measure of Critical Thinking Skills and Implications for Learning Gain

Peer reviewed

Direct link

Aloisi, Cesare; Callaghan, A. – Higher Education Pedagogies, 2018

The University of Reading Learning Gain project is a three-year longitudinal project to test and evaluate a range of available methodologies and to draw conclusions on what might be the right combination of instruments for the measurement of Learning Gain in higher education. This paper analyses the validity of a measure of critical thinking…

Descriptors: Foreign Countries, Cognitive Tests, Critical Thinking, Thinking Skills

In Search of Validity Evidence in Support of the Interpretation and Use of Assessments of Complex Constructs: Discussion of Research on Assessing 21st Century Skills

Peer reviewed

Direct link

Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016

Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…

Descriptors: Evaluation Methods, Test Construction, Design, Scaling

First- and Second-Order Metacognitive Judgments of Semantic Memory Reports: The Influence of Personality Traits and Cognitive Styles

Peer reviewed

Direct link

Buratti, Sandra; Allwood, Carl Martin; Kleitman, Sabina – Metacognition and Learning, 2013

In learning contexts, people need to make realistic confidence judgments about their memory performance. The present study investigated whether second-order judgments of first-order confidence judgments could help people improve their confidence judgments of semantic memory information. Furthermore, we assessed whether different personality and…

Descriptors: Memory, Personality Traits, Semantics, Scoring

Reliability of Professional Judgments in Forensic Child Sexual Abuse Evaluations: Unsettled or Unsettling Science?

Peer reviewed

Direct link

Everson, Mark D.; Sandoval, Jose Miguel; Berson, Nancy; Crowson, Mary; Robinson, Harriet – Journal of Child Sexual Abuse, 2012

In the absence of photographic or DNA evidence, a credible eyewitness, or perpetrator confession, forensic evaluators in cases of alleged child sexual abuse must rely on psychosocial or "soft" evidence, often requiring substantial professional judgment for case determination. This article offers a three-part rebuttal to Herman's (2009) argument…

Descriptors: Evidence, Evaluators, Persuasive Discourse, Sexual Abuse

Measuring Self-Regulation in Computer-Based Learning Environments

Peer reviewed

Direct link

Schraw, Gregory – Educational Psychologist, 2010

I provide a summary of the four invited articles in this special issue and compare and contrast different methods for measuring self-regulation in computer-based learning environments (CBLEs). I present a taxonomy that distinguishes between offline and online measures and further distinguishes subcategories within each of these categories. I…

Descriptors: Scoring Rubrics, Scoring, Cognitive Processes, Self Control

Resetting Race to the Top: Why the Future of the Competition Depends on Improving the Scoring Process. Policy Brief

Download full text

New Teacher Project, 2010

Race to the Top represented a new paradigm in federal education. Instead of spreading relatively modest dollars evenly across all jurisdictions through funding formulas--as virtually all federal education funding has been and continues to be spent--a small number of successful states received all of the available funding, and in turn made it…

Descriptors: Federal Programs, Competition, Federal Aid, Educational Improvement

A Compensatory Approach to Optimal Selection with Mastery Scores. Research Report 94-2.

Download full text

van der Linden, Wim J.; Vos, Hans J. – 1994

This paper presents some Bayesian theories of simultaneous optimization of decision rules for test-based decisions. Simultaneous decision making arises when an institution has to make a series of selection, placement, or mastery decisions with respect to subjects from a population. An obvious example is the use of individualized instruction in…

Descriptors: Bayesian Statistics, Decision Making, Foreign Countries, Scores

Interjudge Reliability and Decision Reproducibility.

Peer reviewed

Lunz, Mary E.; And Others – Educational and Psychological Measurement, 1994

In a study involving eight judges, analysis with the FACETS model provides evidence that judges grade differently, whether or not scores correlate well. This outcome suggests that adjustments for differences among judges should be made before student measures are estimated to produce reproducible decisions. (SLD)

Descriptors: Correlation, Decision Making, Evaluation Methods, Evaluators

Models of Decisionmaking Processes for Multiple-Choice Test Items: An Analysis of Spatial Ability.

Peer reviewed

Wilcox, Rand R.; Wilcox, Karen Thompson – Journal of Educational Measurement, 1988

Use of latent class models to examine strategies that examinees (92 college students) use for a specific task is illustrated, via a multiple-choice test of spatial ability. Under an answer-until-correct scoring procedure, models representing an improvement over simplistic random guessing are proposed. (SLD)

Descriptors: College Students, Decision Making, Guessing (Tests), Multiple Choice Tests

Validity Issues in State or National Monitoring of Educational Outcomes.

Download full text

Crooks, Terry – 1996

A recently developed model of validation (T. J. Crooks, M. T. Kane, and A. S. Cohen, 1996) is briefly outlined. It conceptualizes assessment as divided into a chain of eight linked stages: (1) administration; (2) scoring; (3) aggregation; (4) generalization; (5) extrapolation; (6) evaluation; (7) decision; and (8) impact. The model is then used to…

Descriptors: Decision Making, Educational Assessment, Foreign Countries, Models

Extended-Matching Items: A Practical Alternative to Free-Response Questions.

Peer reviewed

Case, Susan M.; Swanson, David B. – Teaching and Learning in Medicine, 1993

Extended matching, a test item format used currently in medical licensing examinations, is described. Procedures for writing and reviewing such test items are outlined, test development and psychometric advantages are discussed, and issues in test administration and scoring are examined. The extended matching form is also seen as having uses for…

Descriptors: Clinical Diagnosis, Decision Making, Higher Education, Licensing Examinations (Professions)

Defining Minimal Competence.

Peer reviewed

Mills, Craig N.; And Others – Educational Measurement: Issues and Practice, 1991

An approach is presented to the definition of minimal competence for judges to use in standard setting. Panelists in standard setting must receive training to ensure that differences in rating result from differences in perceptions of item difficulty, not in differences of opinion about the definition of minimal competence. (SLD)

Descriptors: Cutting Scores, Decision Making, Definitions, Difficulty Level

Reliability of Credentialing Examinations and the Impact of Scoring Models and Standard-Setting Policies.

Peer reviewed

Hambleton, Ronald K.; Slater, Sharon C. – Applied Measurement in Education, 1997

A brief history of developments in the assessment of the reliability of credentialing examinations is presented, and some new results are outlined that highlight the interactions among scoring, standard setting, and the reliability and validity of pass-fail decisions. Decision consistency is an important concept in evaluating credentialing…

Descriptors: Certification, Credentials, Decision Making, Interaction

New Stuff in I/O (In-Baskets and Orals). The Development, Administration and Scoring of In-Baskets and Orals for the New York State Correction Captain Examination.

Download full text

Kaiser, Paul D.; Brull, Harry – 1994

The design, administration, scoring, and results of the 1993 New York State Correctional Captain Examination are described. The examination was administered to 405 candidates. As in previous Sergeant and Lieutenant examinations, candidates also completed latent image written simulation problems and open/closed book multiple choice test components.…

Descriptors: Competitive Selection, Correctional Rehabilitation, Decision Making, Educational Innovation

Previous Page | Next Page »

Pages: 1 | 2

Scoring	23
Decision Making	21
Test Validity	7
Scores	6
Test Use	6
Evaluators	5
Interrater Reliability	5
Models	5
Test Construction	5
Correlation	4
Evaluation Methods	4
Performance Based Assessment	4
Test Items	4
Test Reliability	4
Comparative Analysis	3
Elementary Secondary Education	3
Foreign Countries	3
Higher Education	3
Licensing Examinations…	3
Standard Setting (Scoring)	3
Standards	3
Test Format	3
Test Interpretation	3
Adolescents	2
Certification	2
More ▼

Allwood, Carl Martin	1
Aloisi, Cesare	1
Berson, Nancy	1
Brugman, Daniel	1
Brull, Harry	1
Buratti, Sandra	1
Callaghan, A.	1
Case, Susan M.	1
Cheng, Liying	1
Crooks, Terry	1
Crowson, Mary	1
Dekovic, Maja	1
Des Brisay, Margaret	1
Ercikan, Kadriye	1
Estabrooke, Marianna	1
Everson, Mark D.	1
Gibbs, John C.	1
Haladyna, Thomas M.	1
Hambleton, Ronald K.	1
Hansche, Linda	1
Kaiser, Paul D.	1
Kleitman, Sabina	1
Lunz, Mary E.	1
Mills, Craig N.	1
More ▼