NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)2
Since 2006 (last 20 years)11
Laws, Policies, & Programs
Assessments and Surveys
Trends in International…1
What Works Clearinghouse Rating
Showing 1 to 15 of 22 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018
Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…
Descriptors: Competence, Simulation, Allied Health Personnel, Certification
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Dorans, Neil J. – ETS Research Report Series, 2014
Simulations are widely used. Simulations produce numbers that are deductive demonstrations of what a model says will happen.They produce numerical results that are consistent with the premises of the model used to generate the numbers. These simulated numerical results are not empirical data that address aspects of the world that lies outside the…
Descriptors: Simulation, Equated Scores, Scores, Scientific Methodology
Peer reviewed Peer reviewed
Direct linkDirect link
Tendeiro, Jorge N.; Meijer, Rob R. – Journal of Educational Measurement, 2014
In recent guidelines for fair educational testing it is advised to check the validity of individual test scores through the use of person-fit statistics. For practitioners it is unclear on the basis of the existing literature which statistic to use. An overview of relatively simple existing nonparametric approaches to identify atypical response…
Descriptors: Educational Assessment, Test Validity, Scores, Statistical Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Van Norman, Ethan R.; Christ, Theodore J.; Zopluoglu, Cengiz – School Psychology Quarterly, 2013
This study examined the effect of baseline estimation on the quality of trend estimates derived from Curriculum Based Measurement of Oral Reading (CBM-R) progress monitoring data. The authors used a linear mixed effects regression (LMER) model to simulate progress monitoring data for schedules ranging from 6-20 weeks for datasets with high and low…
Descriptors: Curriculum Based Assessment, Oral Reading, Reading Fluency, Regression (Statistics)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zwick, Rebecca – ETS Research Report Series, 2012
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…
Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods
Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010
This report provides an overview of what was known about alternative assessment at the time that the article was written in 1991. Topics include beliefs about assessment reform, overview of alternative assessment including research knowledge, evidence of assessment impact, and critical features of alternative assessment. The author notes that in…
Descriptors: Alternative Assessment, Evaluation Methods, Evaluation Research, Performance Based Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Curran, Vernon R.; Butler, Roger; Duke, Pauline; Eaton, William H.; Moffatt, Scott M.; Sherman, Greg P.; Pottle, Madge – Assessment & Evaluation in Higher Education, 2012
Clinical competence is a multidimensional concept and encompasses a variety of skills including procedural, problem-solving and clinical judgement. The initial stages of postgraduate medical training are believed to be a particularly important time for the development of clinical skill competencies. This study reports on an evaluation of a…
Descriptors: Medical Education, Physical Examinations, Focus Groups, Family Practice (Medicine)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Wang, Hsuan-Po; Kuo, Bor-Chen; Tsai, Ya-Hsun; Liao, Chen-Huei – Turkish Online Journal of Educational Technology - TOJET, 2012
In the era of globalization, the trend towards learning Chinese as a foreign language (CFL) has become increasingly popular worldwide. The increasing demand in learning CFL has raised the profile of the Chinese proficiency test (CPT). This study will analyze in depth the inadequacy of current CPT's utilizing the common European framework of…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Global Approach
Peer reviewed Peer reviewed
Direct linkDirect link
Chin, Jeffrey; Dukes, Richard; Gamson, William – Simulation & Gaming, 2009
This article examines the state of assessment in simulation and gaming over the past 40 years. While assessment has come slowly to many disciplines, members of the simulation and gaming community have been assessing the educational effectiveness of their experiential activities for years, in part because of skepticism from more traditional…
Descriptors: Simulation, Evaluation Research, Meta Analysis, Bibliometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Pommerich, Mary – Journal of Educational Measurement, 2006
Domain scores have been proposed as a user-friendly way of providing instructional feedback about examinees' skills. Domain performance typically cannot be measured directly; instead, scores must be estimated using available information. Simulation studies suggest that IRT-based methods yield accurate group domain score estimates. Because…
Descriptors: Test Validity, Scores, Simulation, Evaluation Methods
Peer reviewed Peer reviewed
Muijtjens, Arno M. M.; van Vollenhoven, Femke H. M.; van Luijk, Scheltus J.; van der Vleuten, Cees P. M. – Academic Medicine, 2000
Sequential, standardized patient-based tests were given to medical students at the University of Maastricht (Netherlands); the first test was a screening test that evaluated the efficiency/validity of cutoff scores. Findings indicated that stringent pass/fail cutoff scores on the screening test produced optimal results. Fewer than 0.2 percent of…
Descriptors: Efficiency, Evaluation Methods, Foreign Countries, Higher Education
Peer reviewed Peer reviewed
Streufert, Siegfried; And Others – Personnel Psychology, 1988
Evaluated quasi-experimental simulation technique designed to measure impact of individual differences in managerial styles on executive performance. Tested 20 simulation-based measures for reliability and validity. Data from two samples suggest that this quasi-experimental simulation technology may be useful in assessing managerial styles not…
Descriptors: Administrator Qualifications, Competence, Evaluation Methods, Individual Differences
Swaak, Janine; And Others – 1997
A study was conducted to develop a test that is able to capture knowledge of an intuitive nature, such as that acquired through discovery learning. The proposed test format is called the "what-if test." Test items in this format consist of the presentation of a situation. A change in the situation is introduced, and learners have to…
Descriptors: College Students, Discovery Learning, Educational Assessment, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Kupermintz, Haggai – Educational Evaluation and Policy Analysis, 2003
This article addresses the validity of teacher evaluation measures produced by the Tennessee Value Added Assessment System (TVAAS). The system analyzes student test score data and estimates the effects of individual teachers on score gains. These effects are used to construct teacher value-added measures of teaching effectiveness. We describe the…
Descriptors: Teacher Effectiveness, Test Validity, Scores, Socioeconomic Background
Previous Page | Next Page »
Pages: 1  |  2