Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 10 |
Descriptor
Evaluation Methods | 21 |
Simulation | 21 |
Test Reliability | 21 |
Test Validity | 7 |
Accuracy | 4 |
Correlation | 4 |
Testing | 4 |
Computation | 3 |
Evaluation Criteria | 3 |
Medical Education | 3 |
Observation | 3 |
More ▼ |
Source
Author
Atkins, David C. | 1 |
Beauchaine, Theodore P. | 1 |
Bedics, Jamie D. | 1 |
Bengston, John K. | 1 |
Boughton, Keith A. | 1 |
Brauer, J. | 1 |
Brown, James Dean | 1 |
Butler, Roger | 1 |
Christ, Theodore J. | 1 |
Clark, Amy K. | 1 |
Cohen, Stuart J. | 1 |
More ▼ |
Publication Type
Journal Articles | 17 |
Reports - Research | 11 |
Reports - Descriptive | 5 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 2 |
Collected Works - Proceedings | 1 |
Information Analyses | 1 |
Tests/Questionnaires | 1 |
Education Level
Postsecondary Education | 4 |
Higher Education | 3 |
Adult Education | 2 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 8 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Practitioners | 2 |
Teachers | 1 |
Location
Russia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Trends in International… | 1 |
What Works Clearinghouse Rating
Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024
The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…
Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods
Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023
As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…
Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy
Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025
Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments are limitedly available. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…
Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015
A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…
Descriptors: Test Reliability, Evaluation Methods, Measurement Techniques, Computation
Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017
The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…
Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation
Dorans, Neil J. – ETS Research Report Series, 2014
Simulations are widely used. Simulations produce numbers that are deductive demonstrations of what a model says will happen.They produce numerical results that are consistent with the premises of the model used to generate the numbers. These simulated numerical results are not empirical data that address aspects of the world that lies outside the…
Descriptors: Simulation, Equated Scores, Scores, Scientific Methodology
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Van Norman, Ethan R.; Christ, Theodore J.; Zopluoglu, Cengiz – School Psychology Quarterly, 2013
This study examined the effect of baseline estimation on the quality of trend estimates derived from Curriculum Based Measurement of Oral Reading (CBM-R) progress monitoring data. The authors used a linear mixed effects regression (LMER) model to simulate progress monitoring data for schedules ranging from 6-20 weeks for datasets with high and low…
Descriptors: Curriculum Based Assessment, Oral Reading, Reading Fluency, Regression (Statistics)
Curran, Vernon R.; Butler, Roger; Duke, Pauline; Eaton, William H.; Moffatt, Scott M.; Sherman, Greg P.; Pottle, Madge – Assessment & Evaluation in Higher Education, 2012
Clinical competence is a multidimensional concept and encompasses a variety of skills including procedural, problem-solving and clinical judgement. The initial stages of postgraduate medical training are believed to be a particularly important time for the development of clinical skill competencies. This study reports on an evaluation of a…
Descriptors: Medical Education, Physical Examinations, Focus Groups, Family Practice (Medicine)
Feldman, Moshe; Lazzara, Elizabeth H.; Vanderbilt, Allison A.; DiazGranados, Deborah – Journal of Continuing Education in the Health Professions, 2012
Competency-based assessment and an emphasis on obtaining higher-level outcomes that reflect physicians' ability to demonstrate their skills has created a need for more advanced assessment practices. Simulation-based assessments provide medical education planners with tools to better evaluate the 6 Accreditation Council for Graduate Medical…
Descriptors: Performance Based Assessment, Physicians, Accuracy, High Stakes Tests
Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004
Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However in some testing situations, like test translation and…
Descriptors: True Scores, Simulation, Test Bias, Student Evaluation
Galindo-Garre, Francisca; Vermunt, Jeroen K. – Psychometrika, 2004
This paper presents a row-column (RC) association model in which the estimated row and column scores are forced to be in agreement with a priori specified ordering. Two efficient algorithms for finding the order-restricted maximum likelihood (ML) estimates are proposed and their reliability under different degrees of association is investigated by…
Descriptors: Mathematics, Test Reliability, Computation, Testing

Renner, Richard R.; Greenwood, Gordon E. – Assessment and Evaluation in Higher Education, 1985
Fictitious student evaluations of a faculty member's teaching performance are presented to the reader in an exercise in interpreting such information. Evaluator comments reveal a widespread divergence of views. (MSE)
Descriptors: College Faculty, Evaluation Criteria, Evaluation Methods, Higher Education

Streufert, Siegfried; And Others – Personnel Psychology, 1988
Evaluated quasi-experimental simulation technique designed to measure impact of individual differences in managerial styles on executive performance. Tested 20 simulation-based measures for reliability and validity. Data from two samples suggest that this quasi-experimental simulation technology may be useful in assessing managerial styles not…
Descriptors: Administrator Qualifications, Competence, Evaluation Methods, Individual Differences
Kistner, Emily O.; Muller, Keith E. – Psychometrika, 2004
Intraclass correlation and Cronbach's alpha are widely used to describe reliability of tests and measurements. Even with Gaussian data, exact distributions are known only for compound symmetric covariance (equal variances and equal correlations). Recently, large sample Gaussian approximations were derived for the distribution functions. New exact…
Descriptors: Correlation, Test Reliability, Test Results, Probability
Previous Page | Next Page »
Pages: 1 | 2