ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	10

Descriptor

Evaluation Methods	21
Simulation	21
Test Reliability	21
Test Validity	7
Accuracy	4
Correlation	4
Testing	4
Computation	3
Evaluation Criteria	3
Medical Education	3
Observation	3
Student Evaluation	3
Competence	2
Educational Technology	2
Effect Size	2
Equated Scores	2
Evaluation Research	2
Foreign Countries	2
Higher Education	2
Item Analysis	2
Knowledge Level	2
Mathematical Models	2
Measurement Techniques	2
Models	2
Performance Based Assessment	2
More ▼

Source

Psychometrika	2
Advances in Health Sciences…	1
Applied Measurement in…	1
Assessment & Evaluation in…	1
Assessment and Evaluation in…	1
ETS Research Report Series	1
Educational Sciences: Theory…	1
Educational and Psychological…	1
European Journal of…	1
European Journal of Education	1
Journal of Consulting and…	1
Journal of Continuing…	1
Journal of Educational…	1
Personnel Psychology	1
Research Synthesis Methods	1
School Psychology Quarterly	1
More ▼

Publication Type

Journal Articles	17
Reports - Research	11
Reports - Descriptive	5
Reports - Evaluative	3
Speeches/Meeting Papers	2
Collected Works - Proceedings	1
Information Analyses	1
Tests/Questionnaires	1

Education Level

Postsecondary Education	4
Higher Education	3
Adult Education	2
Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Practitioners	2
Teachers	1

Location

Russia

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

"LFK" Index Does Not Reliably Detect Small-Study Effects in Meta-Analysis: A Simulation Study

Peer reviewed

Direct link

Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024

The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…

Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods

Using Simulated Retests to Estimate the Reliability of Diagnostic Assessment Systems

Peer reviewed

Direct link

Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023

As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…

Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy

Capturing Competence: The Design, Evaluation, and Implementation of a Video-Based Instrument for Assessing Verbal Aggression Management Competence

Peer reviewed

Direct link

Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025

Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments are limitedly available. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…

Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression

Scale Reliability Evaluation with Heterogeneous Populations

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015

A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…

Descriptors: Test Reliability, Evaluation Methods, Measurement Techniques, Computation

Testing Methodology in the Student Learning Process

Peer reviewed
PDF on ERIC

Download full text

Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017

The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…

Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation

Simulate to Understand Models, Not Nature. Research Report. ETS RR-14-16

Peer reviewed
PDF on ERIC

Download full text

Dorans, Neil J. – ETS Research Report Series, 2014

Simulations are widely used. Simulations produce numbers that are deductive demonstrations of what a model says will happen.They produce numerical results that are consistent with the premises of the model used to generate the numbers. These simulated numerical results are not empirical data that address aspects of the world that lies outside the…

Descriptors: Simulation, Equated Scores, Scores, Scientific Methodology

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

The Effects of Baseline Estimation on the Reliability, Validity, and Precision of CBM-R Growth Estimates

Peer reviewed

Direct link

Van Norman, Ethan R.; Christ, Theodore J.; Zopluoglu, Cengiz – School Psychology Quarterly, 2013

This study examined the effect of baseline estimation on the quality of trend estimates derived from Curriculum Based Measurement of Oral Reading (CBM-R) progress monitoring data. The authors used a linear mixed effects regression (LMER) model to simulate progress monitoring data for schedules ranging from 6-20 weeks for datasets with high and low…

Descriptors: Curriculum Based Assessment, Oral Reading, Reading Fluency, Regression (Statistics)

Effectiveness of a Simulated Clinical Examination in the Assessment of the Clinical Competencies of Entry-Level Trainees in a Family Medicine Residency Programme

Peer reviewed

Direct link

Curran, Vernon R.; Butler, Roger; Duke, Pauline; Eaton, William H.; Moffatt, Scott M.; Sherman, Greg P.; Pottle, Madge – Assessment & Evaluation in Higher Education, 2012

Clinical competence is a multidimensional concept and encompasses a variety of skills including procedural, problem-solving and clinical judgement. The initial stages of postgraduate medical training are believed to be a particularly important time for the development of clinical skill competencies. This study reports on an evaluation of a…

Descriptors: Medical Education, Physical Examinations, Focus Groups, Family Practice (Medicine)

Rater Training to Support High-Stakes Simulation-Based Assessments

Peer reviewed

Direct link

Feldman, Moshe; Lazzara, Elizabeth H.; Vanderbilt, Allison A.; DiazGranados, Deborah – Journal of Continuing Education in the Health Professions, 2012

Competency-based assessment and an emphasis on obtaining higher-level outcomes that reflect physicians' ability to demonstrate their skills has created a need for more advanced assessment practices. Simulation-based assessments provide medical education planners with tools to better evaluate the 6 Accreditation Council for Graduate Medical…

Descriptors: Performance Based Assessment, Physicians, Accuracy, High Stakes Tests

Performance of SIBTEST When the Percentage of DIF Items Is Large

Peer reviewed

Direct link

Gierl, Mark J.; Gotzmann, Andrea; Boughton, Keith A. – Applied Measurement in Education, 2004

Differential item functioning (DIF) analyses are used to identify items that operate differently between two groups, after controlling for ability. The Simultaneous Item Bias Test (SIBTEST) is a popular DIF detection method that matches examinees on a true score estimate of ability. However in some testing situations, like test translation and…

Descriptors: True Scores, Simulation, Test Bias, Student Evaluation

The Order-Restricted Association Model: Two Estimation Algorithms and Issues in Testing

Peer reviewed

Direct link

Galindo-Garre, Francisca; Vermunt, Jeroen K. – Psychometrika, 2004

This paper presents a row-column (RC) association model in which the estimated row and column scores are forced to be in agreement with a priori specified ordering. Two efficient algorithms for finding the order-restricted maximum likelihood (ML) estimates are proposed and their reliability under different degrees of association is investigated by…

Descriptors: Mathematics, Test Reliability, Computation, Testing

Professor "X": How Experts Rated His Student Ratings.

Peer reviewed

Renner, Richard R.; Greenwood, Gordon E. – Assessment and Evaluation in Higher Education, 1985

Fictitious student evaluations of a faculty member's teaching performance are presented to the reader in an exercise in interpreting such information. Evaluator comments reveal a widespread divergence of views. (MSE)

Descriptors: College Faculty, Evaluation Criteria, Evaluation Methods, Higher Education

Simulation-Based Assessment of Managerial Competence: Reliability and Validity.

Peer reviewed

Streufert, Siegfried; And Others – Personnel Psychology, 1988

Evaluated quasi-experimental simulation technique designed to measure impact of individual differences in managerial styles on executive performance. Tested 20 simulation-based measures for reliability and validity. Data from two samples suggest that this quasi-experimental simulation technology may be useful in assessing managerial styles not…

Descriptors: Administrator Qualifications, Competence, Evaluation Methods, Individual Differences

Exact Distributions of Intraclass Correlation and Cronbach's Alpha with Gaussian Data and General Covariance

Peer reviewed

Direct link

Kistner, Emily O.; Muller, Keith E. – Psychometrika, 2004

Intraclass correlation and Cronbach's alpha are widely used to describe reliability of tests and measurements. Even with Gaussian data, exact distributions are known only for compound symmetric covariance (equal variances and equal correlations). Recently, large sample Gaussian approximations were derived for the distribution functions. New exact…

Descriptors: Correlation, Test Reliability, Test Results, Probability

Previous Page | Next Page »

Pages: 1 | 2

Atkins, David C.	1
Beauchaine, Theodore P.	1
Bedics, Jamie D.	1
Bengston, John K.	1
Boughton, Keith A.	1
Brauer, J.	1
Brown, James Dean	1
Butler, Roger	1
Christ, Theodore J.	1
Clark, Amy K.	1
Cohen, Stuart J.	1
Cristina Semaca	1
Curran, Vernon R.	1
Delphine Franco	1
DiazGranados, Deborah	1
Dorans, Neil J.	1
Duke, Pauline	1
Eaton, William H.	1
Feldman, Moshe	1
Galindo-Garre, Francisca	1
Gerta Rücker	1
Gierl, Mark J.	1
Gorbunova, Tatiana N.	1
Gorter, S.	1
More ▼