ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	33
Since 2006 (last 20 years)	65

Descriptor

Simulation	122
Test Reliability	122
Test Validity	34
Test Items	31
Item Response Theory	26
Computer Assisted Testing	22
Scores	22
Evaluation Methods	21
Error of Measurement	20
Comparative Analysis	19
Statistical Analysis	19
Test Construction	18
Correlation	16
Item Analysis	16
Psychometrics	16
Adaptive Testing	15
Computation	13
Higher Education	13
Measurement Techniques	13
Difficulty Level	12
Test Bias	12
Accuracy	11
Mathematical Models	11
Models	11
Factor Analysis	10
More ▼

Publication Type

Journal Articles	81
Reports - Research	78
Reports - Evaluative	17
Reports - Descriptive	12
Speeches/Meeting Papers	10
Dissertations/Theses -…	3
Tests/Questionnaires	3
Numerical/Quantitative Data	2
Collected Works - General	1
Collected Works - Proceedings	1
Guides - Non-Classroom	1
Information Analyses	1
Reference Materials -…	1
Reports - General	1
More ▼

Education Level

Higher Education	11
Postsecondary Education	9
Elementary Secondary Education	4
Secondary Education	4
Adult Education	3
Junior High Schools	3
Middle Schools	3
Elementary Education	2
Grade 8	2
High Schools	2
Early Childhood Education	1
Grade 2	1
Grade 3	1
Grade 9	1
Primary Education	1
More ▼

Audience

Practitioners	3
Teachers	2
Administrators	1
Researchers	1

Location

Australia	1
Canada	1
Germany	1
Indonesia	1
Iran	1
Italy	1
North America	1
Russia	1
Sweden	1
Taiwan	1
United Kingdom	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Armed Forces Qualification…	1
Early Childhood Longitudinal…	1
Graduate Record Examinations	1
National Survey of Student…	1
Pennsylvania Educational…	1
Stanford Binet Intelligence…	1
Trends in International…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 122 results Save | Export

"LFK" Index Does Not Reliably Detect Small-Study Effects in Meta-Analysis: A Simulation Study

Peer reviewed

Direct link

Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024

The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…

Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods

The Psychometric Quality of Objective Structured Clinical Examinations within Psychology Programs: A Systematic Review

Peer reviewed

Direct link

Azaan Vhora; Ryan L. Davies; Kylie Rice – Psychology Learning and Teaching, 2024

Background: Objective Structured Clinical Examinations (OSCEs) are a simulation-based assessment tool used extensively in medical education for evaluating clinical competence. OSCEs are widely regarded as more valid, reliable, and valuable compared to traditional assessment measures, and are now emerging within professional psychology training…

Descriptors: Psychology, Higher Education, Psychometrics, Objective Tests

Using Simulated Retests to Estimate the Reliability of Diagnostic Assessment Systems

Peer reviewed

Direct link

Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023

As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…

Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy

KR20 and KR21 for Some Nondichotomous Data (It's Not Just Cronbach's Alpha)

Peer reviewed

Direct link

Foster, Robert C. – Educational and Psychological Measurement, 2021

This article presents some equivalent forms of the common Kuder-Richardson Formula 21 and 20 estimators for nondichotomous data belonging to certain other exponential families, such as Poisson count data, exponential data, or geometric counts of trials until failure. Using the generalized framework of Foster (2020), an equation for the reliability…

Descriptors: Test Reliability, Data, Computation, Mathematical Formulas

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Direct link

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

Capturing Competence: The Design, Evaluation, and Implementation of a Video-Based Instrument for Assessing Verbal Aggression Management Competence

Peer reviewed

Direct link

Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025

Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments are limitedly available. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…

Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Direct link

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Direct link

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest--posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation

Examining of Internal Consistency Coefficients in Mixed-Format Tests in Different Simulation Conditions

Peer reviewed
PDF on ERIC

Download full text

Gurdil Ege, Hatice; Demir, Ergul – Eurasian Journal of Educational Research, 2020

Purpose: The present study aims to evaluate how the reliabilities computed using a, Stratified a, Angoff-Feldt, and Feldt-Raju estimators may differ when sample size (500, 1000, and 2000) and item type ratio of dichotomous to polytomous items (2:1; 1:1, 1:2) included in the scale are varied. Research Methods: In this study, Cronbach's a,…

Descriptors: Test Format, Simulation, Test Reliability, Sample Size

Short-Term Test-Retest Reliability of Contralateral Suppression of Click-Evoked Otoacoustic Emissions in Normal-Hearing Subjects

Peer reviewed

Direct link

Keppler, Hannah; Degeest, Sofie; Vinck, Bart – Journal of Speech, Language, and Hearing Research, 2021

Purpose: The objective of the current study was to investigate the short-term test-retest reliability of contralateral suppression (CS) of click-evoked otoacoustic emissions (CEOAEs) using commercially available otoacoustic emission equipment. Method: Twenty-three young normal-hearing subjects were tested. An otoscopic evaluation, admittance…

Descriptors: Test Reliability, Hearing (Physiology), Acoustics, Auditory Tests

A Data-Based Simulation Study of Reliability for an Adaptive Assessment Based on Knowledge Space Theory

Peer reviewed

Direct link

Doble, Christopher; Matayoshi, Jeffrey; Cosyn, Eric; Uzun, Hasan; Karami, Arash – International Journal of Artificial Intelligence in Education, 2019

A large-scale simulation study of the assessment effectiveness of a particular instantiation of knowledge space theory is described. In this study, data from more than 700,000 actual assessments in mathematics using the ALEKS (Assessment and LEarning in Knowledge Spaces) software were used to determine response probabilities for the same number of…

Descriptors: Test Reliability, Adaptive Testing, Mathematics Tests, Computer Assisted Testing

The Impact of Aberrant Response on Reliability and Validity

Peer reviewed

Direct link

Liu, Tour; Sun, Yicong; Li, Zhen; Xin, Tao – Measurement: Interdisciplinary Research and Perspectives, 2019

Aberrant response has an important impact on item parameter estimation, individuals' evaluation, and other statistical analysis. There are various types of aberrant response behaviors in educational and psychological tests, like sleeping, guessing, and plodding. Random response is the most common one. The purpose of this research was to clarify…

Descriptors: Test Reliability, Test Validity, Item Response Theory, Differences

Using Existing Data to Inform Development of New Item Types. Research Report. ETS RR-20-01

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020

With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…

Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Systematic Comparison of Decision Accuracy of Complex Compensatory Decision Rules Combining Multiple Tests in a Higher Education Context

Peer reviewed

Direct link

Yocarini, Iris E.; Bouwmeester, Samantha; Smeets, Guus; Arends, Lidia R. – Educational Measurement: Issues and Practice, 2018

This real-data-guided simulation study systematically evaluated the decision accuracy of complex decision rules combining multiple tests within different realistic curricula. Specifically, complex decision rules combining conjunctive aspects and compensatory aspects were evaluated. A conjunctive aspect requires a minimum level of performance,…

Descriptors: Comparative Analysis, Decision Making, Accuracy, Higher Education

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Educational and Psychological…	8
Journal of Educational…	8
Psychometrika	7
ETS Research Report Series	6
Applied Psychological…	4
Applied Measurement in…	3
Journal of Educational and…	3
Measurement:…	3
ProQuest LLC	3
Academic Medicine	2
Educational Measurement:…	2
Psychological Methods	2
School Psychology Quarterly	2
Advances in Health Sciences…	1
American Journal of…	1
Applied Environmental…	1
Assessment & Evaluation in…	1
Assessment and Evaluation in…	1
Center for Education Data &…	1
EURASIA Journal of…	1
Education and Information…	1
Educational Research and…	1
Educational Sciences: Theory…	1
Eurasian Journal of…	1
European Journal of…	1
More ▼

Cliff, Norman	3
Segall, Daniel O.	3
Betz, Nancy E.	2
Edwards, Michael C.	2
Guo, Hongwen	2
Sijtsma, Klaas	2
Van Norman, Ethan R.	2
Wang, Wen-Chung	2
Weiss, David J.	2
Xin, Tao	2
Yao, Lihua	2
Afsoon Hassani Mehraban	1
Akram Azad	1
Algina, James	1
Andersson, Björn	1
Arends, Lidia R.	1
Asilkalkan, Abdullah	1
Atkins, David C.	1
Attali, Yigal	1
Azaan Vhora	1
Bacciu, Anna	1
Bates, Simon P.	1
Beauchaine, Theodore P.	1
Bedics, Jamie D.	1
More ▼